btrfs: make read time repair to be in sectorsize unit

Currently btrfs_submit_read_repair() will try to repair the read range
from the bvec.

It works fine for the regular sectorsize case, as each bvec covers just
one sector, so we can do per-sector repair without any problem.

But for the subpage case it gets complex, as one bvec can cover several
sectors.

Previously we hacked btrfs_io_needs_validation() for subpage so that we
could submit the whole bvec range for subpage.
This behavior coarsens the repair granularity, and makes subpage unable
to repair the following file layout:
                0       4K      8K
  Mirror 1      |xxxxxxx|       |
  Mirror 2      |       |xxxxxxx|

For the subpage case we submit one bvec which covers 2 sectors. If any
csum mismatch happens, the whole bvec is considered corrupted, so in
the above case both copies are considered corrupted.
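
The failure mode above can be modelled in a few lines of userspace C.
This is only an illustrative sketch, not kernel code: mirror_bad[],
repairable_per_bvec() and repairable_per_sector() are hypothetical
names, and each mirror is reduced to a bitmask where bit i set means
sector i fails its csum on that mirror.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define NR_MIRRORS 2

/* Hypothetical model of the layout above: bit i set = sector i is bad. */
static const uint32_t mirror_bad[NR_MIRRORS] = {
	0x1,	/* Mirror 1: sector at [0, 4K) is corrupted */
	0x2,	/* Mirror 2: sector at [4K, 8K) is corrupted */
};

/* bvec granularity: one single mirror must be clean for the whole range. */
static bool repairable_per_bvec(unsigned int nr_sectors)
{
	uint32_t range_mask;

	assert(nr_sectors > 0 && nr_sectors < 32);
	range_mask = (1U << nr_sectors) - 1;

	for (int m = 0; m < NR_MIRRORS; m++) {
		if (!(mirror_bad[m] & range_mask))
			return true;
	}
	return false;
}

/* Sector granularity: each sector only needs a good copy on some mirror. */
static bool repairable_per_sector(unsigned int nr_sectors)
{
	assert(nr_sectors > 0 && nr_sectors < 32);

	for (unsigned int i = 0; i < nr_sectors; i++) {
		bool found_good = false;

		for (int m = 0; m < NR_MIRRORS; m++) {
			if (!(mirror_bad[m] & (1U << i)))
				found_good = true;
		}
		if (!found_good)
			return false;
	}
	return true;
}
```

For the 2-sector bvec in the diagram, repairable_per_bvec(2) is false
because neither mirror is clean across the whole range, while
repairable_per_sector(2) is true because each sector has one good copy.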

This patch will fix this problem by only submitting the repair for the
corrupted sector(s).

This patch will:
- Introduce repair_one_sector()
  The main code submitting repair, which is more or less the same as the
  old btrfs_submit_read_repair().
  But this time, it only repairs one sector.

- Make btrfs_verify_data_csum() return an error bitmap
  So that the new btrfs_submit_read_repair() knows exactly which
  sector(s) need repair.

- Make btrfs_submit_read_repair() handle sectors differently
  For sectors without csum error, just release them like we do
  in end_bio_extent_readpage().
  In this context we don't have the process_extent structure, though,
  so we have to do extent tree operations sector by sector.
  This is slower, but since it only happens in the csum mismatch path,
  it should be fine.

  For sectors with csum error, we submit repair for each sector.

- Remove btrfs_io_needs_validation() and its callers
  In end_bio_extent_readpage(), we already have an ASSERT() to make sure
  no bio passed in is cloned, thus the "bio_flagged(bio, BIO_CLONED)"
  branch never gets executed.

  As for the bvec check, since we now only submit repair one sector at a
  time, it will always return false anyway.

  Thus we're safe to remove this function and its callers.
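
The per-sector handling described in the list above could look roughly
like the following userspace sketch. It is not the code from this patch:
struct repair_stats, release_good_sector(), repair_one_sector_sketch()
and submit_read_repair_sketch() are hypothetical stand-ins, a 4K
sectorsize is assumed, and bit i of the error bitmap is assumed to mean
"sector i of the bvec failed its csum".

```c
#include <assert.h>
#include <stdint.h>

#define SECTORSIZE 4096U	/* assumed sectorsize for this sketch */

/* Hypothetical bookkeeping; it only records what would have happened. */
struct repair_stats {
	unsigned int released;		/* good sectors released */
	unsigned int repaired;		/* bad sectors submitted for repair */
	uint64_t last_repair_offset;	/* offset of the last repaired sector */
};

/* Stand-in for releasing a sector that passed csum verification. */
static void release_good_sector(struct repair_stats *s, uint64_t offset)
{
	(void)offset;
	s->released++;
}

/* Stand-in for submitting the repair of exactly one sector. */
static void repair_one_sector_sketch(struct repair_stats *s, uint64_t offset)
{
	s->repaired++;
	s->last_repair_offset = offset;
}

/* Walk the bvec sector by sector, acting on each error bitmap bit. */
static void submit_read_repair_sketch(struct repair_stats *s, uint64_t start,
				      unsigned int nr_sectors,
				      uint32_t error_bitmap)
{
	assert(nr_sectors <= 32);

	for (unsigned int i = 0; i < nr_sectors; i++) {
		uint64_t offset = start + (uint64_t)i * SECTORSIZE;

		if (error_bitmap & (1U << i))
			repair_one_sector_sketch(s, offset);
		else
			release_good_sector(s, offset);
	}
}
```

With a 2-sector bvec starting at 0 and an error bitmap of 0x2, the
sketch releases the sector at 0 and submits a repair only for the
sector at 4K, so the good sector is never re-read from another mirror.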

With this modification, both regular sectorsize and subpage can handle
repair without problem.

Signed-off-by: Qu Wenruo <wqu@suse.com>
---
Although this patch is more suitable for the subpage patchset, it
works fine for both sectorsize cases.

Furthermore, considering how much code is modified, I'd prefer this
patch be reviewed outside of the subpage patchset.
---
 fs/btrfs/extent_io.c | 229 +++++++++++++++++++------------------------
 fs/btrfs/extent_io.h |   1 +
 fs/btrfs/inode.c     |  14 ++-
 3 files changed, 115 insertions(+), 129 deletions(-)

Message ID	20210429075617.213770-1-wqu@suse.com (mailing list archive)
State	New, archived
From: Qu Wenruo <wqu@suse.com>
To: linux-btrfs@vger.kernel.org
Subject: [PATCH] btrfs: make read time repair to be in sectorsize unit
Date: Thu, 29 Apr 2021 15:56:17 +0800
Series	btrfs: make read time repair to be in sectorsize unit
