[v3,01/54] btrfs: fix error handling in commit_fs_roots

Message ID	502d2273052e95e19366d785ee85e542e86fe61e.1606938211.git.josef@toxicpanda.com (mailing list archive)
State	New, archived
Headers	show Return-Path: <linux-btrfs-owner@kernel.org> From: Josef Bacik <josef@toxicpanda.com> To: linux-btrfs@vger.kernel.org, kernel-team@fb.com Subject: [PATCH v3 01/54] btrfs: fix error handling in commit_fs_roots Date: Wed, 2 Dec 2020 14:50:19 -0500 Message-Id: <502d2273052e95e19366d785ee85e542e86fe61e.1606938211.git.josef@toxicpanda.com> In-Reply-To: <cover.1606938211.git.josef@toxicpanda.com> References: <cover.1606938211.git.josef@toxicpanda.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Precedence: bulk
Series	Cleanup error handling in relocation \| expand [v3,00/54] Cleanup error handling in relocation [v3,01/54] btrfs: fix error handling in commit_fs_roots [v3,02/54] btrfs: allow error injection for btrfs_search_slot and btrfs_cow_block [v3,03/54] btrfs: fix lockdep splat in btrfs_recover_relocation [v3,04/54] btrfs: keep track of the root owner for relocation reads [v3,05/54] btrfs: noinline btrfs_should_cancel_balance [v3,06/54] btrfs: do not cleanup upper nodes in btrfs_backref_cleanup_node [v3,07/54] btrfs: pass down the tree block level through ref-verify [v3,08/54] btrfs: make sure owner is set in ref-verify [v3,09/54] btrfs: don't clear ret in btrfs_start_dirty_block_groups [v3,10/54] btrfs: convert some BUG_ON()'s to ASSERT()'s in do_relocation [v3,11/54] btrfs: convert BUG_ON()'s in relocate_tree_block [v3,12/54] btrfs: return an error from btrfs_record_root_in_trans [v3,13/54] btrfs: handle errors from select_reloc_root() [v3,14/54] btrfs: convert BUG_ON()'s in select_reloc_root() to proper errors [v3,15/54] btrfs: check record_root_in_trans related failures in select_reloc_root [v3,16/54] btrfs: do proper error handling in record_reloc_root_in_trans [v3,17/54] btrfs: handle btrfs_record_root_in_trans failure in btrfs_rename_exchange [v3,18/54] btrfs: handle btrfs_record_root_in_trans failure in btrfs_rename [v3,19/54] btrfs: handle btrfs_record_root_in_trans failure in btrfs_delete_subvolume [v3,20/54] btrfs: handle btrfs_record_root_in_trans failure in btrfs_recover_log_trees [v3,21/54] btrfs: handle btrfs_record_root_in_trans failure in create_subvol [v3,22/54] btrfs: btrfs: handle btrfs_record_root_in_trans failure in relocate_tree_block [v3,23/54] btrfs: handle btrfs_record_root_in_trans failure in start_transaction [v3,24/54] btrfs: handle record_root_in_trans failure in qgroup_account_snapshot [v3,25/54] btrfs: handle record_root_in_trans failure in btrfs_record_root_in_trans [v3,26/54] btrfs: handle record_root_in_trans failure in create_pending_snapshot [v3,27/54] btrfs: do not panic in __add_reloc_root [v3,28/54] btrfs: have proper error handling in btrfs_init_reloc_root [v3,29/54] btrfs: do proper error handling in create_reloc_root [v3,30/54] btrfs: validate ->reloc_root after recording root in trans [v3,31/54] btrfs: handle btrfs_update_reloc_root failure in commit_fs_roots [v3,32/54] btrfs: change insert_dirty_subvol to return errors [v3,33/54] btrfs: handle btrfs_update_reloc_root failure in insert_dirty_subvol [v3,34/54] btrfs: handle btrfs_update_reloc_root failure in prepare_to_merge [v3,35/54] btrfs: do proper error handling in btrfs_update_reloc_root [v3,36/54] btrfs: convert logic BUG_ON()'s in replace_path to ASSERT()'s [v3,37/54] btrfs: handle initial btrfs_cow_block error in replace_path [v3,38/54] btrfs: handle the loop btrfs_cow_block error in replace_path [v3,39/54] btrfs: handle btrfs_search_slot failure in replace_path [v3,40/54] btrfs: handle errors in reference count manipulation in replace_path [v3,41/54] btrfs: handle extent reference errors in do_relocation [v3,42/54] btrfs: check for BTRFS_BLOCK_FLAG_FULL_BACKREF being set improperly [v3,43/54] btrfs: remove the extent item sanity checks in relocate_block_group [v3,44/54] btrfs: do proper error handling in create_reloc_inode [v3,45/54] btrfs: handle __add_reloc_root failure in btrfs_recover_relocation [v3,46/54] btrfs: handle __add_reloc_root failure in btrfs_reloc_post_snapshot [v3,47/54] btrfs: cleanup error handling in prepare_to_merge [v3,48/54] btrfs: handle extent corruption with select_one_root properly [v3,49/54] btrfs: do proper error handling in merge_reloc_roots [v3,50/54] btrfs: check return value of btrfs_commit_transaction in relocation [v3,51/54] btrfs: do not WARN_ON() if we can't find the reloc root [v3,52/54] btrfs: print the actual offset in btrfs_root_name [v3,53/54] btrfs: fix reloc root leak with 0 ref reloc roots on recovery [v3,54/54] btrfs: splice remaining dirty_bg's onto the transaction dirty bg list

Message ID

502d2273052e95e19366d785ee85e542e86fe61e.1606938211.git.josef@toxicpanda.com (mailing list archive)

State

New, archived

Headers

From: Josef Bacik <josef@toxicpanda.com>
To: linux-btrfs@vger.kernel.org, kernel-team@fb.com
Subject: [PATCH v3 01/54] btrfs: fix error handling in commit_fs_roots
Date: Wed,  2 Dec 2020 14:50:19 -0500
Message-Id: 
 <502d2273052e95e19366d785ee85e542e86fe61e.1606938211.git.josef@toxicpanda.com>
In-Reply-To: <cover.1606938211.git.josef@toxicpanda.com>
References: <cover.1606938211.git.josef@toxicpanda.com>
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
Precedence: bulk

Series

Cleanup error handling in relocation | expand

Commit Message

Josef Bacik Dec. 2, 2020, 7:50 p.m. UTC

While doing error injection I would sometimes get a corrupt file system.
This is because I was injecting errors at btrfs_search_slot, but would
only do it one time per stack.  This uncovered a problem in
commit_fs_roots, where if we get an error we would just break.  However
we're in a nested loop, the main loop being a loop to find all the dirty
fs roots, and then subsequent root updates would succeed clearing the
error value.

This isn't likely to happen in real scenarios, however we could
potentially get a random ENOMEM once and then not again, and we'd end up
with a corrupted file system.  Fix this by moving the error checking
around a bit to the nested loop, as this is the only place where
something will fail, and return the error as soon as it occurs.

With this patch my reproducer no longer corrupts the file system.

Signed-off-by: Josef Bacik <josef@toxicpanda.com>
---
 fs/btrfs/transaction.c | 9 +++++----
 1 file changed, 5 insertions(+), 4 deletions(-)

Comments

Qu Wenruo Dec. 3, 2020, 1:45 a.m. UTC | #1

On 2020/12/3 上午3:50, Josef Bacik wrote:
> While doing error injection I would sometimes get a corrupt file system.
> This is because I was injecting errors at btrfs_search_slot, but would
> only do it one time per stack.  This uncovered a problem in
> commit_fs_roots, where if we get an error we would just break.  However
> we're in a nested loop, the main loop being a loop to find all the dirty
> fs roots, and then subsequent root updates would succeed clearing the
> error value.
> 
> This isn't likely to happen in real scenarios, however we could
> potentially get a random ENOMEM once and then not again, and we'd end up
> with a corrupted file system.  Fix this by moving the error checking
> around a bit to the nested loop, as this is the only place where
> something will fail, and return the error as soon as it occurs.
> 
> With this patch my reproducer no longer corrupts the file system.
> 
> Signed-off-by: Josef Bacik <josef@toxicpanda.com>

Reviewed-by: Qu Wenruo <wqu@suse.com>

Yep, that err can be overwritten by next loop, so definitely a problem.

Thanks,
Qu
> ---
>  fs/btrfs/transaction.c | 9 +++++----
>  1 file changed, 5 insertions(+), 4 deletions(-)
> 
> diff --git a/fs/btrfs/transaction.c b/fs/btrfs/transaction.c
> index 8e0f7a1029c6..a614f7699ce4 100644
> --- a/fs/btrfs/transaction.c
> +++ b/fs/btrfs/transaction.c
> @@ -1319,7 +1319,6 @@ static noinline int commit_fs_roots(struct btrfs_trans_handle *trans)
>  	struct btrfs_root *gang[8];
>  	int i;
>  	int ret;
> -	int err = 0;
>  
>  	spin_lock(&fs_info->fs_roots_radix_lock);
>  	while (1) {
> @@ -1331,6 +1330,8 @@ static noinline int commit_fs_roots(struct btrfs_trans_handle *trans)
>  			break;
>  		for (i = 0; i < ret; i++) {
>  			struct btrfs_root *root = gang[i];
> +			int err;
> +
>  			radix_tree_tag_clear(&fs_info->fs_roots_radix,
>  					(unsigned long)root->root_key.objectid,
>  					BTRFS_ROOT_TRANS_TAG);
> @@ -1353,14 +1354,14 @@ static noinline int commit_fs_roots(struct btrfs_trans_handle *trans)
>  			err = btrfs_update_root(trans, fs_info->tree_root,
>  						&root->root_key,
>  						&root->root_item);
> -			spin_lock(&fs_info->fs_roots_radix_lock);
>  			if (err)
> -				break;
> +				return err;
> +			spin_lock(&fs_info->fs_roots_radix_lock);
>  			btrfs_qgroup_free_meta_all_pertrans(root);
>  		}
>  	}
>  	spin_unlock(&fs_info->fs_roots_radix_lock);
> -	return err;
> +	return 0;
>  }
>  
>  /*
>

Johannes Thumshirn Dec. 3, 2020, 8:09 a.m. UTC | #2

On 02/12/2020 20:54, Josef Bacik wrote:
> While doing error injection I would sometimes get a corrupt file system.
> This is because I was injecting errors at btrfs_search_slot, but would
> only do it one time per stack.  This uncovered a problem in
> commit_fs_roots, where if we get an error we would just break.  However
> we're in a nested loop, the main loop being a loop to find all the dirty
> fs roots, and then subsequent root updates would succeed clearing the
> error value.
> 
> This isn't likely to happen in real scenarios, however we could
> potentially get a random ENOMEM once and then not again, and we'd end up
> with a corrupted file system.  Fix this by moving the error checking
> around a bit to the nested loop, as this is the only place where
> something will fail, and return the error as soon as it occurs.
> 
> With this patch my reproducer no longer corrupts the file system.

Better to abort the transaction than to corrupt the FS,
Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com>

diff --git a/fs/btrfs/transaction.c b/fs/btrfs/transaction.c
index 8e0f7a1029c6..a614f7699ce4 100644
--- a/fs/btrfs/transaction.c
+++ b/fs/btrfs/transaction.c
@@ -1319,7 +1319,6 @@  static noinline int commit_fs_roots(struct btrfs_trans_handle *trans)
 	struct btrfs_root *gang[8];
 	int i;
 	int ret;
-	int err = 0;
 
 	spin_lock(&fs_info->fs_roots_radix_lock);
 	while (1) {
@@ -1331,6 +1330,8 @@  static noinline int commit_fs_roots(struct btrfs_trans_handle *trans)
 			break;
 		for (i = 0; i < ret; i++) {
 			struct btrfs_root *root = gang[i];
+			int err;
+
 			radix_tree_tag_clear(&fs_info->fs_roots_radix,
 					(unsigned long)root->root_key.objectid,
 					BTRFS_ROOT_TRANS_TAG);
@@ -1353,14 +1354,14 @@  static noinline int commit_fs_roots(struct btrfs_trans_handle *trans)
 			err = btrfs_update_root(trans, fs_info->tree_root,
 						&root->root_key,
 						&root->root_item);
-			spin_lock(&fs_info->fs_roots_radix_lock);
 			if (err)
-				break;
+				return err;
+			spin_lock(&fs_info->fs_roots_radix_lock);
 			btrfs_qgroup_free_meta_all_pertrans(root);
 		}
 	}
 	spin_unlock(&fs_info->fs_roots_radix_lock);
-	return err;
+	return 0;
 }
 
 /*

[v3,01/54] btrfs: fix error handling in commit_fs_roots

Commit Message

Comments

Patch