[v2,2/2] btrfs: extent-tree: Ensure we trim ranges across block group boundary

[BUG]
When deleting large files (which cross block group boundary) with discard
mount option, we find some btrfs_discard_extent() calls only trimmed part
of its space, not the whole range:

  btrfs_discard_extent: type=0x1 start=19626196992 len=2144530432 trimmed=1073741824 ratio=50%

type:		bbio->map_type, in above case, it's SINGLE DATA.
start:		Logical address of this trim
len:		Logical length of this trim
trimmed:	Physically trimmed bytes
ratio:		trimmed / len

Thus leading some unused space not discarded.

[CAUSE]
When discard mount option is specified, after a transaction is fully
committed (super block written to disk), we begin to cleanup pinned
extents in the following call chain:

btrfs_commit_transaction()
|- write_all_supers()
|- btrfs_finish_extent_commit()
   |- find_first_extent_bit(unpin, 0, &start, &end, EXTENT_DIRTY);
   |- btrfs_discard_extent()

However pinned extents are recorded in an extent_io_tree, which can
merge adjacent extent states.

When a large file get deleted and it has adjacent file extents across
block group boundary, we will get a large merged range.

Then when we pass the large range into btrfs_discard_extent(),
btrfs_discard_extent() will just trim the first part, without trimming
the remaining part.

Furthermore, this bug is not that reliably observed, as if the whole
block group is empty, there will be another trim for that block group.

So the most obvious way to find this missing trim needs to delete large
extents at block group boundary without empting involved block groups.

[FIX]
- Allow __btrfs_map_block_for_discard() to modify @length parameter
  btrfs_map_block() uses its @length paramter to notify the caller how
  many bytes are mapped in current call.
  With __btrfs_map_block_for_discard() also modifing the @length,
  btrfs_discard_extent() now understands if it needs to do next trim.

- Call btrfs_map_block() in a loop until we hit the range end
  Since we now know how many bytes are mapped each time, we can iterate
  through each block group boundary and issue correct trim for each
  range.

Signed-off-by: Qu Wenruo <wqu@suse.com>
---
 fs/btrfs/extent-tree.c | 40 ++++++++++++++++++++++++++++++----------
 fs/btrfs/volumes.c     |  6 ++++--
 2 files changed, 34 insertions(+), 12 deletions(-)

Message ID	20191023125648.30840-3-wqu@suse.com (mailing list archive)
State	New, archived
Headers	show Return-Path: <SRS0=avBk=YQ=vger.kernel.org=linux-btrfs-owner@kernel.org> Received: from mail.kernel.org (pdx-korg-mail-1.web.codeaurora.org [172.30.200.123]) by pdx-korg-patchwork-2.web.codeaurora.org (Postfix) with ESMTP id B8ADB112B for <patchwork-linux-btrfs@patchwork.kernel.org>; Wed, 23 Oct 2019 12:56:59 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id A0EBB2173B for <patchwork-linux-btrfs@patchwork.kernel.org>; Wed, 23 Oct 2019 12:56:59 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2404261AbfJWM46 (ORCPT <rfc822;patchwork-linux-btrfs@patchwork.kernel.org>); Wed, 23 Oct 2019 08:56:58 -0400 Received: from mx2.suse.de ([195.135.220.15]:51090 "EHLO mx1.suse.de" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S2404165AbfJWM46 (ORCPT <rfc822;linux-btrfs@vger.kernel.org>); Wed, 23 Oct 2019 08:56:58 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (unknown [195.135.220.254]) by mx1.suse.de (Postfix) with ESMTP id 58787B74A for <linux-btrfs@vger.kernel.org>; Wed, 23 Oct 2019 12:56:56 +0000 (UTC) From: Qu Wenruo <wqu@suse.com> To: linux-btrfs@vger.kernel.org Subject: [PATCH v2 2/2] btrfs: extent-tree: Ensure we trim ranges across block group boundary Date: Wed, 23 Oct 2019 20:56:48 +0800 Message-Id: <20191023125648.30840-3-wqu@suse.com> X-Mailer: git-send-email 2.23.0 In-Reply-To: <20191023125648.30840-1-wqu@suse.com> References: <20191023125648.30840-1-wqu@suse.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Sender: linux-btrfs-owner@vger.kernel.org Precedence: bulk List-ID: <linux-btrfs.vger.kernel.org> X-Mailing-List: linux-btrfs@vger.kernel.org
Series	btrfs: trim: Fix a bug certain range may not be trimmed properly \| expand [v2,0/2] btrfs: trim: Fix a bug certain range may not be trimmed properly [v2,1/2] btrfs: volumes: Use more straightforward way to calculate map length [v2,2/2] btrfs: extent-tree: Ensure we trim ranges across block group boundary

[v2,2/2] btrfs: extent-tree: Ensure we trim ranges across block group boundary

Commit Message

Comments

Patch