Message ID | 171988118687.2007921.1260012940783338117.stgit@frogsfrogsfrogs
---|---
State | Superseded
Series | [1/7] libfrog: hoist free space histogram code
On Mon, Jul 01, 2024 at 06:04:41PM -0700, Darrick J. Wong wrote:
> Add a new -o fstrim_pct= option to xfs_scrub just in case there are
> users out there who want a different percentage.  For example, accepting
> a 95% trim would net us a speed increase of nearly two orders of
> magnitude, ignoring system call overhead.  Setting it to 100% will trim
> everything, just like fstrim(8).

It might also make sense to default the parameter to the
discard_granularity reported in sysfs.  While a lot of devices don't
report a useful value there, it avoids pointless work for those that do.

Otherwise this looks good:

Reviewed-by: Christoph Hellwig <hch@lst.de>
On Tue, Jul 02, 2024 at 07:36:27AM +0200, Christoph Hellwig wrote:
> On Mon, Jul 01, 2024 at 06:04:41PM -0700, Darrick J. Wong wrote:
> > Add a new -o fstrim_pct= option to xfs_scrub just in case there are
> > users out there who want a different percentage.  For example, accepting
> > a 95% trim would net us a speed increase of nearly two orders of
> > magnitude, ignoring system call overhead.  Setting it to 100% will trim
> > everything, just like fstrim(8).
>
> It might also make sense to default the parameter to the
> discard_granularity reported in sysfs.  While a lot of devices don't
> report a useful value there, it avoids pointless work for those that do.

Oooh, that's a good idea.  Let me fiddle with that & tack it on the end?

Hmm.  How do we query the discard granularity from a userspace program?
I can try to guess the /sys/block/XXX root from the devices passed in,
or maybe libblkid will tell me?  And then I'd have to go open
queue/discard_granularity underneath that to read that.

That's going to take a day or so, I suspect. :/

> Otherwise this looks good:
>
> Reviewed-by: Christoph Hellwig <hch@lst.de>

Thanks!

--D
On Tue, Jul 02, 2024 at 07:29:14PM -0700, Darrick J. Wong wrote:
> Oooh, that's a good idea.  Let me fiddle with that & tack it on the end?
>
> Hmm.  How do we query the discard granularity from a userspace program?
> I can try to guess the /sys/block/XXX root from the devices passed in,
> or maybe libblkid will tell me?  And then I'd have to go open
> queue/discard_granularity underneath that to read that.

Good question.  As far as I can tell there is no simple ioctl for it.
I really wonder if we need an extensible block topology ioctl that we
can keep adding fields for new queue limits, to make it easy to query
them from userspace without all that sysfs mess.

> That's going to take a day or so, I suspect. :/

No rush, just noticed it.  Note that for the discard granularity we
should also look at the alignment and not just the size.
On Wed, Jul 03, 2024 at 06:29:22AM +0200, Christoph Hellwig wrote:
> On Tue, Jul 02, 2024 at 07:29:14PM -0700, Darrick J. Wong wrote:
> > Oooh, that's a good idea.  Let me fiddle with that & tack it on the end?
> >
> > Hmm.  How do we query the discard granularity from a userspace program?
> > I can try to guess the /sys/block/XXX root from the devices passed in,
> > or maybe libblkid will tell me?  And then I'd have to go open
> > queue/discard_granularity underneath that to read that.
>
> Good question.  As far as I can tell there is no simple ioctl for it.
> I really wonder if we need an extensible block topology ioctl that we
> can keep adding fields for new queue limits, to make it easy to query
> them from userspace without all that sysfs mess.

Yeah.  Or implement FS_IOC_GETSYSFSPATH for block devices? :P

> > That's going to take a day or so, I suspect. :/
>
> No rush, just noticed it.  Note that for the discard granularity
> we should also look at the alignment and not just the size.

<nod> AFAICT the xfs discard code doesn't check the alignment.  Maybe
the block layer does, but... weirdly we've now refactored
xfs_discard_extents so that it /never/ reports errors of any kind in the
submit loop because __blkdev_issue_discard always returns 0, and the
endio doesn't check the bio status either.

--D
On Tue, Jul 02, 2024 at 09:55:39PM -0700, Darrick J. Wong wrote:
> > Good question.  As far as I can tell there is no simple ioctl for it.
> > I really wonder if we need an extensible block topology ioctl that we
> > can keep adding fields for new queue limits, to make it easy to query
> > them from userspace without all that sysfs mess.
>
> Yeah.  Or implement FS_IOC_GETSYSFSPATH for block devices? :P

I know people like to fetishize file access (and to be honest from a
shell it is really nice), but from a C program would you rather do one
ioctl to find a sysfs base path, then do string manipulation to find the
actual attribute, then open + read + close it, or do a single ioctl and
read a bunch of values from a struct?

> > > That's going to take a day or so, I suspect. :/
> >
> > No rush, just noticed it.  Note that for the discard granularity
> > we should also look at the alignment and not just the size.
>
> <nod> AFAICT the xfs discard code doesn't check the alignment.  Maybe
> the block layer does, but...

The block layer checks the alignment and silently skips anything not
matching it.  So not adhering to it isn't an error, it might just cause
pointless work.
On Wed, Jul 03, 2024 at 06:58:12AM +0200, Christoph Hellwig wrote:
> On Tue, Jul 02, 2024 at 09:55:39PM -0700, Darrick J. Wong wrote:
> > > Good question.  As far as I can tell there is no simple ioctl for it.
> > > I really wonder if we need an extensible block topology ioctl that we
> > > can keep adding fields for new queue limits, to make it easy to query
> > > them from userspace without all that sysfs mess.
> >
> > Yeah.  Or implement FS_IOC_GETSYSFSPATH for block devices? :P
>
> I know people like to fetishize file access (and to be honest from a
> shell it is really nice), but from a C program would you rather do one
> ioctl to find a sysfs base path, then do string manipulation to find the
> actual attribute, then open + read + close it, or do a single ioctl and
> read a bunch of values from a struct?

Single ioctl and read from a struct.

Or single ioctl and read a bunch of json (LOL)

I wish the BLK* ioctls had kept pace with the spread of queue limits.

> > > > That's going to take a day or so, I suspect. :/
> > >
> > > No rush, just noticed it.  Note that for the discard granularity
> > > we should also look at the alignment and not just the size.
> >
> > <nod> AFAICT the xfs discard code doesn't check the alignment.  Maybe
> > the block layer does, but...
>
> The block layer checks the alignment and silently skips anything not
> matching it.  So not adhering to it isn't an error, it might just cause
> pointless work.

<nod> --D
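[As an aside, the sysfs approach discussed above can be sketched in a few lines of C.  This is a hedged illustration, not code from the thread: the function names are mine, and it assumes the caller already knows the whole-disk name — resolving an open fd back to that name (via /sys/dev/block and the dev_t) is exactly the "guess the /sys/block/XXX root" step Darrick mentions.]

```c
#include <stdio.h>
#include <stdint.h>

/*
 * Read one unsigned decimal value from a sysfs-style attribute file.
 * Return 0 on any failure, i.e. treat "can't read it" the same as
 * "device reports no useful value".
 */
uint64_t read_sysfs_u64(const char *path)
{
	unsigned long long	val = 0;
	FILE			*fp = fopen(path, "r");

	if (!fp)
		return 0;
	if (fscanf(fp, "%llu", &val) != 1)
		val = 0;
	fclose(fp);
	return (uint64_t)val;
}

/*
 * Discard granularity in bytes for a whole-disk name like "sda".
 * The disk-name parameter is the assumption here; mapping a device
 * fd back to this name is the hard part.
 */
uint64_t discard_granularity(const char *disk)
{
	char	path[256];

	snprintf(path, sizeof(path),
			"/sys/block/%s/queue/discard_granularity", disk);
	return read_sysfs_u64(path);
}
```

Note the "return 0 on failure" convention folds the "lots of devices don't report a useful value" case into a harmless default.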
On Tue, Jul 02, 2024 at 10:04:22PM -0700, Darrick J. Wong wrote:
> > I know people like to fetishize file access (and to be honest from a
> > shell it is really nice), but from a C program would you rather do one
> > ioctl to find a sysfs base path, then do string manipulation to find the
> > actual attribute, then open + read + close it, or do a single ioctl and
> > read a bunch of values from a struct?
>
> Single ioctl and read from a struct.
>
> Or single ioctl and read a bunch of json (LOL)
>
> I wish the BLK* ioctls had kept pace with the spread of queue limits.

Let me propose a new BLKLIMITS ioctl, and then we'll work from there?
On Wed, Jul 03, 2024 at 07:11:50AM +0200, Christoph Hellwig wrote:
> On Tue, Jul 02, 2024 at 10:04:22PM -0700, Darrick J. Wong wrote:
> > > I know people like to fetishize file access (and to be honest from a
> > > shell it is really nice), but from a C program would you rather do one
> > > ioctl to find a sysfs base path, then do string manipulation to find the
> > > actual attribute, then open + read + close it, or do a single ioctl and
> > > read a bunch of values from a struct?
> >
> > Single ioctl and read from a struct.
> >
> > Or single ioctl and read a bunch of json (LOL)
> >
> > I wish the BLK* ioctls had kept pace with the spread of queue limits.
>
> Let me propose a new BLKLIMITS ioctl, and then we'll work from there?

Ok! --D
diff --git a/libfrog/histogram.c b/libfrog/histogram.c
index 54e2bac0f731..61ecda16ffef 100644
--- a/libfrog/histogram.c
+++ b/libfrog/histogram.c
@@ -109,7 +109,7 @@ hist_free(
  * of small extents, e.g. 98% of the free space extents are larger than 31
  * blocks.
  */
-static int
+int
 hist_cdf(
 	const struct histogram	*hs,
 	struct histogram	*cdf)
diff --git a/libfrog/histogram.h b/libfrog/histogram.h
index ec788344d4c2..ecac66d240da 100644
--- a/libfrog/histogram.h
+++ b/libfrog/histogram.h
@@ -39,6 +39,7 @@ void hist_add(struct histogram *hs, long long len);
 void hist_init(struct histogram *hs);
 void hist_prepare(struct histogram *hs, long long maxlen);
 void hist_free(struct histogram *hs);
+int hist_cdf(const struct histogram *hs, struct histogram *cdf);
 void hist_print(const struct histogram *hs);
 void hist_summarize(const struct histogram *hs);
diff --git a/man/man8/xfs_scrub.8 b/man/man8/xfs_scrub.8
index 404baba696e1..b9f253e1b079 100644
--- a/man/man8/xfs_scrub.8
+++ b/man/man8/xfs_scrub.8
@@ -100,6 +100,22 @@
 The supported are:
 .RS 1.0i
 .TP
+.BI fstrim_pct= percentage
+To constrain the amount of time spent on fstrim activities during phase 8,
+this program tries to balance estimated runtime against completeness of the
+trim.
+In short, the program avoids small trim requests to save time.
+
+During phase 7, a log-scale histogram of free space extents is constructed.
+At the start of phase 8, a CDF is computed in decreasing order of extent
+length from the histogram buckets.
+A point corresponding to the fstrim percentage target is chosen from the CDF
+and mapped back to a histogram bucket.
+Free space extents at least as long as the bucket size are trimmed.
+Smaller extents are ignored.
+
+By default, the percentage threshold is 99%.
+.TP
 .BI iwarn
 Treat informational messages as warnings.
 This will result in a nonzero return code, and a higher logging level.
diff --git a/scrub/phase8.c b/scrub/phase8.c
index ae6d965c75e1..6bb9317afecc 100644
--- a/scrub/phase8.c
+++ b/scrub/phase8.c
@@ -11,6 +11,7 @@
 #include "list.h"
 #include "libfrog/paths.h"
 #include "libfrog/workqueue.h"
+#include "libfrog/histogram.h"
 #include "xfs_scrub.h"
 #include "common.h"
 #include "progress.h"
@@ -57,10 +58,12 @@ static int
 fstrim_fsblocks(
 	struct scrub_ctx	*ctx,
 	uint64_t		start_fsb,
-	uint64_t		fsbcount)
+	uint64_t		fsbcount,
+	uint64_t		minlen_fsb)
 {
 	uint64_t		start = cvt_off_fsb_to_b(&ctx->mnt, start_fsb);
 	uint64_t		len = cvt_off_fsb_to_b(&ctx->mnt, fsbcount);
+	uint64_t		minlen = cvt_off_fsb_to_b(&ctx->mnt, minlen_fsb);
 	int			error;
 
 	while (len > 0) {
@@ -68,7 +71,7 @@
 		run = min(len, FSTRIM_MAX_BYTES);
 
-		error = fstrim(ctx, start, run);
+		error = fstrim(ctx, start, run, minlen);
 		if (error == EOPNOTSUPP) {
 			/* Pretend we finished all the work. */
 			progress_add(len);
@@ -78,9 +81,10 @@
 			char		descr[DESCR_BUFSZ];
 
 			snprintf(descr, sizeof(descr) - 1,
-					_("fstrim start 0x%llx run 0x%llx"),
+					_("fstrim start 0x%llx run 0x%llx minlen 0x%llx"),
 					(unsigned long long)start,
-					(unsigned long long)run);
+					(unsigned long long)run,
+					(unsigned long long)minlen);
 			str_liberror(ctx, error, descr);
 			return error;
 		}
@@ -93,6 +97,64 @@
 	return 0;
 }
 
+/* Compute a suitable minlen parameter for fstrim. */
+static uint64_t
+fstrim_compute_minlen(
+	const struct scrub_ctx		*ctx,
+	const struct histogram		*freesp_hist)
+{
+	struct histogram		cdf;
+	uint64_t			ret = 0;
+	double				blk_threshold = 0;
+	unsigned int			i;
+	unsigned int			ag_max_usable;
+	int				error;
+
+	/*
+	 * The kernel will reject a minlen that's larger than m_ag_max_usable.
+	 * We can't calculate or query that value directly, so we guesstimate
+	 * that it's 95% of the AG size.
+	 */
+	ag_max_usable = ctx->mnt.fsgeom.agblocks * 95 / 100;
+
+	if (freesp_hist->totexts == 0)
+		goto out;
+
+	if (debug > 1)
+		hist_print(freesp_hist);
+
+	/* Insufficient samples to make a meaningful histogram */
+	if (freesp_hist->totexts < freesp_hist->nr_buckets * 10)
+		goto out;
+
+	hist_init(&cdf);
+	error = hist_cdf(freesp_hist, &cdf);
+	if (error)
+		goto out_free;
+
+	blk_threshold = freesp_hist->totblocks * ctx->fstrim_block_pct;
+	for (i = 1; i < freesp_hist->nr_buckets; i++) {
+		if (cdf.buckets[i].blocks < blk_threshold) {
+			ret = freesp_hist->buckets[i - 1].low;
+			break;
+		}
+	}
+
+out_free:
+	hist_free(&cdf);
+out:
+	if (debug > 1)
+		printf(_("fstrim minlen %lld threshold %lld ag_max_usable %u\n"),
+				(unsigned long long)ret,
+				(unsigned long long)blk_threshold,
+				ag_max_usable);
+	if (ret > ag_max_usable)
+		ret = ag_max_usable;
+	if (ret == 1)
+		ret = 0;
+	return ret;
+}
+
 /* Trim each AG on the data device. */
 static int
 fstrim_datadev(
@@ -100,8 +162,11 @@
 {
 	struct xfs_fsop_geom	*geo = &ctx->mnt.fsgeom;
 	uint64_t		fsbno;
+	uint64_t		minlen_fsb;
 	int			error;
 
+	minlen_fsb = fstrim_compute_minlen(ctx, &ctx->datadev_hist);
+
 	for (fsbno = 0; fsbno < geo->datablocks; fsbno += geo->agblocks) {
 		uint64_t	fsbcount;
@@ -113,7 +178,7 @@
 		 */
 		progress_add(geo->blocksize);
 		fsbcount = min(geo->datablocks - (fsbno + 1), geo->agblocks - 1);
-		error = fstrim_fsblocks(ctx, fsbno + 1, fsbcount);
+		error = fstrim_fsblocks(ctx, fsbno + 1, fsbcount, minlen_fsb);
 		if (error)
 			return error;
 	}
diff --git a/scrub/vfs.c b/scrub/vfs.c
index cc958ba9438e..22c19485a2da 100644
--- a/scrub/vfs.c
+++ b/scrub/vfs.c
@@ -300,11 +300,13 @@ int
 fstrim(
 	struct scrub_ctx	*ctx,
 	uint64_t		start,
-	uint64_t		len)
+	uint64_t		len,
+	uint64_t		minlen)
 {
 	struct fstrim_range	range = {
 		.start		= start,
 		.len		= len,
+		.minlen		= minlen,
 	};
 
 	if (ioctl(ctx->mnt.fd, FITRIM, &range) == 0)
diff --git a/scrub/vfs.h b/scrub/vfs.h
index 1af8d80d1de6..f0cfd53c27be 100644
--- a/scrub/vfs.h
+++ b/scrub/vfs.h
@@ -24,6 +24,6 @@ typedef int (*scan_fs_tree_dirent_fn)(struct scrub_ctx *, const char *,
 int scan_fs_tree(struct scrub_ctx *ctx, scan_fs_tree_dir_fn dir_fn,
 		scan_fs_tree_dirent_fn dirent_fn, void *arg);
 
-int fstrim(struct scrub_ctx *ctx, uint64_t start, uint64_t len);
+int fstrim(struct scrub_ctx *ctx, uint64_t start, uint64_t len, uint64_t minlen);
 
 #endif /* XFS_SCRUB_VFS_H_ */
diff --git a/scrub/xfs_scrub.c b/scrub/xfs_scrub.c
index 2894f6148e10..296d814eceeb 100644
--- a/scrub/xfs_scrub.c
+++ b/scrub/xfs_scrub.c
@@ -622,11 +622,13 @@ report_outcome(
  */
 enum o_opt_nums {
 	IWARN = 0,
+	FSTRIM_PCT,
 	O_MAX_OPTS,
 };
 
 static char *o_opts[] = {
 	[IWARN]			= "iwarn",
+	[FSTRIM_PCT]		= "fstrim_pct",
 	[O_MAX_OPTS]		= NULL,
 };
@@ -635,8 +637,11 @@ parse_o_opts(
 	struct scrub_ctx	*ctx,
 	char			*p)
 {
+	double			dval;
+
 	while (*p != '\0') {
 		char		*val;
+		char		*endp;
 
 		switch (getsubopt(&p, o_opts, &val)) {
 		case IWARN:
@@ -647,6 +652,35 @@ parse_o_opts(
 			}
 			info_is_warning = true;
 			break;
+		case FSTRIM_PCT:
+			if (!val) {
+				fprintf(stderr,
+	_("-o fstrim_pct requires a parameter\n"));
+				usage();
+			}
+
+			errno = 0;
+			dval = strtod(val, &endp);
+
+			if (*endp) {
+				fprintf(stderr,
+	_("-o fstrim_pct must be a floating point number\n"));
+				usage();
+			}
+			if (errno) {
+				fprintf(stderr,
+	_("-o fstrim_pct: %s\n"),
+						strerror(errno));
+				usage();
+			}
+			if (dval <= 0 || dval > 100) {
+				fprintf(stderr,
+	_("-o fstrim_pct must be larger than 0 and less than 100\n"));
+				usage();
+			}
+
+			ctx->fstrim_block_pct = dval / 100.0;
+			break;
 		default:
 			usage();
 			break;
@@ -659,7 +693,9 @@ main(
 	int			argc,
 	char			**argv)
 {
-	struct scrub_ctx	ctx = {0};
+	struct scrub_ctx	ctx = {
+		.fstrim_block_pct = FSTRIM_BLOCK_PCT_DEFAULT,
+	};
 	struct phase_rusage	all_pi;
 	char			*mtab = NULL;
 	FILE			*progress_fp = NULL;
diff --git a/scrub/xfs_scrub.h b/scrub/xfs_scrub.h
index 1a28f0cc847e..7d48f4bad9ce 100644
--- a/scrub/xfs_scrub.h
+++ b/scrub/xfs_scrub.h
@@ -90,8 +90,20 @@ struct scrub_ctx {
 
 	/* Free space histograms, in fsb */
 	struct histogram	datadev_hist;
+
+	/*
+	 * Pick the largest value for fstrim minlen such that we trim at least
+	 * this much space per volume.
+	 */
+	double			fstrim_block_pct;
 };
 
+/*
+ * Trim only enough free space extents (in order of decreasing length) to
+ * ensure that this percentage of the free space is trimmed.
+ */
+#define FSTRIM_BLOCK_PCT_DEFAULT	(99.0 / 100.0)
+
 /* Phase helper functions */
 void xfs_shutdown_fs(struct scrub_ctx *ctx);
 int scrub_cleanup(struct scrub_ctx *ctx);
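[For readers unfamiliar with FITRIM: the scrub/vfs.c hunk above works by passing the computed minimum through struct fstrim_range.  Here is a minimal standalone sketch of that call — the wrapper name is mine, not from the patch, and it takes a bare fd rather than a scrub_ctx, but the ioctl usage and the "0 or positive errno" return convention match the patch's fstrim().]

```c
#include <errno.h>
#include <stdint.h>
#include <sys/ioctl.h>
#include <linux/fs.h>

/*
 * Trim free space within [start, start + len) on the filesystem backing
 * fd, skipping free extents shorter than minlen bytes.  minlen == 0
 * trims everything, like fstrim(8); a larger minlen is how xfs_scrub
 * avoids the many tiny trim requests discussed in this thread.
 */
int fstrim_minlen(int fd, uint64_t start, uint64_t len, uint64_t minlen)
{
	struct fstrim_range	range = {
		.start	= start,
		.len	= len,
		.minlen	= minlen,
	};

	if (ioctl(fd, FITRIM, &range) == 0)
		return 0;
	return errno;
}
```

On return the kernel also updates range.len to the number of bytes actually trimmed, which the sketch discards for brevity.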