[v2,18/19] btrfs: avoid duplicated resolution of indirect backrefs during fiemap

From: Filipe Manana <fdmanana@suse.com>

During fiemap, when determining if a data extent is shared or not, if we
find that the extent is not directly shared, then we need to determine if
it's shared through subtrees. For that we need to resolve the indirect
reference we found in order to figure out the path in the inode's fs tree,
which is a path starting at the fs tree's root node and going down to the
leaf that contains the file extent item that points to the data extent.
We then proceed to determine if any extent buffer in that path is shared
with other trees or not.
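
As a rough illustration of that last step, here is a minimal user-space
sketch. It is not the kernel implementation: struct path_block, its refs
field and path_has_shared_block() are hypothetical names, and walking the
path from root to leaf with a plain reference count is a simplification
of what backref walking actually does.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical model of one extent buffer (tree block) in a path. */
struct path_block {
	uint64_t bytenr;   /* logical address of the tree block */
	unsigned int refs; /* assumed count of parents/trees pointing at it */
};

/*
 * Walk a path stored from root (index 0) to leaf (index len - 1) and
 * report whether any block in it is referenced more than once.
 */
static bool path_has_shared_block(const struct path_block *path, size_t len)
{
	for (size_t i = 0; i < len; i++) {
		if (path[i].refs > 1)
			return true;
	}
	return false;
}
```

In this model a path is shared as soon as one block in it has more than
one reference, which is why the whole path has to be known before the
question can be answered.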

Currently whenever we find the data extent that a file extent item points
to is not directly shared, we always resolve the path in the fs tree, and
then check if any extent buffer in the path is shared. This is a lot of
work and when we have file extent items that belong to the same leaf, we
have the same path, so we only need to calculate it once.

This change does that: it keeps track of the current and previous leaf,
and when we find that a data extent is not directly shared, we compute
the fs tree path only once and then reuse it for every other file extent
item in the same leaf, using the existing cached path result for the leaf
as long as the cached results are valid.

This saves us from doing expensive b+tree searches in the fs tree of our
target inode, as well as other minor work.
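
The per-leaf reuse can be sketched in user space as follows. This is only
an illustration under stated assumptions, not the code added by this
patch: struct share_ctx, its fields and both functions are hypothetical,
the expensive resolution is replaced by a stub that counts its calls, and
cache validity is reduced to a single flag.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical context tracking the current and previous leaf. */
struct share_ctx {
	uint64_t curr_leaf_bytenr; /* leaf of the item being processed */
	uint64_t prev_leaf_bytenr; /* leaf of the previously processed item */
	bool cached_valid;         /* is cached_shared usable? */
	bool cached_shared;        /* cached result of the path check */
	unsigned long resolutions; /* number of expensive resolutions done */
};

static void share_ctx_init(struct share_ctx *ctx)
{
	ctx->curr_leaf_bytenr = UINT64_MAX; /* no leaf seen yet */
	ctx->prev_leaf_bytenr = UINT64_MAX;
	ctx->cached_valid = false;
	ctx->cached_shared = false;
	ctx->resolutions = 0;
}

/*
 * Stub for the expensive part (resolve the fs tree path and check each
 * extent buffer in it). For the demo, leaves with an even bytenr are
 * pretended to be on a shared path.
 */
static bool resolve_and_check_path(struct share_ctx *ctx, uint64_t leaf_bytenr)
{
	ctx->resolutions++;
	return (leaf_bytenr % 2) == 0;
}

/* Return the path sharedness for an item living in leaf_bytenr. */
static bool path_is_shared(struct share_ctx *ctx, uint64_t leaf_bytenr)
{
	ctx->prev_leaf_bytenr = ctx->curr_leaf_bytenr;
	ctx->curr_leaf_bytenr = leaf_bytenr;

	/* Same leaf as the previous item: reuse the cached result. */
	if (ctx->cached_valid && ctx->prev_leaf_bytenr == leaf_bytenr)
		return ctx->cached_shared;

	ctx->cached_shared = resolve_and_check_path(ctx, leaf_bytenr);
	ctx->cached_valid = true;
	return ctx->cached_shared;
}
```

Calling path_is_shared() for a run of file extent items in the same leaf
triggers the stub only for the first item; the stub runs again only when
the leaf changes.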

The following test was run on a non-debug kernel (Debian's default kernel
config):

   $ cat test-with-snapshots.sh
   #!/bin/bash

   DEV=/dev/sdi
   MNT=/mnt/sdi

   umount $DEV &> /dev/null
   mkfs.btrfs -f $DEV
   # Use compression to quickly create files with a lot of extents
   # (each with a size of 128K).
   mount -o compress=lzo $DEV $MNT

   # 40G gives 327680 extents, each with a size of 128K.
   xfs_io -f -c "pwrite -S 0xab -b 1M 0 40G" $MNT/foobar

   # Add some more files to increase the size of the fs and extent
   # trees (in the real world there's a lot of files and extents
   # from other files).
   xfs_io -f -c "pwrite -S 0xcd -b 1M 0 20G" $MNT/file1
   xfs_io -f -c "pwrite -S 0xef -b 1M 0 20G" $MNT/file2
   xfs_io -f -c "pwrite -S 0x73 -b 1M 0 20G" $MNT/file3

   # Create a snapshot so all the extents become indirectly shared
   # through subtrees, with a generation less than or equal to the
   # generation used to create the snapshot.
   btrfs subvolume snapshot -r $MNT $MNT/snap1

   umount $MNT
   mount -o compress=lzo $DEV $MNT

   start=$(date +%s%N)
   filefrag $MNT/foobar
   end=$(date +%s%N)
   dur=$(( (end - start) / 1000000 ))
   echo "fiemap took $dur milliseconds (metadata not cached)"
   echo

   start=$(date +%s%N)
   filefrag $MNT/foobar
   end=$(date +%s%N)
   dur=$(( (end - start) / 1000000 ))
   echo "fiemap took $dur milliseconds (metadata cached)"

   umount $MNT

Result before applying this patch:

   (...)
   /mnt/sdi/foobar: 327680 extents found
   fiemap took 1204 milliseconds (metadata not cached)

   /mnt/sdi/foobar: 327680 extents found
   fiemap took 729 milliseconds (metadata cached)

Result after applying this patch:

   (...)
   /mnt/sdi/foobar: 327680 extents found
   fiemap took 732 milliseconds (metadata not cached)

   /mnt/sdi/foobar: 327680 extents found
   fiemap took 421 milliseconds (metadata cached)

That's a 39.2% reduction (1204 ms to 732 ms) for the metadata not cached
case, and a 42.2% reduction (729 ms to 421 ms) for the cached metadata case.
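
For anyone who wants to recompute the figures from the timings and sizes
listed above, a small sketch follows. The helper names pct_reduction()
and extent_count() are made up for this example.

```c
/* Percentage reduction when going from before_ms to after_ms. */
static double pct_reduction(double before_ms, double after_ms)
{
	return (before_ms - after_ms) / before_ms * 100.0;
}

/* Number of extents of extent_size bytes needed to cover file_size bytes. */
static unsigned long long extent_count(unsigned long long file_size,
				       unsigned long long extent_size)
{
	return file_size / extent_size;
}
```

pct_reduction(1204, 732) gives about 39.2, pct_reduction(729, 421) gives
about 42.2, and extent_count(40ULL << 30, 128ULL << 10) gives 327680,
the extent count reported by filefrag.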

The test is somewhat limited, in the sense that the gains may be higher
in practice: the test filesystem is small, so the fs and extent trees are
small, and there's no concurrent access to the trees, therefore no lock
contention.

Signed-off-by: Filipe Manana <fdmanana@suse.com>
---
 fs/btrfs/backref.c   | 64 +++++++++++++++++++++++++++++++++++++-------
 fs/btrfs/backref.h   | 13 +++++++++
 fs/btrfs/extent_io.c |  2 ++
 3 files changed, 69 insertions(+), 10 deletions(-)

Message ID	68efa89bd86900f1cee4b1cca82e5280a33e91cf.1665490019.git.fdmanana@suse.com (mailing list archive)
State	New, archived
Date	Tue, 11 Oct 2022 13:17:08 +0100
Series	btrfs: fixes, cleanups and optimizations around fiemap
