[RFC] btrfs: introduce qgroup dirty extents threshold mechanism for snapshot drop and inode truncation

When dropping a lot of btrfs subvolumes with qgroup enabled, there can
be a pretty large latency for btrfs_commit_transaction().

The reason is, dropping subvolumes/snapshots will create a lot of extent
owner change, thus qgroup have to traces such owner changes and cause
latency in btrfs_commit_transaction().

For snapshot/subvolume drop, we don't really have any good way to reduce
the number of dirty qgroup extents.

But least we can still reduce the latency of each
btrfs_commit_transaction() run, by trying to commit transaction when the
dirty qgroup extent number reaches a certain threshold.

By this, we can commit several small transactions instead of a big and
slow transaction.

This patch will introduce the following things:

- The ability to trace how many dirty qgroup extents for one transaction
  A new member, atomic64_t nr_dirty_extents, is introduced to
  btrfs_delayed_ref_root.

- Introduce btrfs_should_commit_trans() helper
  Now btrfs_should_end_transaction() will also call
  btrfs_should_commit_trans() before returning.

- Commit transaction for subvolume drop if we hits the threshold
- Commit transaction for inode truncation if we hits the threshold

There is some quick benchmarking for it.

The fs is created by the following script:

  for (( j = 0; j < 16; j++ )); do
          btrfs subv create $mnt/src/subvol_$j
          for (( i = 0; i < 512; i++)) ; do
                  xfs_io -f -c "pwrite 0 2k" $mnt/src/subvol_$j/file_inline_$i > /dev/null
                  xfs_io -f -c "pwrite 0 4k" $mnt/src/subvol_$j/file_reg_$i > /dev/null
          done
  done

  sync

  btrfs quota enable $mnt
  btrfs quota rescan -w $mnt
  btrfs sub delete $mnt/src/subvol*

I tried several threshold value, the execution time for
btrfs_qgroup_account_extents() are:

 Threshold	| Number of calls	| Average execution time
------------------------------------------------------------------------
 infinite	| 1			| 770.74ms
 8K		| 3			| 280.47ms
 4K		| 5			| 146.41ms
 2K		| 9 			|  72.36ms
 1K		| 18			|  35.97ms

Currently I choose the 4K as the threshold for its minimal impact on the
number of new transactions to be committed, while still keep the latency
more or less acceptable.

There is another hidden pitfall, if all these extents are mostly shared
between different snapshots, current snapshot/subvolume dropping
mechanism (breadth-first search) makes the lower level leaves to trigger
tons of backref walk, while the higher level tree blocks will only
trigger less and less work load.

Thus this enhancement won't be that obvious to drop such mostly shared
snapshots.
To address that, we need to rework how we drop snapshots/subvolumes, and
it would definitely be another story.

Signed-off-by: Qu Wenruo <wqu@suse.com>
---
Reason for RFC:
- The threshold value
  Any immediate number is not flex enough, but I don't have
  better ideas on how to set the value without introducing extra and
  complex on-disk format change.
  This threshold doesn't deserve that large change on on-disk format,
  nor even a mount option.
---
 fs/btrfs/delayed-ref.h | 20 ++++++++++++++++++++
 fs/btrfs/extent-tree.c | 10 +++++++++-
 fs/btrfs/qgroup.c      |  9 ++++++---
 fs/btrfs/transaction.c |  3 +++
 fs/btrfs/transaction.h | 10 ++++++++++
 5 files changed, 48 insertions(+), 4 deletions(-)

Message ID	20201119072828.70909-1-wqu@suse.com (mailing list archive)
State	New, archived
Headers	show Return-Path: <linux-btrfs-owner@kernel.org> X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-18.7 required=3.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS,INCLUDES_CR_TRAILER, INCLUDES_PATCH,MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED, USER_AGENT_GIT autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 2D4B6C2D0E4 for <linux-btrfs@archiver.kernel.org>; Thu, 19 Nov 2020 07:28:36 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by mail.kernel.org (Postfix) with ESMTP id B41F820658 for <linux-btrfs@archiver.kernel.org>; Thu, 19 Nov 2020 07:28:35 +0000 (UTC) Authentication-Results: mail.kernel.org; dkim=pass (1024-bit key) header.d=suse.com header.i=@suse.com header.b="pmLjdbNG" Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726269AbgKSH2e (ORCPT <rfc822;linux-btrfs@archiver.kernel.org>); Thu, 19 Nov 2020 02:28:34 -0500 Received: from mx2.suse.de ([195.135.220.15]:53448 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725843AbgKSH2e (ORCPT <rfc822;linux-btrfs@vger.kernel.org>); Thu, 19 Nov 2020 02:28:34 -0500 X-Virus-Scanned: by amavisd-new at test-mx.suse.de DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=suse.com; s=susede1; t=1605770912; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc: mime-version:mime-version: content-transfer-encoding:content-transfer-encoding; bh=buVK0bp4q9/of8nGBsPpGgTV1gGTa87cMVbcCiYAdTI=; b=pmLjdbNGzfl/4GibUFjf7WyGtl+xcqKEKB2UlnKA2qgHx+3IGWe7frtYiYU76NGN3m/NRF 36qw95m6FAlGrwUkUcHEEz1zvwparZqBITHpJZqwOk83Bd1f4D2uwLv+PPpAnwyfsnGhMU G4wvZxcPmZv33EfsoZO2l6BiHI/A9ZA= Received: from relay2.suse.de (unknown [195.135.221.27]) by mx2.suse.de (Postfix) with ESMTP id 663E9AEA3 for <linux-btrfs@vger.kernel.org>; Thu, 19 Nov 2020 07:28:32 +0000 (UTC) From: Qu Wenruo <wqu@suse.com> To: linux-btrfs@vger.kernel.org Subject: [PATCH RFC] btrfs: introduce qgroup dirty extents threshold mechanism for snapshot drop and inode truncation Date: Thu, 19 Nov 2020 15:28:28 +0800 Message-Id: <20201119072828.70909-1-wqu@suse.com> X-Mailer: git-send-email 2.29.2 MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: <linux-btrfs.vger.kernel.org> X-Mailing-List: linux-btrfs@vger.kernel.org
Series	[RFC] btrfs: introduce qgroup dirty extents threshold mechanism for snapshot drop and inode truncation \| expand [RFC] btrfs: introduce qgroup dirty extents threshold mechanism for snapshot drop and inode truncat…

[RFC] btrfs: introduce qgroup dirty extents threshold mechanism for snapshot drop and inode truncation

Commit Message

Comments

Patch