
btrfs: Remove the duplicated and sometimes too cautious btrfs_can_relocate()

Message ID 20190718054857.8970-1-wqu@suse.com (mailing list archive)
State New, archived
Series btrfs: Remove the duplicated and sometimes too cautious btrfs_can_relocate()

Commit Message

Qu Wenruo July 18, 2019, 5:48 a.m. UTC
[BUG]
The following script can easily cause unexpected ENOSPC:
  umount $dev &> /dev/null
  umount $mnt &> /dev/null

  mkfs.btrfs -b 1G -m single -d single $dev -f > /dev/null

  mount $dev $mnt -o enospc_debug

  for i in $(seq -w 0 511); do
  	xfs_io -f -c "pwrite 0 1m" $mnt/inline_$i > /dev/null
  done
  sync

  btrfs balance start --full $mnt || return 1
  sync

  # This will report -ENOSPC
  btrfs balance start --full $mnt || return 1
  umount $mnt

Also, btrfs/156 can fail due to ENOSPC.

[CAUSE]
The ENOSPC is reported by btrfs_can_relocate().

btrfs_can_relocate() does the following checks:
- Whether the block group is empty
  If it is empty, we can definitely relocate this block group.
- Whether we are not the only block group and have enough space
  If so, we can relocate this block group.

The above two checks are completely OK, although one could argue they
don't make much sense, but the following check is vague and sometimes
so cautious that it causes a false ENOSPC:
- Whether we can allocate a new block group as large as the current one.
  If the previous two checks failed, this one must pass for the block
  group to be relocated.

There are several problems here:
1. We don't need to allocate as much as the source block group.
   E.g. the source block group is 1G in size but only 1M is used. We only
   need to allocate a data chunk larger than 1M to continue relocation.

2. The check in btrfs_can_relocate() is vague and cannot be as accurate
   as __btrfs_alloc_chunk().
   How could fewer than 200 lines of code do the same work as
   __btrfs_alloc_chunk()? And it's hard to maintain two different
   functions doing similar work.

3. We have a more accurate check in btrfs_inc_block_group_ro().
   btrfs_inc_block_group_ro() does a similar check, but much better.
   In btrfs_inc_block_group_ro() we do:
   * Forced chunk allocation if we're converting profiles

   * Try to mark the block group ro first
     In inc_block_group_ro(), we do a comprehensive space check to
     ensure we have enough free space for the used and reserved space
     of the block group.
     If that succeeds, we're done.

   * Force chunk allocation for more space
     If this fails, we have indeed hit ENOSPC.

   * Try to mark the block group ro again
     As we now have extra space, we can try again.
     This is the last chance: either we have enough space and succeed,
     or the newly allocated space is still not large enough and ENOSPC
     is returned.

   Such try-allocate-retry behavior (sketched below) is more accurate
   in every way than btrfs_can_relocate(), so we can rely on
   btrfs_inc_block_group_ro() and drop btrfs_can_relocate()
   completely.
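
   A rough sketch of that flow (simplified pseudocode based on the steps
   above, not the exact kernel source):

     btrfs_inc_block_group_ro(cache):
         if (converting to a different profile)
             btrfs_chunk_alloc(target_profile, CHUNK_ALLOC_FORCE);
         ret = inc_block_group_ro(cache, 0);  /* space check vs space_info */
         if (!ret)
             return 0;                        /* enough room elsewhere */
         btrfs_chunk_alloc(current_profile, CHUNK_ALLOC_FORCE);
         ret = inc_block_group_ro(cache, 0);  /* retry with the new chunk */
         return ret;                          /* ENOSPC only if still short */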

[FIX]
Since the regular balance routine already has a better ENOSPC detector,
there is no need to keep the false-alert-prone btrfs_can_relocate().

Signed-off-by: Qu Wenruo <wqu@suse.com>
---
 fs/btrfs/ctree.h       |   1 -
 fs/btrfs/extent-tree.c | 141 -----------------------------------------
 fs/btrfs/volumes.c     |   4 --
 3 files changed, 146 deletions(-)

Comments

Filipe Manana July 18, 2019, 11:16 a.m. UTC | #1
On Thu, Jul 18, 2019 at 6:50 AM Qu Wenruo <wqu@suse.com> wrote:
>
> [BUG]
> The following script can easily cause unexpected ENOSPC:
>   umount $dev &> /dev/null
>   umount $mnt &> /dev/null
>
>   mkfs.btrfs -b 1G -m single -d single $dev -f > /dev/null
>
>   mount $dev $mnt -o enospc_debug
>
>   for i in $(seq -w 0 511); do
>         xfs_io -f -c "pwrite 0 1m" $mnt/inline_$i > /dev/null
>   done
>   sync
>
>   btrfs balance start --full $mnt || return 1
>   sync
>
>   # This will report -ENOSPC
>   btrfs balance start --full $mnt || return 1
>   umount $mnt
>
> Also, btrfs/156 can also fail due to ENOSPC.

Well, that script you pasted is essentially btrfs/156.

When did the test start failing? When the test was added, it didn't
fail, did it?

>
> [CAUSE]
> The ENOSPC is reported by btrfs_can_relocate().
>
> In btrfs_can_relocate(), it does the following check:
> - If the block group is empty
>   If empty, definitely we can relocate this block group.
> - If we are not the only block group and we have enough space
>   Then we can relocate this block group.
>
> Above two checks are completely OK, although I could argue they doesn't
> make much sense, but the following check is vague and even sometimes
> too cautious to cause ENOSPC:
> - If we can allocate a new block group as large as current one.
>   If we failed previous two checks, we must pass this to relocate this
>   block group.
>
> There are several problems here:
> 1. We don't need to allocate as large as the source block group.
>    E.g. source block group is 1G sized, but only 1M used. We only need
>    to allocated a data chunk larger than 1M to continue relocation.

Right. But where does btrfs_can_relocate() make such an assumption?
It only tries to check if there's enough space for an amount that
corresponds to the amount used in the block group, that is, not the
size of the block group (unless the block group is completely full).

>
> 2. The check in btrfs_can_relocate() is vague and impossible to be as
>    accurate as __btrfs_alloc_chunk()
>    How could this less than 200 lines code do the same work as
>    __btrfs_alloc_chunk()? And it's hard to maintain two different
>    functions to do similar work.
>
> 3. We have more accurate check in btrfs_inc_block_group_ro().
>    Btrfs_inc_block_group_ro() is doing similar check but much better.
>    In btrfs_inc_block_group_ro() we do:
>    * Forced chunk allocation if we're converting
>
>    * Try to mark block group ro first
>      in inc_btrfs_block_group_ro(), we will do comprehensive space
>      check to ensure we have enough free space for the used and reserved
>      space of the block group.
>      If succeeded, we're done.
>
>    * Force chunk allocation for more space
>      If we failed here, we indeed hits ENOSPC.
>
>    * Try to mark block group ro again
>      As we have extra space, we can try again.
>      This is the last chance, either we have enough space now and
>      success, or the newly allocated space is not large enough, ENOSPC
>      is returned.
>
>    Such try-allocate-try behavior is way more accurate in every way
>    compared to btrfs_can_relocate(), we can rely on
>    btrfs_inc_block_group_ro() to replace btrfs_can_relocate()
>    completely.
>
> [FIX]
> Since regular balance routine already has a better ENOSPC detector,
> there is no need to keep the false-alert-prone btrfs_can_relocate().
>
> Signed-off-by: Qu Wenruo <wqu@suse.com>
> ---
>  fs/btrfs/ctree.h       |   1 -
>  fs/btrfs/extent-tree.c | 141 -----------------------------------------
>  fs/btrfs/volumes.c     |   4 --
>  3 files changed, 146 deletions(-)
>
> diff --git a/fs/btrfs/ctree.h b/fs/btrfs/ctree.h
> index 0a61dff27f57..965d1e5a4af7 100644
> --- a/fs/btrfs/ctree.h
> +++ b/fs/btrfs/ctree.h
> @@ -2772,7 +2772,6 @@ int btrfs_setup_space_cache(struct btrfs_trans_handle *trans);
>  int btrfs_extent_readonly(struct btrfs_fs_info *fs_info, u64 bytenr);
>  int btrfs_free_block_groups(struct btrfs_fs_info *info);
>  int btrfs_read_block_groups(struct btrfs_fs_info *info);
> -int btrfs_can_relocate(struct btrfs_fs_info *fs_info, u64 bytenr);
>  int btrfs_make_block_group(struct btrfs_trans_handle *trans,
>                            u64 bytes_used, u64 type, u64 chunk_offset,
>                            u64 size);
> diff --git a/fs/btrfs/extent-tree.c b/fs/btrfs/extent-tree.c
> index 5faf057f6f37..822a4102980d 100644
> --- a/fs/btrfs/extent-tree.c
> +++ b/fs/btrfs/extent-tree.c
> @@ -9774,147 +9774,6 @@ void btrfs_dec_block_group_ro(struct btrfs_block_group_cache *cache)
>         spin_unlock(&sinfo->lock);
>  }
>
> -/*
> - * Checks to see if it's even possible to relocate this block group.
> - *
> - * @return - -1 if it's not a good idea to relocate this block group, 0 if its
> - * ok to go ahead and try.
> - */
> -int btrfs_can_relocate(struct btrfs_fs_info *fs_info, u64 bytenr)
> -{
> -       struct btrfs_block_group_cache *block_group;
> -       struct btrfs_space_info *space_info;
> -       struct btrfs_fs_devices *fs_devices = fs_info->fs_devices;
> -       struct btrfs_device *device;
> -       u64 min_free;
> -       u64 dev_min = 1;
> -       u64 dev_nr = 0;
> -       u64 target;
> -       int debug;
> -       int index;
> -       int full = 0;
> -       int ret = 0;
> -
> -       debug = btrfs_test_opt(fs_info, ENOSPC_DEBUG);
> -
> -       block_group = btrfs_lookup_block_group(fs_info, bytenr);
> -
> -       /* odd, couldn't find the block group, leave it alone */
> -       if (!block_group) {
> -               if (debug)
> -                       btrfs_warn(fs_info,
> -                                  "can't find block group for bytenr %llu",
> -                                  bytenr);
> -               return -1;
> -       }
> -
> -       min_free = btrfs_block_group_used(&block_group->item);
> -
> -       /* no bytes used, we're good */
> -       if (!min_free)
> -               goto out;
> -
> -       space_info = block_group->space_info;
> -       spin_lock(&space_info->lock);
> -
> -       full = space_info->full;
> -
> -       /*
> -        * if this is the last block group we have in this space, we can't
> -        * relocate it unless we're able to allocate a new chunk below.
> -        *
> -        * Otherwise, we need to make sure we have room in the space to handle
> -        * all of the extents from this block group.  If we can, we're good
> -        */
> -       if ((space_info->total_bytes != block_group->key.offset) &&
> -           (btrfs_space_info_used(space_info, false) + min_free <
> -            space_info->total_bytes)) {
> -               spin_unlock(&space_info->lock);
> -               goto out;
> -       }
> -       spin_unlock(&space_info->lock);
> -
> -       /*
> -        * ok we don't have enough space, but maybe we have free space on our
> -        * devices to allocate new chunks for relocation, so loop through our
> -        * alloc devices and guess if we have enough space.  if this block
> -        * group is going to be restriped, run checks against the target
> -        * profile instead of the current one.
> -        */
> -       ret = -1;
> -
> -       /*
> -        * index:
> -        *      0: raid10
> -        *      1: raid1
> -        *      2: dup
> -        *      3: raid0
> -        *      4: single
> -        */
> -       target = get_restripe_target(fs_info, block_group->flags);
> -       if (target) {
> -               index = btrfs_bg_flags_to_raid_index(extended_to_chunk(target));
> -       } else {
> -               /*
> -                * this is just a balance, so if we were marked as full
> -                * we know there is no space for a new chunk
> -                */
> -               if (full) {
> -                       if (debug)
> -                               btrfs_warn(fs_info,
> -                                          "no space to alloc new chunk for block group %llu",
> -                                          block_group->key.objectid);
> -                       goto out;
> -               }
> -
> -               index = btrfs_bg_flags_to_raid_index(block_group->flags);
> -       }
> -
> -       if (index == BTRFS_RAID_RAID10) {
> -               dev_min = 4;
> -               /* Divide by 2 */
> -               min_free >>= 1;
> -       } else if (index == BTRFS_RAID_RAID1) {
> -               dev_min = 2;
> -       } else if (index == BTRFS_RAID_DUP) {
> -               /* Multiply by 2 */
> -               min_free <<= 1;
> -       } else if (index == BTRFS_RAID_RAID0) {
> -               dev_min = fs_devices->rw_devices;
> -               min_free = div64_u64(min_free, dev_min);
> -       }
> -
> -       mutex_lock(&fs_info->chunk_mutex);
> -       list_for_each_entry(device, &fs_devices->alloc_list, dev_alloc_list) {
> -               u64 dev_offset;
> -
> -               /*
> -                * check to make sure we can actually find a chunk with enough
> -                * space to fit our block group in.
> -                */
> -               if (device->total_bytes > device->bytes_used + min_free &&
> -                   !test_bit(BTRFS_DEV_STATE_REPLACE_TGT, &device->dev_state)) {
> -                       ret = find_free_dev_extent(device, min_free,
> -                                                  &dev_offset, NULL);
> -                       if (!ret)
> -                               dev_nr++;
> -
> -                       if (dev_nr >= dev_min)
> -                               break;

And here's a bug in that code. Before breaking out of the loop, ret
should be set to 0.

In general I'm ok with the change, but would like an answer to those questions.

Thanks.

Nikolay Borisov July 18, 2019, 11:18 a.m. UTC | #2
On 18.07.19 8:48, Qu Wenruo wrote:
> [BUG]
> The following script can easily cause unexpected ENOSPC:
>   umount $dev &> /dev/null
>   umount $mnt &> /dev/null
> 
>   mkfs.btrfs -b 1G -m single -d single $dev -f > /dev/null
> 
>   mount $dev $mnt -o enospc_debug
> 
>   for i in $(seq -w 0 511); do
>   	xfs_io -f -c "pwrite 0 1m" $mnt/inline_$i > /dev/null
>   done
>   sync
> 
>   btrfs balance start --full $mnt || return 1
>   sync
> 
>   # This will report -ENOSPC
>   btrfs balance start --full $mnt || return 1
>   umount $mnt
> 
> Also, btrfs/156 can also fail due to ENOSPC.
> 
> [CAUSE]
> The ENOSPC is reported by btrfs_can_relocate().
> 
> In btrfs_can_relocate(), it does the following check:
> - If the block group is empty
>   If empty, definitely we can relocate this block group.
> - If we are not the only block group and we have enough space
>   Then we can relocate this block group.
> 
> Above two checks are completely OK, although I could argue they doesn't
> make much sense, but the following check is vague and even sometimes
> too cautious to cause ENOSPC:
> - If we can allocate a new block group as large as current one.
>   If we failed previous two checks, we must pass this to relocate this
>   block group.

btrfs_can_relocate() requires min_free to be allocatable. min_free is
defined as the used space in the block group being relocated, which I
think is correct. Also, I find the logic which adjusts min_free and
dev_min to be correct. Finally, the function checks whether the device's
free space is fragmented by trying to find a free device extent of the
appropriate size. The question is: can we really have a device that has
enough free space, yet is fragmented such that find_free_dev_extent()
fails, which results in failing the allocation? I think the answer is
no, since we allocate at chunk granularity. What am I missing?


OTOH, in btrfs_inc_block_group_ro() we only allocate a chunk if:

a) we are changing raid profiles
b) inc_block_group_ro() fails for our block group.

And for b) I'm a bit puzzled as to what the code is supposed to mean. We have: 

    num_bytes = cache->key.offset - cache->reserved - cache->pinned -
                cache->bytes_super - btrfs_block_group_used(&cache->item);
    sinfo_used = btrfs_space_info_used(sinfo, true);

    if (sinfo_used + num_bytes + min_allocable_bytes <=
        sinfo->total_bytes) {
            /* set ro */
    }

This means: if the free space in the block group plus the used space in
the space info is smaller than the total space in the space info, make
this block group RO. What's the rationale behind that?
Nikolay Borisov July 18, 2019, 11:23 a.m. UTC | #3
On 18.07.19 14:16, Filipe Manana wrote:
> On Thu, Jul 18, 2019 at 6:50 AM Qu Wenruo <wqu@suse.com> wrote:
>>
>> [BUG]
>> The following script can easily cause unexpected ENOSPC:
>>   umount $dev &> /dev/null
>>   umount $mnt &> /dev/null
>>
>>   mkfs.btrfs -b 1G -m single -d single $dev -f > /dev/null
>>
>>   mount $dev $mnt -o enospc_debug
>>
>>   for i in $(seq -w 0 511); do
>>         xfs_io -f -c "pwrite 0 1m" $mnt/inline_$i > /dev/null
>>   done
>>   sync
>>
>>   btrfs balance start --full $mnt || return 1
>>   sync
>>
>>   # This will report -ENOSPC
>>   btrfs balance start --full $mnt || return 1
>>   umount $mnt
>>
>> Also, btrfs/156 can also fail due to ENOSPC.
> 
> Well, that script you pasted is btrfs/156 essentially.
> 
> When did the test started failing? When the test was added, it didn't
> fail, did it?
> 
>>
>> [CAUSE]
>> The ENOSPC is reported by btrfs_can_relocate().
>>
>> In btrfs_can_relocate(), it does the following check:
>> - If the block group is empty
>>   If empty, definitely we can relocate this block group.
>> - If we are not the only block group and we have enough space
>>   Then we can relocate this block group.
>>
>> Above two checks are completely OK, although I could argue they doesn't
>> make much sense, but the following check is vague and even sometimes
>> too cautious to cause ENOSPC:
>> - If we can allocate a new block group as large as current one.
>>   If we failed previous two checks, we must pass this to relocate this
>>   block group.
>>
>> There are several problems here:
>> 1. We don't need to allocate as large as the source block group.
>>    E.g. source block group is 1G sized, but only 1M used. We only need
>>    to allocated a data chunk larger than 1M to continue relocation.
> 
> Right. But where does btrfs_can_relocate() do such assumption?
> It only tries to check if there's enough space for an amount that
> corresponds to the amount used in the block group, that is, not the
> size of the block group (unless the block group is completely full).
> 
>>
>> 2. The check in btrfs_can_relocate() is vague and impossible to be as
>>    accurate as __btrfs_alloc_chunk()
>>    How could this less than 200 lines code do the same work as
>>    __btrfs_alloc_chunk()? And it's hard to maintain two different
>>    functions to do similar work.
>>
>> 3. We have more accurate check in btrfs_inc_block_group_ro().
>>    Btrfs_inc_block_group_ro() is doing similar check but much better.
>>    In btrfs_inc_block_group_ro() we do:
>>    * Forced chunk allocation if we're converting
>>
>>    * Try to mark block group ro first
>>      in inc_btrfs_block_group_ro(), we will do comprehensive space
>>      check to ensure we have enough free space for the used and reserved
>>      space of the block group.
>>      If succeeded, we're done.
>>
>>    * Force chunk allocation for more space
>>      If we failed here, we indeed hits ENOSPC.
>>
>>    * Try to mark block group ro again
>>      As we have extra space, we can try again.
>>      This is the last chance, either we have enough space now and
>>      success, or the newly allocated space is not large enough, ENOSPC
>>      is returned.
>>
>>    Such try-allocate-try behavior is way more accurate in every way
>>    compared to btrfs_can_relocate(), we can rely on
>>    btrfs_inc_block_group_ro() to replace btrfs_can_relocate()
>>    completely.
>>
>> [FIX]
>> Since regular balance routine already has a better ENOSPC detector,
>> there is no need to keep the false-alert-prone btrfs_can_relocate().
>>
>> Signed-off-by: Qu Wenruo <wqu@suse.com>
>> ---
>>  fs/btrfs/ctree.h       |   1 -
>>  fs/btrfs/extent-tree.c | 141 -----------------------------------------
>>  fs/btrfs/volumes.c     |   4 --
>>  3 files changed, 146 deletions(-)
>>
>> diff --git a/fs/btrfs/ctree.h b/fs/btrfs/ctree.h
>> index 0a61dff27f57..965d1e5a4af7 100644
>> --- a/fs/btrfs/ctree.h
>> +++ b/fs/btrfs/ctree.h
>> @@ -2772,7 +2772,6 @@ int btrfs_setup_space_cache(struct btrfs_trans_handle *trans);
>>  int btrfs_extent_readonly(struct btrfs_fs_info *fs_info, u64 bytenr);
>>  int btrfs_free_block_groups(struct btrfs_fs_info *info);
>>  int btrfs_read_block_groups(struct btrfs_fs_info *info);
>> -int btrfs_can_relocate(struct btrfs_fs_info *fs_info, u64 bytenr);
>>  int btrfs_make_block_group(struct btrfs_trans_handle *trans,
>>                            u64 bytes_used, u64 type, u64 chunk_offset,
>>                            u64 size);
>> diff --git a/fs/btrfs/extent-tree.c b/fs/btrfs/extent-tree.c
>> index 5faf057f6f37..822a4102980d 100644
>> --- a/fs/btrfs/extent-tree.c
>> +++ b/fs/btrfs/extent-tree.c
>> @@ -9774,147 +9774,6 @@ void btrfs_dec_block_group_ro(struct btrfs_block_group_cache *cache)
>>         spin_unlock(&sinfo->lock);
>>  }
>>
>> -/*
>> - * Checks to see if it's even possible to relocate this block group.
>> - *
>> - * @return - -1 if it's not a good idea to relocate this block group, 0 if its
>> - * ok to go ahead and try.
>> - */
>> -int btrfs_can_relocate(struct btrfs_fs_info *fs_info, u64 bytenr)
>> -{
>> -       struct btrfs_block_group_cache *block_group;
>> -       struct btrfs_space_info *space_info;
>> -       struct btrfs_fs_devices *fs_devices = fs_info->fs_devices;
>> -       struct btrfs_device *device;
>> -       u64 min_free;
>> -       u64 dev_min = 1;
>> -       u64 dev_nr = 0;
>> -       u64 target;
>> -       int debug;
>> -       int index;
>> -       int full = 0;
>> -       int ret = 0;
>> -
>> -       debug = btrfs_test_opt(fs_info, ENOSPC_DEBUG);
>> -
>> -       block_group = btrfs_lookup_block_group(fs_info, bytenr);
>> -
>> -       /* odd, couldn't find the block group, leave it alone */
>> -       if (!block_group) {
>> -               if (debug)
>> -                       btrfs_warn(fs_info,
>> -                                  "can't find block group for bytenr %llu",
>> -                                  bytenr);
>> -               return -1;
>> -       }
>> -
>> -       min_free = btrfs_block_group_used(&block_group->item);
>> -
>> -       /* no bytes used, we're good */
>> -       if (!min_free)
>> -               goto out;
>> -
>> -       space_info = block_group->space_info;
>> -       spin_lock(&space_info->lock);
>> -
>> -       full = space_info->full;
>> -
>> -       /*
>> -        * if this is the last block group we have in this space, we can't
>> -        * relocate it unless we're able to allocate a new chunk below.
>> -        *
>> -        * Otherwise, we need to make sure we have room in the space to handle
>> -        * all of the extents from this block group.  If we can, we're good
>> -        */
>> -       if ((space_info->total_bytes != block_group->key.offset) &&
>> -           (btrfs_space_info_used(space_info, false) + min_free <
>> -            space_info->total_bytes)) {
>> -               spin_unlock(&space_info->lock);
>> -               goto out;
>> -       }
>> -       spin_unlock(&space_info->lock);
>> -
>> -       /*
>> -        * ok we don't have enough space, but maybe we have free space on our
>> -        * devices to allocate new chunks for relocation, so loop through our
>> -        * alloc devices and guess if we have enough space.  if this block
>> -        * group is going to be restriped, run checks against the target
>> -        * profile instead of the current one.
>> -        */
>> -       ret = -1;
>> -
>> -       /*
>> -        * index:
>> -        *      0: raid10
>> -        *      1: raid1
>> -        *      2: dup
>> -        *      3: raid0
>> -        *      4: single
>> -        */
>> -       target = get_restripe_target(fs_info, block_group->flags);
>> -       if (target) {
>> -               index = btrfs_bg_flags_to_raid_index(extended_to_chunk(target));
>> -       } else {
>> -               /*
>> -                * this is just a balance, so if we were marked as full
>> -                * we know there is no space for a new chunk
>> -                */
>> -               if (full) {
>> -                       if (debug)
>> -                               btrfs_warn(fs_info,
>> -                                          "no space to alloc new chunk for block group %llu",
>> -                                          block_group->key.objectid);
>> -                       goto out;
>> -               }
>> -
>> -               index = btrfs_bg_flags_to_raid_index(block_group->flags);
>> -       }
>> -
>> -       if (index == BTRFS_RAID_RAID10) {
>> -               dev_min = 4;
>> -               /* Divide by 2 */
>> -               min_free >>= 1;
>> -       } else if (index == BTRFS_RAID_RAID1) {
>> -               dev_min = 2;
>> -       } else if (index == BTRFS_RAID_DUP) {
>> -               /* Multiply by 2 */
>> -               min_free <<= 1;
>> -       } else if (index == BTRFS_RAID_RAID0) {
>> -               dev_min = fs_devices->rw_devices;
>> -               min_free = div64_u64(min_free, dev_min);
>> -       }
>> -
>> -       mutex_lock(&fs_info->chunk_mutex);
>> -       list_for_each_entry(device, &fs_devices->alloc_list, dev_alloc_list) {
>> -               u64 dev_offset;
>> -
>> -               /*
>> -                * check to make sure we can actually find a chunk with enough
>> -                * space to fit our block group in.
>> -                */
>> -               if (device->total_bytes > device->bytes_used + min_free &&
>> -                   !test_bit(BTRFS_DEV_STATE_REPLACE_TGT, &device->dev_state)) {
>> -                       ret = find_free_dev_extent(device, min_free,
>> -                                                  &dev_offset, NULL);
>> -                       if (!ret)
>> -                               dev_nr++;
>> -
>> -                       if (dev_nr >= dev_min)
>> -                               break;
> 
> And here's a bug in that code. Before breaking out of the loop, ret
> should be set to 0.

I have looked at that and it's actually correct, because if
find_free_dev_extent() has returned successfully and dev_nr >= dev_min
is true, then we break out of the loop with ret = 0. Though this logic
is somewhat implicit and it took me some time to rationalize it; see
the annotated loop below.
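
A condensed view of that loop with the value of ret annotated (the
comments are mine, not part of the kernel source):

    list_for_each_entry(device, &fs_devices->alloc_list, dev_alloc_list) {
            if (device->total_bytes > device->bytes_used + min_free &&
                !test_bit(BTRFS_DEV_STATE_REPLACE_TGT, &device->dev_state)) {
                    ret = find_free_dev_extent(device, min_free,
                                               &dev_offset, NULL);
                    if (!ret)
                            dev_nr++;       /* only incremented when ret == 0 */

                    if (dev_nr >= dev_min)  /* can only become true right after
                                               a successful increment, so ret is
                                               already 0 when we break */
                            break;

                    ret = -1;               /* reset before the next device */
            }
    }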

> 
> In general I'm ok with the change, but would like an answer to those questions.
> 
> Thanks.
> 
Qu Wenruo July 18, 2019, 11:35 a.m. UTC | #4
On 2019/7/18 7:16 PM, Filipe Manana wrote:
> On Thu, Jul 18, 2019 at 6:50 AM Qu Wenruo <wqu@suse.com> wrote:
>>
>> [BUG]
>> The following script can easily cause unexpected ENOSPC:
>>   umount $dev &> /dev/null
>>   umount $mnt &> /dev/null
>>
>>   mkfs.btrfs -b 1G -m single -d single $dev -f > /dev/null
>>
>>   mount $dev $mnt -o enospc_debug
>>
>>   for i in $(seq -w 0 511); do
>>         xfs_io -f -c "pwrite 0 1m" $mnt/inline_$i > /dev/null
>>   done
>>   sync
>>
>>   btrfs balance start --full $mnt || return 1
>>   sync
>>
>>   # This will report -ENOSPC
>>   btrfs balance start --full $mnt || return 1
>>   umount $mnt
>>
>> Also, btrfs/156 can also fail due to ENOSPC.
> 
> Well, that script you pasted is btrfs/156 essentially.
> 
> When did the test started failing? When the test was added, it didn't
> fail, did it?

Bisected to commit 302167c50b32 ("btrfs: don't end the transaction for
delayed refs in throttle").

But that commit itself looks pretty valid to me.

And as described, it now fails in btrfs_can_relocate(), which is not
exactly where the test case was expected to fail.
(IIRC when the test case was submitted, the error happened in
inc_block_group_ro(), hence why I added some debug messages there.)

> 
>>
>> [CAUSE]
>> The ENOSPC is reported by btrfs_can_relocate().
>>
>> In btrfs_can_relocate(), it does the following check:
>> - If the block group is empty
>>   If empty, definitely we can relocate this block group.
>> - If we are not the only block group and we have enough space
>>   Then we can relocate this block group.
>>
>> Above two checks are completely OK, although I could argue they doesn't
>> make much sense, but the following check is vague and even sometimes
>> too cautious to cause ENOSPC:
>> - If we can allocate a new block group as large as current one.
>>   If we failed previous two checks, we must pass this to relocate this
>>   block group.
>>
>> There are several problems here:
>> 1. We don't need to allocate as large as the source block group.
>>    E.g. source block group is 1G sized, but only 1M used. We only need
>>    to allocated a data chunk larger than 1M to continue relocation.
> 
> Right. But where does btrfs_can_relocate() do such assumption?
> It only tries to check if there's enough space for an amount that
> corresponds to the amount used in the block group, that is, not the
> size of the block group (unless the block group is completely full).

You're right, my description here is wrong.

I'll remove this paragraph completely.

Thanks,
Qu

Qu Wenruo July 18, 2019, 12:48 p.m. UTC | #5
On 2019/7/18 7:18 PM, Nikolay Borisov wrote:
>
>
> On 18.07.19 г. 8:48 ч., Qu Wenruo wrote:
>> [BUG]
>> The following script can easily cause unexpected ENOSPC:
>>   umount $dev &> /dev/null
>>   umount $mnt &> /dev/null
>>
>>   mkfs.btrfs -b 1G -m single -d single $dev -f > /dev/null
>>
>>   mount $dev $mnt -o enospc_debug
>>
>>   for i in $(seq -w 0 511); do
>>   	xfs_io -f -c "pwrite 0 1m" $mnt/inline_$i > /dev/null
>>   done
>>   sync
>>
>>   btrfs balance start --full $mnt || return 1
>>   sync
>>
>>   # This will report -ENOSPC
>>   btrfs balance start --full $mnt || return 1
>>   umount $mnt
>>
>> Also, btrfs/156 can also fail due to ENOSPC.
>>
>> [CAUSE]
>> The ENOSPC is reported by btrfs_can_relocate().
>>
>> In btrfs_can_relocate(), it does the following check:
>> - If the block group is empty
>>   If empty, definitely we can relocate this block group.
>> - If we are not the only block group and we have enough space
>>   Then we can relocate this block group.
>>
>> Above two checks are completely OK, although I could argue they doesn't
>> make much sense, but the following check is vague and even sometimes
>> too cautious to cause ENOSPC:
>> - If we can allocate a new block group as large as current one.
>>   If we failed previous two checks, we must pass this to relocate this
>>   block group.
>
> btrfs_can_relocate chunk requires min_free to be allocatable.
> min_free is defined as the used space in the  block group being
> relocated, which I think is correct.

Yep, you and Filipe are completely right here.

I mischecked @min_free.

But compared to the check in inc_block_group_ro(), it doesn't account
for things like block group reserved space (allocated for delayed refs
but not yet committed to the extent tree), pinned space (commit-tree
blocks not used in the current transaction) and super bytes (used by
the super block).

So the check is still not comprehensive.

> Also I find the logic which
> adjusts min_free and dev_min to also be correct. Finally the function
> checks whether the device's freespace is fragmented by trying to find a
> device chunk with the appropriate size. The question is - can we really
> have a device that has enough free space, yet is fragmented such that
> find_free_dev_extent fails which results in failing the allocation? I
> think the answer is no since we allocate in chunk granularity. What am I missing?

In fact, there is one more problem hidden here.

At the time btrfs_can_relocate() calls find_free_dev_extent(),
find_free_dev_extent() is not working properly, as it always searches
the *commit* root of the dev tree.

With an extra tree dump, I found that at that time the commit root of
the dev tree has an extra dev extent which is not in the current root.
This is the root cause of the false ENOSPC.
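
For reference, the dev extent search sets up its path roughly like this
(paraphrased from memory, not an exact quote of the source):

    /* in find_free_dev_extent_start() */
    path->search_commit_root = 1;   /* walk the dev tree's commit root */
    path->skip_locking = 1;
    ...
    ret = btrfs_search_slot(NULL, fs_info->dev_root, &key, path, 0, 0);

So a dev extent freed earlier in the current transaction still looks
allocated until the transaction commits.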

I should explain the root cause and why calling btrfs_can_relocate() at
that point is not reliable.

Anyway, for the removal part, it still makes sense.

>
>
> OTOH, in btrfs_inc_block_group_ro we only allocate a chunk if:
>
> a) we are changing raid profiles
> b) if inc_block_group_ro fails for our block group.
>
> And for b) I'm a bit puzzled as to what the code is supposed to mean. We have:
>
> num_bytes = cache->key.offset - cache->reserved - cache->pinned -
>                       cache->bytes_super - btrfs_block_group_used(&cache->item);
>           sinfo_used = btrfs_space_info_used(sinfo, true);
>
>           if (sinfo_used + num_bytes + min_allocable_bytes <=
>               sinfo->total_bytes) {
> //set ro
>
> }
>
> This means if the free space in the block group + the used space in the
> space info is smaller than the total space in
> the space info - make this block group RO. What's the rationale behind that?

Oh, this should be a check similar to the one in btrfs_can_relocate(),
but I was confused by the complexity of the check and even considered
it correct after several strange calculations.

What we really want is:

  available space in other block groups >=
      used/pinned/reserved space in this block group + some buffer (buff)

In code it should be:

  sinfo->avail - bg->avail >= bg->used + buff

Adding bg->avail to both sides, we get:

  sinfo->avail >= bg->used + bg->avail + buff

And since bg->used + bg->avail = bg->total and
sinfo->avail = sinfo->total - sinfo->used, we end up with:

  sinfo->total >= sinfo->used + bg->total + buff

Compared to the current check:

  sinfo->total >= sinfo->used + bg->avail + buff

In fact the current one is much easier to meet than the correct one.
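
For illustration only (hypothetical numbers; ignoring pinned, reserved
and super bytes): take a space_info with two 1GiB data block groups,
each with 768MiB used, and balance one of them:

  sinfo->total = 2GiB, sinfo->used = 1536MiB
  bg->total = 1GiB, bg->used = 768MiB, bg->avail = 256MiB

  current check:  1536M + 256M  (bg->avail) + buff <= 2048M  -> passes
  derived check:  1536M + 1024M (bg->total) + buff <= 2048M  -> fails

So the current check can mark the block group RO even though the other
block group only has 256MiB free for the 768MiB that needs to be
relocated (assuming there is no room to allocate a new chunk).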

I need to double check the calculation.

Thanks for pointing this out,
Qu

Patch

diff --git a/fs/btrfs/ctree.h b/fs/btrfs/ctree.h
index 0a61dff27f57..965d1e5a4af7 100644
--- a/fs/btrfs/ctree.h
+++ b/fs/btrfs/ctree.h
@@ -2772,7 +2772,6 @@  int btrfs_setup_space_cache(struct btrfs_trans_handle *trans);
 int btrfs_extent_readonly(struct btrfs_fs_info *fs_info, u64 bytenr);
 int btrfs_free_block_groups(struct btrfs_fs_info *info);
 int btrfs_read_block_groups(struct btrfs_fs_info *info);
-int btrfs_can_relocate(struct btrfs_fs_info *fs_info, u64 bytenr);
 int btrfs_make_block_group(struct btrfs_trans_handle *trans,
 			   u64 bytes_used, u64 type, u64 chunk_offset,
 			   u64 size);
diff --git a/fs/btrfs/extent-tree.c b/fs/btrfs/extent-tree.c
index 5faf057f6f37..822a4102980d 100644
--- a/fs/btrfs/extent-tree.c
+++ b/fs/btrfs/extent-tree.c
@@ -9774,147 +9774,6 @@  void btrfs_dec_block_group_ro(struct btrfs_block_group_cache *cache)
 	spin_unlock(&sinfo->lock);
 }
 
-/*
- * Checks to see if it's even possible to relocate this block group.
- *
- * @return - -1 if it's not a good idea to relocate this block group, 0 if its
- * ok to go ahead and try.
- */
-int btrfs_can_relocate(struct btrfs_fs_info *fs_info, u64 bytenr)
-{
-	struct btrfs_block_group_cache *block_group;
-	struct btrfs_space_info *space_info;
-	struct btrfs_fs_devices *fs_devices = fs_info->fs_devices;
-	struct btrfs_device *device;
-	u64 min_free;
-	u64 dev_min = 1;
-	u64 dev_nr = 0;
-	u64 target;
-	int debug;
-	int index;
-	int full = 0;
-	int ret = 0;
-
-	debug = btrfs_test_opt(fs_info, ENOSPC_DEBUG);
-
-	block_group = btrfs_lookup_block_group(fs_info, bytenr);
-
-	/* odd, couldn't find the block group, leave it alone */
-	if (!block_group) {
-		if (debug)
-			btrfs_warn(fs_info,
-				   "can't find block group for bytenr %llu",
-				   bytenr);
-		return -1;
-	}
-
-	min_free = btrfs_block_group_used(&block_group->item);
-
-	/* no bytes used, we're good */
-	if (!min_free)
-		goto out;
-
-	space_info = block_group->space_info;
-	spin_lock(&space_info->lock);
-
-	full = space_info->full;
-
-	/*
-	 * if this is the last block group we have in this space, we can't
-	 * relocate it unless we're able to allocate a new chunk below.
-	 *
-	 * Otherwise, we need to make sure we have room in the space to handle
-	 * all of the extents from this block group.  If we can, we're good
-	 */
-	if ((space_info->total_bytes != block_group->key.offset) &&
-	    (btrfs_space_info_used(space_info, false) + min_free <
-	     space_info->total_bytes)) {
-		spin_unlock(&space_info->lock);
-		goto out;
-	}
-	spin_unlock(&space_info->lock);
-
-	/*
-	 * ok we don't have enough space, but maybe we have free space on our
-	 * devices to allocate new chunks for relocation, so loop through our
-	 * alloc devices and guess if we have enough space.  if this block
-	 * group is going to be restriped, run checks against the target
-	 * profile instead of the current one.
-	 */
-	ret = -1;
-
-	/*
-	 * index:
-	 *      0: raid10
-	 *      1: raid1
-	 *      2: dup
-	 *      3: raid0
-	 *      4: single
-	 */
-	target = get_restripe_target(fs_info, block_group->flags);
-	if (target) {
-		index = btrfs_bg_flags_to_raid_index(extended_to_chunk(target));
-	} else {
-		/*
-		 * this is just a balance, so if we were marked as full
-		 * we know there is no space for a new chunk
-		 */
-		if (full) {
-			if (debug)
-				btrfs_warn(fs_info,
-					   "no space to alloc new chunk for block group %llu",
-					   block_group->key.objectid);
-			goto out;
-		}
-
-		index = btrfs_bg_flags_to_raid_index(block_group->flags);
-	}
-
-	if (index == BTRFS_RAID_RAID10) {
-		dev_min = 4;
-		/* Divide by 2 */
-		min_free >>= 1;
-	} else if (index == BTRFS_RAID_RAID1) {
-		dev_min = 2;
-	} else if (index == BTRFS_RAID_DUP) {
-		/* Multiply by 2 */
-		min_free <<= 1;
-	} else if (index == BTRFS_RAID_RAID0) {
-		dev_min = fs_devices->rw_devices;
-		min_free = div64_u64(min_free, dev_min);
-	}
-
-	mutex_lock(&fs_info->chunk_mutex);
-	list_for_each_entry(device, &fs_devices->alloc_list, dev_alloc_list) {
-		u64 dev_offset;
-
-		/*
-		 * check to make sure we can actually find a chunk with enough
-		 * space to fit our block group in.
-		 */
-		if (device->total_bytes > device->bytes_used + min_free &&
-		    !test_bit(BTRFS_DEV_STATE_REPLACE_TGT, &device->dev_state)) {
-			ret = find_free_dev_extent(device, min_free,
-						   &dev_offset, NULL);
-			if (!ret)
-				dev_nr++;
-
-			if (dev_nr >= dev_min)
-				break;
-
-			ret = -1;
-		}
-	}
-	if (debug && ret == -1)
-		btrfs_warn(fs_info,
-			   "no space to allocate a new chunk for block group %llu",
-			   block_group->key.objectid);
-	mutex_unlock(&fs_info->chunk_mutex);
-out:
-	btrfs_put_block_group(block_group);
-	return ret;
-}
-
 static int find_first_block_group(struct btrfs_fs_info *fs_info,
 				  struct btrfs_path *path,
 				  struct btrfs_key *key)
diff --git a/fs/btrfs/volumes.c b/fs/btrfs/volumes.c
index 1c2a6e4b39da..f209127a8bc6 100644
--- a/fs/btrfs/volumes.c
+++ b/fs/btrfs/volumes.c
@@ -3071,10 +3071,6 @@  static int btrfs_relocate_chunk(struct btrfs_fs_info *fs_info, u64 chunk_offset)
 	 */
 	lockdep_assert_held(&fs_info->delete_unused_bgs_mutex);
 
-	ret = btrfs_can_relocate(fs_info, chunk_offset);
-	if (ret)
-		return -ENOSPC;
-
 	/* step one, relocate all the extents inside this chunk */
 	btrfs_scrub_pause(fs_info);
 	ret = btrfs_relocate_block_group(fs_info, chunk_offset);