[v2,2/2] fs: handle freezing from multiple devices

Before [1] freezing a filesystems through the block layer only worked
for the main block device as the owning superblock of additional block
devices could not be found. Any filesystem that made use of multiple
block devices would only be freezable via it's main block device.

For example, consider xfs over device mapper with /dev/dm-0 as main
block device and /dev/dm-1 as external log device. Two freeze requests
before [1]:

(1) dmsetup suspend /dev/dm-0 on the main block device

    bdev_freeze(dm-0)
    -> dm-0->bd_fsfreeze_count++
    -> freeze_super(xfs-sb)

    The owning superblock is found and the filesystem gets frozen.
    Returns 0.

(2) dmsetup suspend /dev/dm-1 on the log device

    bdev_freeze(dm-1)
    -> dm-1->bd_fsfreeze_count++

    The owning superblock isn't found and only the block device freeze
    count is incremented. Returns 0.

Two freeze requests after [1]:

(1') dmsetup suspend /dev/dm-0 on the main block device

    bdev_freeze(dm-0)
    -> dm-0->bd_fsfreeze_count++
    -> freeze_super(xfs-sb)

    The owning superblock is found and the filesystem gets frozen.
    Returns 0.

(2') dmsetup suspend /dev/dm-1 on the log device

    bdev_freeze(dm-0)
    -> dm-0->bd_fsfreeze_count++
    -> freeze_super(xfs-sb)

    The owning superblock is found and the filesystem gets frozen.
    Returns -EBUSY.

When (2') is called we initiate a freeze from another block device of
the same superblock. So we increment the bd_fsfreeze_count for that
additional block device. But we now also find the owning superblock for
additional block devices and call freeze_super() again which reports
-EBUSY.

This can be reproduced through xfstests via:

    mkfs.xfs -f -m crc=1,reflink=1,rmapbt=1, -i sparse=1 -lsize=1g,logdev=/dev/nvme1n1p4 /dev/nvme1n1p3
    mkfs.xfs -f -m crc=1,reflink=1,rmapbt=1, -i sparse=1 -lsize=1g,logdev=/dev/nvme1n1p6 /dev/nvme1n1p5

    FSTYP=xfs
    export TEST_DEV=/dev/nvme1n1p3
    export TEST_DIR=/mnt/test
    export TEST_LOGDEV=/dev/nvme1n1p4
    export SCRATCH_DEV=/dev/nvme1n1p5
    export SCRATCH_MNT=/mnt/scratch
    export SCRATCH_LOGDEV=/dev/nvme1n1p6
    export USE_EXTERNAL=yes

    sudo ./check generic/311

Current semantics allow two concurrent freezers: one initiated from
userspace via FREEZE_HOLDER_USERSPACE and one initiated from the kernel
via FREEZE_HOLDER_KERNEL. If there are multiple concurrent freeze
requests from either FREEZE_HOLDER_USERSPACE or FREEZE_HOLDER_KERNEL
-EBUSY is returned.

We need to preserve these semantics because as they are uapi via
FIFREEZE and FITHAW ioctl()s. IOW, freezes don't nest for FIFREEZE and
FITHAW. Other kernels consumers rely on non-nesting freezes as well.

With freezes initiated from the block layer freezes need to nest if the
same superblock is frozen via multiple devices. So we need to start
counting the number of freeze requests.

If FREEZE_HOLDER_BDEV is passed alongside FREEZE_HOLDER_KERNEL or
FREEZE_HOLDER_USERSPACE we allow the caller to nest freeze calls.

To accommodate the old semantics we split the freeze counter into two
counting kernel initiated and userspace initiated freezes separately. We
can then also stop recording FREEZE_HOLDER_* in struct sb_writers.

We also simplify freezing by making all concurrent freezers share a
single active superblock reference count instead of having separate
references for kernel and userspace. I don't see why we would need two
active reference counts. Neither FREEZE_HOLDER_KERNEL nor
FREEZE_HOLDER_USERSPACE can put the active reference as long as they are
concurrent freezers anwyay. That was already true before we allowed
nesting freezes.

Survives various fstests runs with different options including the
reproducer, online scrub, and online repair, fsfreze, and so on. Also
survives blktests.

Reported-by: Chandan Babu R <chandanbabu@kernel.org>
Link: https://lore.kernel.org/linux-block/87bkccnwxc.fsf@debian-BULLSEYE-live-builder-AMD64
Fixes: [1]: bfac4176f2c4 ("bdev: implement freeze and thaw holder operations") # no backport needed
Signed-off-by: Christian Brauner <brauner@kernel.org>
---
 fs/super.c         | 135 ++++++++++++++++++++++++++++++++++++++++++-----------
 include/linux/fs.h |  17 ++++++-
 2 files changed, 124 insertions(+), 28 deletions(-)

Message ID	20231104-vfs-multi-device-freeze-v2-2-5b5b69626eac@kernel.org (mailing list archive)
State	New, archived
Headers	show Received: from smtp.kernel.org (aws-us-west-2-korg-mail-1.web.codeaurora.org [10.30.226.201]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 3355A1A58A for <linux-fsdevel@vger.kernel.org>; Sat, 4 Nov 2023 14:00:40 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel.org header.i=@kernel.org header.b="u2JkiDHf" Received: by smtp.kernel.org (Postfix) with ESMTPSA id 3867EC433C8; Sat, 4 Nov 2023 14:00:37 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1699106440; bh=OfexyeS3odpnq+o7s7SxZag42Y4SNUvcLgg8dkf19bI=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=u2JkiDHfGzmjb6jTCW6uo6sXJnBF7JC7NgCB9OxxtDnFoJIHnqM0Pb7HIq4dH+5PI wzMnLsEbNFA1b8guP/a/VMGJIenTrWHba0LcGVLNhd8i+zrjg3kdOQDxhEFVlesq8r +P1UiWNL9JzdG1rpkNQGpGNryHnzaU6xd/w+Aoz62v7PeCJfHl0jfWgSwV2jhBdGrm YdFUvaA0LlFaJJGDDnw7HpUBy3nh0YyrfueLFwcWwRmvhsRBvg+frmL9kDG4MEwIAa DRTwA75mU2tlRDWbutaX4KkJ49IRzydUJBVLlLpUh01KrRKmo62NUAZRajtSqeJH+t YMcMbCQexDnIQ== From: Christian Brauner <brauner@kernel.org> To: Dave Chinner <dchinner@redhat.com>, Christoph Hellwig <hch@lst.de>, Jan Kara <jack@suse.cz>, "Darrick J. Wong" <djwong@kernel.org> Cc: Christian Brauner <brauner@kernel.org>, linux-fsdevel@vger.kernel.org, Chandan Babu R <chandanbabu@kernel.org> Subject: [PATCH v2 2/2] fs: handle freezing from multiple devices Date: Sat, 4 Nov 2023 15:00:13 +0100 Message-Id: <20231104-vfs-multi-device-freeze-v2-2-5b5b69626eac@kernel.org> X-Mailer: git-send-email 2.34.1 In-Reply-To: <87bkccnwxc.fsf@debian-BULLSEYE-live-builder-AMD64> References: <20231104-vfs-multi-device-freeze-v2-0-5b5b69626eac@kernel.org> Precedence: bulk X-Mailing-List: linux-fsdevel@vger.kernel.org List-Id: <linux-fsdevel.vger.kernel.org> List-Subscribe: <mailto:linux-fsdevel+subscribe@vger.kernel.org> List-Unsubscribe: <mailto:linux-fsdevel+unsubscribe@vger.kernel.org> MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" X-Mailer: b4 0.13-dev-26615 X-Developer-Signature: v=1; a=openpgp-sha256; l=15220; i=brauner@kernel.org; h=from:subject:message-id; bh=OfexyeS3odpnq+o7s7SxZag42Y4SNUvcLgg8dkf19bI=; b=owGbwMvMwCU28Zj0gdSKO4sYT6slMaS6+aUG2iwtzZ6vNKHbz6njkYzcYoPfH6ee+nvZYOe3nv7g a98md5SyMIhxMciKKbI4tJuEyy3nqdhslKkBM4eVCWQIAxenAEzEKoaR4UXo8z2x4ZtOrvpSxysd/1 nb5qHB4q3bX20UusW/8NHTCnWG/3mO70sfeHBrng600VR6378l8MdMx8d8bkuKDZ6Krzh3mBEA X-Developer-Key: i=brauner@kernel.org; a=openpgp; fpr=4880B8C9BD0E5106FC070F4F7B3C391EFEA93624 Content-Transfer-Encoding: 8bit
Series	fs: handle freezing from multiple devices \| expand fs: handle freezing from multiple devices [v2,2/2] fs: handle freezing from multiple devices

[v2,2/2] fs: handle freezing from multiple devices

Commit Message

Comments

Patch