[v2,md-6.14,5/5] md/md-bitmap: move bitmap_{start, end}write to md upper layer

From: Yu Kuai <yukuai3@huawei.com>

From: Yu Kuai <yukuai3@huawei.com>

There are two BUG reports that raid5 will hang at
bitmap_startwrite([1],[2]), root cause is that bitmap start write and end
write is unbalanced. For example, handle_stripe_clean_event() doesn't
check if stripe->dev[].towrite is NULL after tag 'returnbi', and extra
bitmap_endwrite() will be called.

While reviewing raid5 code, it's found that bitmap operations can be
optimized. For example, for a 4 disks raid5, with chunksize=8k, if user
issue a IO (0 + 48k) to the array:

┌────────────────────────────────────────────────────────────┐
│chunk 0                                                     │
│      ┌────────────┬─────────────┬─────────────┬────────────┼
│  sh0 │A0: 0 + 4k  │A1: 8k + 4k  │A2: 16k + 4k │A3: P       │
│      ┼────────────┼─────────────┼─────────────┼────────────┼
│  sh1 │B0: 4k + 4k │B1: 12k + 4k │B2: 20k + 4k │B3: P       │
┼──────┴────────────┴─────────────┴─────────────┴────────────┼
│chunk 1                                                     │
│      ┌────────────┬─────────────┬─────────────┬────────────┤
│  sh2 │C0: 24k + 4k│C1: 32k + 4k │C2: P        │C3: 40k + 4k│
│      ┼────────────┼─────────────┼─────────────┼────────────┼
│  sh3 │D0: 28k + 4k│D1: 36k + 4k │D2: P        │D3: 44k + 4k│
└──────┴────────────┴─────────────┴─────────────┴────────────┘

Before this patch, 4 stripe head will be used, and each sh will attach
bio for 3 disks, and each attached bio will trigger
bitmap_startwrite() once, which means total 12 times.
 - 3 times (0 + 4k), for (A0, A1 and A2)
 - 3 times (4 + 4k), for (B0, B1 and B2)
 - 3 times (8 + 4k), for (C0, C1 and C3)
 - 3 times (12 + 4k), for (D0, D1 and D3)

After this patch, md upper layer will calculate that IO range (0 + 48k)
is corresponding to the bitmap (0 + 16k), and call bitmap_startwrite()
just once.

Noted that this patch will align bitmap ranges to the chunks, for example,
if user issue a IO (0 + 4k) to array:

- Before this patch, 1 time (0 + 4k), for A0;
- After this patch, 1 time (0 + 8k) for chunk 0;

Usually, one bitmap bit will represent more than one disk chunk, and this
doesn't have any difference. And even if user really created a array
that one chunk contain multiple bits, the overhead is that more data
will be recovered after power failure.

[1] https://lore.kernel.org/all/CAJpMwyjmHQLvm6zg1cmQErttNNQPDAAXPKM3xgTjMhbfts986Q@mail.gmail.com/
[2] https://lore.kernel.org/all/ADF7D720-5764-4AF3-B68E-1845988737AA@flyingcircus.io/
Signed-off-by: Yu Kuai <yukuai3@huawei.com>
Signed-off-by: Yu Kuai <yukuai@kernel.org>
---
 drivers/md/md.c          | 29 +++++++++++++++++++++++++++++
 drivers/md/md.h          |  2 ++
 drivers/md/raid1.c       |  4 ----
 drivers/md/raid10.c      |  3 ---
 drivers/md/raid5-cache.c |  2 --
 drivers/md/raid5.c       | 24 +-----------------------
 6 files changed, 32 insertions(+), 32 deletions(-)

Message ID	20241218121745.2459-6-yukuai@kernel.org (mailing list archive)
State	New
Headers	show Received: from smtp.kernel.org (aws-us-west-2-korg-mail-1.web.codeaurora.org [10.30.226.201]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 791801B4234; Wed, 18 Dec 2024 12:20:45 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=10.30.226.201 ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1734524445; cv=none; b=JkBM0y2Uq6okYTg4PM86feTZEInupz4Hh+xf9d/T6cWy8ocK1vU/dzCRSTATFMgWT6LgGef8WAtBJGWs6A9bFBo+FLUpgdCposRZPeW1Mu4IH2kBJILLZsI/qpK5wuWTCnoT5K0/zEyWZFhpN7GKnhbOUqd72T3CL7YuuXzAytY= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1734524445; c=relaxed/simple; bh=6KeQ2vIfFnuG7hGe0UBhR5whLnz0miFsENwyK2Q7YIE=; h=From:To:Cc:Subject:Date:Message-ID:In-Reply-To:References: MIME-Version:Content-Type; b=ChBGHRiXgOvyBlOr5RPyXnt2AJbcUR0MTHy7STzlTPob4M3Rp4fMJItHNDOOa09h80XRcPzIsgxX/rOs64zw2NjDIetMtdz3mM+TZ9sGr8aB1CID3jFgvXzOIgRB7l47T5FmKQp4blNXvyOmfRH4VLsn9fcrghZ7eBe9lxJemcM= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel.org header.i=@kernel.org header.b=iGCi0z5b; arc=none smtp.client-ip=10.30.226.201 Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel.org header.i=@kernel.org header.b="iGCi0z5b" Received: by smtp.kernel.org (Postfix) with ESMTPSA id 1F2BCC4CECE; Wed, 18 Dec 2024 12:20:41 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1734524445; bh=6KeQ2vIfFnuG7hGe0UBhR5whLnz0miFsENwyK2Q7YIE=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=iGCi0z5bxOAt8eWuJ7aFkgRcuw1It8d9uTd0AHoy+0Qw8iaz5AxNyR+6lMUMOJtl5 G2WxTky2dRyx/ho4IRSpaTP7GpfFJuxH3/Gug0ZooeeMtUJZ53JTZ/MeDUIAJ876Ww 152rK8+nVGbcHCotOtrFb5S2MaE9yOnAF8VnRd/hf9a82PE6+jv0CNM1rv216+7Vo6 nuDnGpWkzfKJzMgHA2TXZEi0X1YaG55AAp9+N9hAfW++HHhQfm64nqRgLznR6IX1hg vbMuY448BqpbO/O7NKRVcn4sdWueJJFk7TScB7oaN0j5qAVdxummpkX782i1i4b6/o 1ZH8niaWLtm3w== From: yukuai@kernel.org To: song@kernel.org, yukuai3@huawei.com Cc: linux-raid@vger.kernel.org, linux-kernel@vger.kernel.org, yi.zhang@hauwei.com, yangerkun@huawei.com, Yu Kuai <yukuai@kernel.org> Subject: [PATCH v2 md-6.14 5/5] md/md-bitmap: move bitmap_{start, end}write to md upper layer Date: Wed, 18 Dec 2024 20:17:45 +0800 Message-ID: <20241218121745.2459-6-yukuai@kernel.org> X-Mailer: git-send-email 2.43.0 In-Reply-To: <20241218121745.2459-1-yukuai@kernel.org> References: <20241218121745.2459-1-yukuai@kernel.org> Precedence: bulk X-Mailing-List: linux-raid@vger.kernel.org List-Id: <linux-raid.vger.kernel.org> List-Subscribe: <mailto:linux-raid+subscribe@vger.kernel.org> List-Unsubscribe: <mailto:linux-raid+unsubscribe@vger.kernel.org> MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit
Series	md/md-bitmap: move bitmap_{start, end}write to md upper layer \| expand [v2,md-6.14,0/5] md/md-bitmap: move bitmap_{start, end}write to md upper layer [v2,md-6.14,1/5] md/md-bitmap: factor behind write counters out from bitmap_{start/end}write() [v2,md-6.14,2/5] md/md-bitmap: remove the last parameter for bimtap_ops->endwrite() [v2,md-6.14,3/5] md: add a new callback pers->bitmap_sector() [v2,md-6.14,4/5] md/raid5: implement pers->bitmap_sector() [v2,md-6.14,5/5] md/md-bitmap: move bitmap_{start, end}write to md upper layer

[v2,md-6.14,5/5] md/md-bitmap: move bitmap_{start, end}write to md upper layer

Commit Message

Patch