From patchwork Sat Dec 7 00:04:08 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: "Darrick J. Wong" X-Patchwork-Id: 13898090 Received: from smtp.kernel.org (aws-us-west-2-korg-mail-1.web.codeaurora.org [10.30.226.201]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id F2E4A139B; Sat, 7 Dec 2024 00:04:08 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=10.30.226.201 ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1733529849; cv=none; b=YccOfWxVbZRDZRLf77gJOZuZxSNHDSUeuDOCQ9kRHyB+s3h6C91QkSnAqptHV2wZ5a/SHs/KGMVfWDjKDFFELQ5NT3a/dTQDUldhv+pXRVT/hC/M6qy1NX6a3M4Eng+x0dkyBdiqmQAIF5B1sD2BNe30o5L5i99wHWmWrPxfLrg= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1733529849; c=relaxed/simple; bh=XcSmG1DDmIoBwBMa7DfIzTntQFdzableBCgb8sgrhc0=; h=Date:Subject:From:To:Cc:Message-ID:In-Reply-To:References: MIME-Version:Content-Type; b=S/5rfXEG25O7kagrxhfWdCsXzYt1Ci83fbb6/m8T1DAKXwC8NRenLkXuXRk/drsOCotXecTV+x3z5/TbwTPl3shPeDr5XQ5aY+Q6rOgWlAr94uKeFbNwF7RCCvgA6Tg0fDvO2+9OJeDPE06L7JxF4vCJezpbb1GYMPo+oM27u7g= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel.org header.i=@kernel.org header.b=YnT+O1g4; arc=none smtp.client-ip=10.30.226.201 Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel.org header.i=@kernel.org header.b="YnT+O1g4" Received: by smtp.kernel.org (Postfix) with ESMTPSA id C8516C4CED1; Sat, 7 Dec 2024 00:04:08 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1733529848; bh=XcSmG1DDmIoBwBMa7DfIzTntQFdzableBCgb8sgrhc0=; h=Date:Subject:From:To:Cc:In-Reply-To:References:From; b=YnT+O1g4snN2boy/tQWXqEhrETMlGg1xdpBFpM6cdFETCkrn1h1niYYezSQWcMT2L Y5aO/SkvRI3KQ5xreo+mRegLsqgQ9BIFlPLYcNEF+DqUnRHFTctdY7GCeMg/URDBUL fLbRWKHbN73Ve8Ri12trwHVMp2/SsnNf6DIwvZW2lVFHx9eslGLFVXtFavfNOHdOyJ cVfxtlSAaPcMz9E0TfAiD6F6ctEhAfuY2e3e0KanHTZKfJ/3wLPrO/2ZAMNjqnmK9t QNKV8th3eWy4oK0IiQFzqm9xUA9fmMZbe3Livmt8LpfycX4P0vzDskhyZUnhfKFz04 Jnem5gs4fUiLw== Date: Fri, 06 Dec 2024 16:04:08 -0800 Subject: [PATCH 3/6] xfs: update btree keys correctly when _insrec splits an inode root block From: "Darrick J. Wong" To: aalbersh@kernel.org, djwong@kernel.org Cc: stable@vger.kernel.org, hch@lst.de, hch@lst.de, linux-xfs@vger.kernel.org Message-ID: <173352751244.126106.16426037647903930309.stgit@frogsfrogsfrogs> In-Reply-To: <173352751190.126106.5258055486306925523.stgit@frogsfrogsfrogs> References: <173352751190.126106.5258055486306925523.stgit@frogsfrogsfrogs> Precedence: bulk X-Mailing-List: linux-xfs@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 From: Darrick J. Wong In commit 2c813ad66a72, I partially fixed a bug wherein xfs_btree_insrec would erroneously try to update the parent's key for a block that had been split if we decided to insert the new record into the new block. The solution was to detect this situation and update the in-core key value that we pass up to the caller so that the caller will (eventually) add the new block to the parent level of the tree with the correct key. However, I missed a subtlety about the way inode-rooted btrees work. If the full block was a maximally sized inode root block, we'll solve that fullness by moving the root block's records to a new block, resizing the root block, and updating the root to point to the new block. We don't pass a pointer to the new block to the caller because that work has already been done. The new record will /always/ land in the new block, so in this case we need to use xfs_btree_update_keys to update the keys. This bug can theoretically manifest itself in the very rare case that we split a bmbt root block and the new record lands in the very first slot of the new block, though I've never managed to trigger it in practice. However, it is very easy to reproduce by running generic/522 with the realtime rmapbt patchset if rtinherit=1. Cc: # v4.8 Fixes: 2c813ad66a7218 ("xfs: support btrees with overlapping intervals for keys") Signed-off-by: "Darrick J. Wong" Reviewed-by: Christoph Hellwig --- libxfs/xfs_btree.c | 29 +++++++++++++++++++++++------ 1 file changed, 23 insertions(+), 6 deletions(-) diff --git a/libxfs/xfs_btree.c b/libxfs/xfs_btree.c index 5c293ccf623336..f4c4db62e2069e 100644 --- a/libxfs/xfs_btree.c +++ b/libxfs/xfs_btree.c @@ -3555,14 +3555,31 @@ xfs_btree_insrec( xfs_btree_log_block(cur, bp, XFS_BB_NUMRECS); /* - * If we just inserted into a new tree block, we have to - * recalculate nkey here because nkey is out of date. + * Update btree keys to reflect the newly added record or keyptr. + * There are three cases here to be aware of. Normally, all we have to + * do is walk towards the root, updating keys as necessary. * - * Otherwise we're just updating an existing block (having shoved - * some records into the new tree block), so use the regular key - * update mechanism. + * If the caller had us target a full block for the insertion, we dealt + * with that by calling the _make_block_unfull function. If the + * "make unfull" function splits the block, it'll hand us back the key + * and pointer of the new block. We haven't yet added the new block to + * the next level up, so if we decide to add the new record to the new + * block (bp->b_bn != old_bn), we have to update the caller's pointer + * so that the caller adds the new block with the correct key. + * + * However, there is a third possibility-- if the selected block is the + * root block of an inode-rooted btree and cannot be expanded further, + * the "make unfull" function moves the root block contents to a new + * block and updates the root block to point to the new block. In this + * case, no block pointer is passed back because the block has already + * been added to the btree. In this case, we need to use the regular + * key update function, just like the first case. This is critical for + * overlapping btrees, because the high key must be updated to reflect + * the entire tree, not just the subtree accessible through the first + * child of the root (which is now two levels down from the root). */ - if (bp && xfs_buf_daddr(bp) != old_bn) { + if (!xfs_btree_ptr_is_null(cur, &nptr) && + bp && xfs_buf_daddr(bp) != old_bn) { xfs_btree_get_keys(cur, block, lkey); } else if (xfs_btree_needs_key_update(cur, optr)) { error = xfs_btree_update_keys(cur, level);