[PATCHSET,0/3] passthru block optimizations

Message ID	20220806152004.382170-1-axboe@kernel.dk (mailing list archive)
Headers	show Return-Path: <linux-block-owner@kernel.org> From: Jens Axboe <axboe@kernel.dk> To: linux-block@vger.kernel.org Cc: joshi.k@samsung.com, kbusch@kernel.org Subject: [PATCHSET 0/3] passthru block optimizations Date: Sat, 6 Aug 2022 09:20:01 -0600 Message-Id: <20220806152004.382170-1-axboe@kernel.dk> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Precedence: bulk
Series	passthru block optimizations \| expand [PATCHSET,0/3] passthru block optimizations [1/3] block: shrink rq_map_data a bit [2/3] block: enable bio caching use for passthru IO [3/3] block: use on-stack page vec for <= UIO_FASTIOV

Message ID

20220806152004.382170-1-axboe@kernel.dk (mailing list archive)

Headers

From: Jens Axboe <axboe@kernel.dk>
To: linux-block@vger.kernel.org
Cc: joshi.k@samsung.com, kbusch@kernel.org
Subject: [PATCHSET 0/3] passthru block optimizations
Date: Sat,  6 Aug 2022 09:20:01 -0600
Message-Id: <20220806152004.382170-1-axboe@kernel.dk>
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
Precedence: bulk

Series

passthru block optimizations | expand

Message

Jens Axboe Aug. 6, 2022, 3:20 p.m. UTC

Hi,

Currently passthru IO is slower than bdev O_DIRECT. One of the reasons
is that we do two allocations for each IO:

- One alloc+free for the page array for mapping the data
- One alloc+free of the bio

Let passthru IO dip into the bio cache to eliminate that one, and use
UIO_FASTIOV to gate whether we need to alloc+free the page array for
mapping purposes.

This closes about half of the gap between passthru and bdev dio for me.
If we can sanely wire up completion batching for passthru, then that
would almost fully close the gap. Outside of that, the main missing
feature for passthru is the ability to use registered buffers with
io_uring, as the per-io get_user_pages() is a large cycle consumer as
well.