diff mbox series

block: move .bd_inode into 1st cacheline of block_device

Message ID 20231121101156.378105-1-ming.lei@redhat.com (mailing list archive)
State New, archived
Headers show
Series block: move .bd_inode into 1st cacheline of block_device | expand

Commit Message

Ming Lei Nov. 21, 2023, 10:11 a.m. UTC
The .bd_inode field of block_device is used in IO fast path of
blkdev_write_iter() and blkdev_llseek(), so it is more efficient to keep
it into the 1st cacheline.

.bd_openers is only touched in open()/close(), and .bd_size_lock is only
for updating bdev capacity, which is in slow path too.

So swap .bd_inode layout with .bd_openers & .bd_size_lock to move
.bd_inode into the 1st cache line.

Cc: Yu Kuai <yukuai3@huawei.com>
Signed-off-by: Ming Lei <ming.lei@redhat.com>
---
 include/linux/blk_types.h | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

Comments

Yu Kuai Nov. 21, 2023, 11:12 a.m. UTC | #1
Hi,

在 2023/11/21 18:11, Ming Lei 写道:
> The .bd_inode field of block_device is used in IO fast path of
> blkdev_write_iter() and blkdev_llseek(), so it is more efficient to keep
> it into the 1st cacheline.
> 
> .bd_openers is only touched in open()/close(), and .bd_size_lock is only
> for updating bdev capacity, which is in slow path too.
> 
> So swap .bd_inode layout with .bd_openers & .bd_size_lock to move
> .bd_inode into the 1st cache line.

This patch looks good, do you want me do take it for a v3 for the
other patchset?

And by the way, can we also move 'int bd_writers' to near 'atomic_t
bd_fsfreeze_count' to save 8 bytes(int 64bit platform)?

diff --git a/include/linux/blk_types.h b/include/linux/blk_types.h
index 07abd0165226..a47ab9249bdd 100644
--- a/include/linux/blk_types.h
+++ b/include/linux/blk_types.h
@@ -63,11 +63,11 @@ struct block_device {
         int                     bd_holders;
         struct kobject          *bd_holder_dir;

+       int                     bd_writers;
         atomic_t                bd_fsfreeze_count; /* number of freeze 
requests */
         struct mutex            bd_fsfreeze_mutex; /* serialize 
freeze/thaw */

         struct partition_meta_info *bd_meta_info;
-       int                     bd_writers;

Thanks,
Kuai
> 
> Cc: Yu Kuai <yukuai3@huawei.com>
> Signed-off-by: Ming Lei <ming.lei@redhat.com>
> ---
>   include/linux/blk_types.h | 3 ++-
>   1 file changed, 2 insertions(+), 1 deletion(-)
> 
> diff --git a/include/linux/blk_types.h b/include/linux/blk_types.h
> index d5c5e59ddbd2..f7d40692dd94 100644
> --- a/include/linux/blk_types.h
> +++ b/include/linux/blk_types.h
> @@ -49,9 +49,10 @@ struct block_device {
>   	bool			bd_write_holder;
>   	bool			bd_has_submit_bio;
>   	dev_t			bd_dev;
> +	struct inode		*bd_inode;	/* will die */
> +
>   	atomic_t		bd_openers;
>   	spinlock_t		bd_size_lock; /* for bd_inode->i_size updates */
> -	struct inode *		bd_inode;	/* will die */
>   	void *			bd_claiming;
>   	void *			bd_holder;
>   	const struct blk_holder_ops *bd_holder_ops;
>
Ming Lei Nov. 21, 2023, 11:21 a.m. UTC | #2
On Tue, Nov 21, 2023 at 07:12:44PM +0800, Yu Kuai wrote:
> Hi,
> 
> 在 2023/11/21 18:11, Ming Lei 写道:
> > The .bd_inode field of block_device is used in IO fast path of
> > blkdev_write_iter() and blkdev_llseek(), so it is more efficient to keep
> > it into the 1st cacheline.
> > 
> > .bd_openers is only touched in open()/close(), and .bd_size_lock is only
> > for updating bdev capacity, which is in slow path too.
> > 
> > So swap .bd_inode layout with .bd_openers & .bd_size_lock to move
> > .bd_inode into the 1st cache line.
> 
> This patch looks good, do you want me do take it for a v3 for the
> other patchset?

Yeah, please take it.

> 
> And by the way, can we also move 'int bd_writers' to near 'atomic_t
> bd_fsfreeze_count' to save 8 bytes(int 64bit platform)?
> 
> diff --git a/include/linux/blk_types.h b/include/linux/blk_types.h
> index 07abd0165226..a47ab9249bdd 100644
> --- a/include/linux/blk_types.h
> +++ b/include/linux/blk_types.h
> @@ -63,11 +63,11 @@ struct block_device {
>         int                     bd_holders;
>         struct kobject          *bd_holder_dir;
> 
> +       int                     bd_writers;
>         atomic_t                bd_fsfreeze_count; /* number of freeze
> requests */
>         struct mutex            bd_fsfreeze_mutex; /* serialize freeze/thaw
> */
> 
>         struct partition_meta_info *bd_meta_info;
> -       int                     bd_writers;

Which tree are you talking about? I don't see 'bd_writers' in both
linus tree and block-6.7, and for-6.8/block isn't open yet.

Thanks,
Ming
Yu Kuai Nov. 21, 2023, 11:36 a.m. UTC | #3
Hi,

在 2023/11/21 19:21, Ming Lei 写道:
> On Tue, Nov 21, 2023 at 07:12:44PM +0800, Yu Kuai wrote:
>> Hi,
>>
>> 在 2023/11/21 18:11, Ming Lei 写道:
>>> The .bd_inode field of block_device is used in IO fast path of
>>> blkdev_write_iter() and blkdev_llseek(), so it is more efficient to keep
>>> it into the 1st cacheline.
>>>
>>> .bd_openers is only touched in open()/close(), and .bd_size_lock is only
>>> for updating bdev capacity, which is in slow path too.
>>>
>>> So swap .bd_inode layout with .bd_openers & .bd_size_lock to move
>>> .bd_inode into the 1st cache line.
>>
>> This patch looks good, do you want me do take it for a v3 for the
>> other patchset?
> 
> Yeah, please take it.

Ok
> 
>>
>> And by the way, can we also move 'int bd_writers' to near 'atomic_t
>> bd_fsfreeze_count' to save 8 bytes(int 64bit platform)?
>>
>> diff --git a/include/linux/blk_types.h b/include/linux/blk_types.h
>> index 07abd0165226..a47ab9249bdd 100644
>> --- a/include/linux/blk_types.h
>> +++ b/include/linux/blk_types.h
>> @@ -63,11 +63,11 @@ struct block_device {
>>          int                     bd_holders;
>>          struct kobject          *bd_holder_dir;
>>
>> +       int                     bd_writers;
>>          atomic_t                bd_fsfreeze_count; /* number of freeze
>> requests */
>>          struct mutex            bd_fsfreeze_mutex; /* serialize freeze/thaw
>> */
>>
>>          struct partition_meta_info *bd_meta_info;
>> -       int                     bd_writers;
> 
> Which tree are you talking about? I don't see 'bd_writers' in both
> linus tree and block-6.7, and for-6.8/block isn't open yet.

This is introduced from commit dc85fbc92365 ("block: Add config option
to not allow writing to mounted devices") from linux-next by Jan.

Thanks,
Kuai

> 
> Thanks,
> Ming
> 
> .
>
Ming Lei Nov. 21, 2023, 12:26 p.m. UTC | #4
On Tue, Nov 21, 2023 at 07:36:34PM +0800, Yu Kuai wrote:
> Hi,
> 
> 在 2023/11/21 19:21, Ming Lei 写道:
> > On Tue, Nov 21, 2023 at 07:12:44PM +0800, Yu Kuai wrote:
> > > Hi,
> > > 
> > > 在 2023/11/21 18:11, Ming Lei 写道:
> > > > The .bd_inode field of block_device is used in IO fast path of
> > > > blkdev_write_iter() and blkdev_llseek(), so it is more efficient to keep
> > > > it into the 1st cacheline.
> > > > 
> > > > .bd_openers is only touched in open()/close(), and .bd_size_lock is only
> > > > for updating bdev capacity, which is in slow path too.
> > > > 
> > > > So swap .bd_inode layout with .bd_openers & .bd_size_lock to move
> > > > .bd_inode into the 1st cache line.
> > > 
> > > This patch looks good, do you want me do take it for a v3 for the
> > > other patchset?
> > 
> > Yeah, please take it.
> 
> Ok
> > 
> > > 
> > > And by the way, can we also move 'int bd_writers' to near 'atomic_t
> > > bd_fsfreeze_count' to save 8 bytes(int 64bit platform)?
> > > 
> > > diff --git a/include/linux/blk_types.h b/include/linux/blk_types.h
> > > index 07abd0165226..a47ab9249bdd 100644
> > > --- a/include/linux/blk_types.h
> > > +++ b/include/linux/blk_types.h
> > > @@ -63,11 +63,11 @@ struct block_device {
> > >          int                     bd_holders;
> > >          struct kobject          *bd_holder_dir;
> > > 
> > > +       int                     bd_writers;
> > >          atomic_t                bd_fsfreeze_count; /* number of freeze
> > > requests */
> > >          struct mutex            bd_fsfreeze_mutex; /* serialize freeze/thaw
> > > */
> > > 
> > >          struct partition_meta_info *bd_meta_info;
> > > -       int                     bd_writers;
> > 
> > Which tree are you talking about? I don't see 'bd_writers' in both
> > linus tree and block-6.7, and for-6.8/block isn't open yet.
> 
> This is introduced from commit dc85fbc92365 ("block: Add config option
> to not allow writing to mounted devices") from linux-next by Jan.

Patch isn't supposed to be against linux-next, and either you need to base the
change against maintainer tree(fs) or block tree when Jan's change lands linus
tree.


Thanks,
Ming
diff mbox series

Patch

diff --git a/include/linux/blk_types.h b/include/linux/blk_types.h
index d5c5e59ddbd2..f7d40692dd94 100644
--- a/include/linux/blk_types.h
+++ b/include/linux/blk_types.h
@@ -49,9 +49,10 @@  struct block_device {
 	bool			bd_write_holder;
 	bool			bd_has_submit_bio;
 	dev_t			bd_dev;
+	struct inode		*bd_inode;	/* will die */
+
 	atomic_t		bd_openers;
 	spinlock_t		bd_size_lock; /* for bd_inode->i_size updates */
-	struct inode *		bd_inode;	/* will die */
 	void *			bd_claiming;
 	void *			bd_holder;
 	const struct blk_holder_ops *bd_holder_ops;