[v4,11/30] qcow2: Add l2_entry_size()

Message ID	fd0f93353a218ff4518f34ebdbca05c2fc0f1085.1584468723.git.berto@igalia.com (mailing list archive)
State	New, archived
Headers	show Return-Path: <SRS0=XIZz=5C=nongnu.org=qemu-devel-bounces+patchwork-qemu-devel=patchwork.kernel.org@kernel.org> DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org EF6F520674 From: Alberto Garcia <berto@igalia.com> To: qemu-devel@nongnu.org Subject: [PATCH v4 11/30] qcow2: Add l2_entry_size() Date: Tue, 17 Mar 2020 19:16:08 +0100 Message-Id: <fd0f93353a218ff4518f34ebdbca05c2fc0f1085.1584468723.git.berto@igalia.com> In-Reply-To: <cover.1584468723.git.berto@igalia.com> References: <cover.1584468723.git.berto@igalia.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Precedence: list Cc: Kevin Wolf <kwolf@redhat.com>, Anton Nefedov <anton.nefedov@virtuozzo.com>, Alberto Garcia <berto@igalia.com>, qemu-block@nongnu.org, Max Reitz <mreitz@redhat.com>, Vladimir Sementsov-Ogievskiy <vsementsov@virtuozzo.com>, "Denis V . Lunev" <den@openvz.org> Errors-To: qemu-devel-bounces+patchwork-qemu-devel=patchwork.kernel.org@nongnu.org Sender: "Qemu-devel" <qemu-devel-bounces+patchwork-qemu-devel=patchwork.kernel.org@nongnu.org>
Series	Add subcluster allocation to qcow2 \| expand [v4,00/30] Add subcluster allocation to qcow2 [v4,01/30] qcow2: Make Qcow2AioTask store the full host offset [v4,02/30] qcow2: Convert qcow2_get_cluster_offset() into qcow2_get_host_offset() [v4,03/30] qcow2: Add calculate_l2_meta() [v4,04/30] qcow2: Split cluster_needs_cow() out of count_cow_clusters() [v4,05/30] qcow2: Process QCOW2_CLUSTER_ZERO_ALLOC clusters in handle_copied() [v4,06/30] qcow2: Add get_l2_entry() and set_l2_entry() [v4,07/30] qcow2: Document the Extended L2 Entries feature [v4,08/30] qcow2: Add dummy has_subclusters() function [v4,09/30] qcow2: Add subcluster-related fields to BDRVQcow2State [v4,10/30] qcow2: Add offset_to_sc_index() [v4,11/30] qcow2: Add l2_entry_size() [v4,12/30] qcow2: Update get/set_l2_entry() and add get/set_l2_bitmap() [v4,13/30] qcow2: Add QCow2SubclusterType and qcow2_get_subcluster_type() [v4,14/30] qcow2: Add cluster type parameter to qcow2_get_host_offset() [v4,15/30] qcow2: Replace QCOW2_CLUSTER_* with QCOW2_SUBCLUSTER_* [v4,16/30] qcow2: Handle QCOW2_SUBCLUSTER_UNALLOCATED_ALLOC [v4,17/30] qcow2: Add subcluster support to calculate_l2_meta() [v4,18/30] qcow2: Add subcluster support to qcow2_get_host_offset() [v4,19/30] qcow2: Add subcluster support to zero_in_l2_slice() [v4,20/30] qcow2: Add subcluster support to discard_in_l2_slice() [v4,21/30] qcow2: Add subcluster support to check_refcounts_l2() [v4,22/30] qcow2: Fix offset calculation in handle_dependencies() [v4,23/30] qcow2: Update L2 bitmap in qcow2_alloc_cluster_link_l2() [v4,24/30] qcow2: Clear the L2 bitmap when allocating a compressed cluster [v4,25/30] qcow2: Add subcluster support to handle_alloc_space() [v4,26/30] qcow2: Restrict qcow2_co_pwrite_zeroes() to full clusters only [v4,27/30] qcow2: Assert that expand_zero_clusters_in_l1() does not support subclusters [v4,28/30] qcow2: Add the 'extended_l2' option and the QCOW2_INCOMPAT_EXTL2 bit [v4,29/30] qcow2: Add subcluster support to qcow2_measure() [v4,30/30] iotests: Add tests for qcow2 images with extended L2 entries

Alberto Garcia March 17, 2020, 6:16 p.m. UTC

qcow2 images with subclusters have 128-bit L2 entries. The first 64
bits contain the same information as traditional images and the last
64 bits form a bitmap with the status of each individual subcluster.

Because of that we cannot assume that L2 entries are sizeof(uint64_t)
anymore. This function returns the proper value for the image.

Signed-off-by: Alberto Garcia <berto@igalia.com>
Reviewed-by: Max Reitz <mreitz@redhat.com>
---
 block/qcow2.h          |  9 +++++++++
 block/qcow2-cluster.c  | 12 ++++++------
 block/qcow2-refcount.c | 14 ++++++++------
 block/qcow2.c          |  8 ++++----
 4 files changed, 27 insertions(+), 16 deletions(-)

Vladimir Sementsov-Ogievskiy April 14, 2020, 9:44 a.m. UTC | #1

17.03.2020 21:16, Alberto Garcia wrote:
> qcow2 images with subclusters have 128-bit L2 entries. The first 64
> bits contain the same information as traditional images and the last
> 64 bits form a bitmap with the status of each individual subcluster.
> 
> Because of that we cannot assume that L2 entries are sizeof(uint64_t)
> anymore. This function returns the proper value for the image.
> 
> Signed-off-by: Alberto Garcia <berto@igalia.com>
> Reviewed-by: Max Reitz <mreitz@redhat.com>
> ---
>   block/qcow2.h          |  9 +++++++++
>   block/qcow2-cluster.c  | 12 ++++++------
>   block/qcow2-refcount.c | 14 ++++++++------
>   block/qcow2.c          |  8 ++++----
>   4 files changed, 27 insertions(+), 16 deletions(-)
> 
> diff --git a/block/qcow2.h b/block/qcow2.h
> index 06929072d2..1eb4b46807 100644
> --- a/block/qcow2.h
> +++ b/block/qcow2.h
> @@ -80,6 +80,10 @@
>   
>   #define QCOW_EXTL2_SUBCLUSTERS_PER_CLUSTER 32
>   
> +/* Size of normal and extended L2 entries */
> +#define L2E_SIZE_NORMAL   (sizeof(uint64_t))
> +#define L2E_SIZE_EXTENDED (sizeof(uint64_t) * 2)
> +
>   #define MIN_CLUSTER_BITS 9
>   #define MAX_CLUSTER_BITS 21
>   
> @@ -506,6 +510,11 @@ static inline bool has_subclusters(BDRVQcow2State *s)
>       return false;
>   }
>   
> +static inline size_t l2_entry_size(BDRVQcow2State *s)
> +{
> +    return has_subclusters(s) ? L2E_SIZE_EXTENDED : L2E_SIZE_NORMAL;
> +}
> +
>   static inline uint64_t get_l2_entry(BDRVQcow2State *s, uint64_t *l2_slice,
>                                       int idx)
>   {
> diff --git a/block/qcow2-cluster.c b/block/qcow2-cluster.c
> index cd48ab0223..41a23c5305 100644
> --- a/block/qcow2-cluster.c
> +++ b/block/qcow2-cluster.c
> @@ -208,7 +208,7 @@ static int l2_load(BlockDriverState *bs, uint64_t offset,
>                      uint64_t l2_offset, uint64_t **l2_slice)
>   {
>       BDRVQcow2State *s = bs->opaque;
> -    int start_of_slice = sizeof(uint64_t) *
> +    int start_of_slice = l2_entry_size(s) *
>           (offset_to_l2_index(s, offset) - offset_to_l2_slice_index(s, offset));
>   
>       return qcow2_cache_get(bs, s->l2_table_cache, l2_offset + start_of_slice,
> @@ -281,7 +281,7 @@ static int l2_allocate(BlockDriverState *bs, int l1_index)
>   
>       /* allocate a new l2 entry */
>   
> -    l2_offset = qcow2_alloc_clusters(bs, s->l2_size * sizeof(uint64_t));
> +    l2_offset = qcow2_alloc_clusters(bs, s->l2_size * l2_entry_size(s));

hmm. s->l2_size * l2_entry_size, isn't it just s->cluster_size always? Maybe, just refactor these things?


>       if (l2_offset < 0) {
>           ret = l2_offset;
>           goto fail;
> @@ -305,7 +305,7 @@ static int l2_allocate(BlockDriverState *bs, int l1_index)

[...]

> @@ -1425,7 +1425,7 @@ static int coroutine_fn qcow2_do_open(BlockDriverState *bs, QDict *options,
>           bs->encrypted = true;
>       }
>   
> -    s->l2_bits = s->cluster_bits - 3; /* L2 is always one cluster */
> +    s->l2_bits = s->cluster_bits - ctz32(l2_entry_size(s));
>       s->l2_size = 1 << s->l2_bits;
>       /* 2^(s->refcount_order - 3) is the refcount width in bytes */
>       s->refcount_block_bits = s->cluster_bits - (s->refcount_order - 3);
> @@ -4104,7 +4104,7 @@ static int coroutine_fn qcow2_co_truncate(BlockDriverState *bs, int64_t offset,
>            *  preallocation. All that matters is that we will not have to allocate
>            *  new refcount structures for them.) */
>           nb_new_l2_tables = DIV_ROUND_UP(nb_new_data_clusters,
> -                                        s->cluster_size / sizeof(uint64_t));
> +                                        s->cluster_size / l2_entry_size(s));

Isn't it just s->l2_size ?

>           /* The cluster range may not be aligned to L2 boundaries, so add one L2
>            * table for a potential head/tail */
>           nb_new_l2_tables++;
> 


Conversions looks correct, but how to check that we have converted everything?

Trying at least

    cd block; git grep 'sizeof(uint64_t)' qcow2* | grep -v 'l1_size \*' | grep -v 'l1_sz \*' | grep -v refcount | grep -v reftable

I found this not converted chunk:

     /* total size of L2 tables */
     nl2e = aligned_total_size / cluster_size;
     nl2e = ROUND_UP(nl2e, cluster_size / sizeof(uint64_t));
     meta_size += nl2e * sizeof(uint64_t);


Hmm. How to avoid it? Maybe, at least, refactor the code, to drop all sizeof(uint64_t), converting them to L2_ENTRY_SIZE, L1_ENTRY_SIZE, REFTABLE_ENTRY_SIZE etc?
And all occurrences of pure '8' (not many of them exist)

Alberto Garcia April 14, 2020, 12:20 p.m. UTC | #2

On Tue 14 Apr 2020 11:44:57 AM CEST, Vladimir Sementsov-Ogievskiy wrote:
>>       /* allocate a new l2 entry */
>>   
>> -    l2_offset = qcow2_alloc_clusters(bs, s->l2_size * sizeof(uint64_t));
>> +    l2_offset = qcow2_alloc_clusters(bs, s->l2_size * l2_entry_size(s));
>
> hmm. s->l2_size * l2_entry_size, isn't it just s->cluster_size always?
> Maybe, just refactor these things?

I think the patch is simpler to follow if I only do the strictly
necessary changes and don't mix them with other things.

>>           nb_new_l2_tables = DIV_ROUND_UP(nb_new_data_clusters,
>> -                                        s->cluster_size / sizeof(uint64_t));
>> +                                        s->cluster_size / l2_entry_size(s));
>
> Isn't it just s->l2_size ?

Yes, same as before.

>>           /* The cluster range may not be aligned to L2 boundaries, so add one L2
>>            * table for a potential head/tail */
>>           nb_new_l2_tables++;
>
> Conversions looks correct, but how to check that we have converted
> everything?

I went through all cases, I think I didn't miss any!

> I found this not converted chunk:
>
>      /* total size of L2 tables */
>      nl2e = aligned_total_size / cluster_size;
>      nl2e = ROUND_UP(nl2e, cluster_size / sizeof(uint64_t));
>      meta_size += nl2e * sizeof(uint64_t);

This is used by qcow2_measure() and is fixed on a later patch because,
unlike all other cases, it does not use a BlockDriverState to determine
the size of an L2 entry.

> Hmm. How to avoid it? Maybe, at least, refactor the code, to drop all
> sizeof(uint64_t), converting them to L2_ENTRY_SIZE, L1_ENTRY_SIZE,
> REFTABLE_ENTRY_SIZE etc?

That wouldn't be a bad thing I guess but, again, for a separate patch or
series.

> And all occurrences of pure '8' (not many of them exist)

I think most/all nowadays only refer to the number of bits per byte.

Maybe there's a couple that still need to be fixed, but we have been
removing a lot of numeric literals from the qcow2 code (see for example
b6c246942b, 3afea40243 or a35f87f50d).

Berto

Vladimir Sementsov-Ogievskiy April 14, 2020, 12:29 p.m. UTC | #3

14.04.2020 15:20, Alberto Garcia wrote:
> On Tue 14 Apr 2020 11:44:57 AM CEST, Vladimir Sementsov-Ogievskiy wrote:
>>>        /* allocate a new l2 entry */
>>>    
>>> -    l2_offset = qcow2_alloc_clusters(bs, s->l2_size * sizeof(uint64_t));
>>> +    l2_offset = qcow2_alloc_clusters(bs, s->l2_size * l2_entry_size(s));
>>
>> hmm. s->l2_size * l2_entry_size, isn't it just s->cluster_size always?
>> Maybe, just refactor these things?
> 
> I think the patch is simpler to follow if I only do the strictly
> necessary changes and don't mix them with other things.
> 
>>>            nb_new_l2_tables = DIV_ROUND_UP(nb_new_data_clusters,
>>> -                                        s->cluster_size / sizeof(uint64_t));
>>> +                                        s->cluster_size / l2_entry_size(s));
>>
>> Isn't it just s->l2_size ?
> 
> Yes, same as before.
> 
>>>            /* The cluster range may not be aligned to L2 boundaries, so add one L2
>>>             * table for a potential head/tail */
>>>            nb_new_l2_tables++;
>>
>> Conversions looks correct, but how to check that we have converted
>> everything?
> 
> I went through all cases, I think I didn't miss any!
> 
>> I found this not converted chunk:
>>
>>       /* total size of L2 tables */
>>       nl2e = aligned_total_size / cluster_size;
>>       nl2e = ROUND_UP(nl2e, cluster_size / sizeof(uint64_t));
>>       meta_size += nl2e * sizeof(uint64_t);
> 
> This is used by qcow2_measure() and is fixed on a later patch because,
> unlike all other cases, it does not use a BlockDriverState to determine
> the size of an L2 entry.
> 
>> Hmm. How to avoid it? Maybe, at least, refactor the code, to drop all
>> sizeof(uint64_t), converting them to L2_ENTRY_SIZE, L1_ENTRY_SIZE,
>> REFTABLE_ENTRY_SIZE etc?
> 
> That wouldn't be a bad thing I guess but, again, for a separate patch or
> series.
> 
>> And all occurrences of pure '8' (not many of them exist)
> 
> I think most/all nowadays only refer to the number of bits per byte.
> 
> Maybe there's a couple that still need to be fixed, but we have been
> removing a lot of numeric literals from the qcow2 code (see for example
> b6c246942b, 3afea40243 or a35f87f50d).
> 


git grep '\<8\>' block/qcow2*

shows at least

qcow2-cluster.c:            s->l1_table_offset + 8 * l1_start_index, bufsize, false);
qcow2-cluster.c:                           s->l1_table_offset + 8 * l1_start_index,

Alberto Garcia April 14, 2020, 12:33 p.m. UTC | #4

On Tue 14 Apr 2020 02:29:13 PM CEST, Vladimir Sementsov-Ogievskiy wrote:
>>> Hmm. How to avoid it? Maybe, at least, refactor the code, to drop all
>>> sizeof(uint64_t), converting them to L2_ENTRY_SIZE, L1_ENTRY_SIZE,
>>> REFTABLE_ENTRY_SIZE etc?
>> 
>> That wouldn't be a bad thing I guess but, again, for a separate patch or
>> series.
>> 
>>> And all occurrences of pure '8' (not many of them exist)
>> 
>> I think most/all nowadays only refer to the number of bits per byte.
>> 
>> Maybe there's a couple that still need to be fixed, but we have been
>> removing a lot of numeric literals from the qcow2 code (see for example
>> b6c246942b, 3afea40243 or a35f87f50d).
>> 
>
>
> git grep '\<8\>' block/qcow2*
>
> shows at least
>
> qcow2-cluster.c:            s->l1_table_offset + 8 * l1_start_index, bufsize, false);
> qcow2-cluster.c:                           s->l1_table_offset + 8 * l1_start_index,

I see, worth replacing with L1_ENTRY_SIZE as you suggest. I can take of
writing the patches if you want.

Berto

Vladimir Sementsov-Ogievskiy April 14, 2020, 12:39 p.m. UTC | #5

14.04.2020 15:33, Alberto Garcia wrote:
> On Tue 14 Apr 2020 02:29:13 PM CEST, Vladimir Sementsov-Ogievskiy wrote:
>>>> Hmm. How to avoid it? Maybe, at least, refactor the code, to drop all
>>>> sizeof(uint64_t), converting them to L2_ENTRY_SIZE, L1_ENTRY_SIZE,
>>>> REFTABLE_ENTRY_SIZE etc?
>>>
>>> That wouldn't be a bad thing I guess but, again, for a separate patch or
>>> series.
>>>
>>>> And all occurrences of pure '8' (not many of them exist)
>>>
>>> I think most/all nowadays only refer to the number of bits per byte.
>>>
>>> Maybe there's a couple that still need to be fixed, but we have been
>>> removing a lot of numeric literals from the qcow2 code (see for example
>>> b6c246942b, 3afea40243 or a35f87f50d).
>>>
>>
>>
>> git grep '\<8\>' block/qcow2*
>>
>> shows at least
>>
>> qcow2-cluster.c:            s->l1_table_offset + 8 * l1_start_index, bufsize, false);
>> qcow2-cluster.c:                           s->l1_table_offset + 8 * l1_start_index,
> 
> I see, worth replacing with L1_ENTRY_SIZE as you suggest. I can take of
> writing the patches if you want.
> 

That would be great, if not too burdensome :)

Eric Blake April 14, 2020, 4:01 p.m. UTC | #6

On 4/14/20 7:20 AM, Alberto Garcia wrote:

>> Hmm. How to avoid it? Maybe, at least, refactor the code, to drop all
>> sizeof(uint64_t), converting them to L2_ENTRY_SIZE, L1_ENTRY_SIZE,
>> REFTABLE_ENTRY_SIZE etc?
> 
> That wouldn't be a bad thing I guess but, again, for a separate patch or
> series.
> 
>> And all occurrences of pure '8' (not many of them exist)
> 
> I think most/all nowadays only refer to the number of bits per byte.

CHAR_BIT (from <limits.h>) is good for that.

> 
> Maybe there's a couple that still need to be fixed, but we have been
> removing a lot of numeric literals from the qcow2 code (see for example
> b6c246942b, 3afea40243 or a35f87f50d).
> 
> Berto
>

Alberto Garcia April 14, 2020, 4:16 p.m. UTC | #7

On Tue 14 Apr 2020 06:01:42 PM CEST, Eric Blake <eblake@redhat.com> wrote:
>>> And all occurrences of pure '8' (not many of them exist)
>> 
>> I think most/all nowadays only refer to the number of bits per byte.
>
> CHAR_BIT (from <limits.h>) is good for that.

Wow, ok, I wonder if that actually makes the code more readable, but
I'll take it into account when writing the patch, thanks.

Berto

[v4,11/30] qcow2: Add l2_entry_size()

Commit Message

Comments

Patch