Message ID | 20191217145939.5537-2-kwolf@redhat.com (mailing list archive) |
---|---|
State | New, archived |
Headers | show |
Series | block: Fix external snapshot with VM state | expand |
Am 17.12.2019 um 15:59 hat Kevin Wolf geschrieben: > bdrv_invalidate_cache_all() assumes that all nodes in a given subtree > are either active or inactive when it starts. Therefore, as soon as it > arrives at an already active node, it stops. > > However, this assumption is wrong. For example, it's possible to take a > snapshot of an inactive node, which results in an active overlay over an > inactive backing file. The active overlay is probably also the root node > of an inactive BlockBackend (blk->disable_perm == true). > > In this case, bdrv_invalidate_cache_all() does not need to do anything > to activate the overlay node, but it still needs to recurse into the > children and the parents to make sure that after returning success, > really everything is activated. > > Signed-off-by: Kevin Wolf <kwolf@redhat.com> Actually: Cc: qemu-stable@nongnu.org
On 17.12.19 15:59, Kevin Wolf wrote: > bdrv_invalidate_cache_all() assumes that all nodes in a given subtree > are either active or inactive when it starts. Therefore, as soon as it > arrives at an already active node, it stops. > > However, this assumption is wrong. For example, it's possible to take a > snapshot of an inactive node, which results in an active overlay over an > inactive backing file. The active overlay is probably also the root node > of an inactive BlockBackend (blk->disable_perm == true). > > In this case, bdrv_invalidate_cache_all() does not need to do anything > to activate the overlay node, but it still needs to recurse into the > children and the parents to make sure that after returning success, > really everything is activated. > > Signed-off-by: Kevin Wolf <kwolf@redhat.com> > --- > block.c | 50 ++++++++++++++++++++++++-------------------------- > 1 file changed, 24 insertions(+), 26 deletions(-) Basically the only change is to not skip the whole function when the node is already active but only the part that actually activates the node. blk_root_activate() is a no-op for already-active BBs, so this looks good to me. Reviewed-by: Max Reitz <mreitz@redhat.com>
diff --git a/block.c b/block.c index 73029fad64..1b6f7c86e8 100644 --- a/block.c +++ b/block.c @@ -5335,10 +5335,6 @@ static void coroutine_fn bdrv_co_invalidate_cache(BlockDriverState *bs, return; } - if (!(bs->open_flags & BDRV_O_INACTIVE)) { - return; - } - QLIST_FOREACH(child, &bs->children, next) { bdrv_co_invalidate_cache(child->bs, &local_err); if (local_err) { @@ -5360,34 +5356,36 @@ static void coroutine_fn bdrv_co_invalidate_cache(BlockDriverState *bs, * just keep the extended permissions for the next time that an activation * of the image is tried. */ - bs->open_flags &= ~BDRV_O_INACTIVE; - bdrv_get_cumulative_perm(bs, &perm, &shared_perm); - ret = bdrv_check_perm(bs, NULL, perm, shared_perm, NULL, NULL, &local_err); - if (ret < 0) { - bs->open_flags |= BDRV_O_INACTIVE; - error_propagate(errp, local_err); - return; - } - bdrv_set_perm(bs, perm, shared_perm); - - if (bs->drv->bdrv_co_invalidate_cache) { - bs->drv->bdrv_co_invalidate_cache(bs, &local_err); - if (local_err) { + if (bs->open_flags & BDRV_O_INACTIVE) { + bs->open_flags &= ~BDRV_O_INACTIVE; + bdrv_get_cumulative_perm(bs, &perm, &shared_perm); + ret = bdrv_check_perm(bs, NULL, perm, shared_perm, NULL, NULL, &local_err); + if (ret < 0) { bs->open_flags |= BDRV_O_INACTIVE; error_propagate(errp, local_err); return; } - } + bdrv_set_perm(bs, perm, shared_perm); - FOR_EACH_DIRTY_BITMAP(bs, bm) { - bdrv_dirty_bitmap_skip_store(bm, false); - } + if (bs->drv->bdrv_co_invalidate_cache) { + bs->drv->bdrv_co_invalidate_cache(bs, &local_err); + if (local_err) { + bs->open_flags |= BDRV_O_INACTIVE; + error_propagate(errp, local_err); + return; + } + } - ret = refresh_total_sectors(bs, bs->total_sectors); - if (ret < 0) { - bs->open_flags |= BDRV_O_INACTIVE; - error_setg_errno(errp, -ret, "Could not refresh total sector count"); - return; + FOR_EACH_DIRTY_BITMAP(bs, bm) { + bdrv_dirty_bitmap_skip_store(bm, false); + } + + ret = refresh_total_sectors(bs, bs->total_sectors); + if (ret < 0) { + bs->open_flags |= BDRV_O_INACTIVE; + error_setg_errno(errp, -ret, "Could not refresh total sector count"); + return; + } } QLIST_FOREACH(parent, &bs->parents, next_parent) {
bdrv_invalidate_cache_all() assumes that all nodes in a given subtree are either active or inactive when it starts. Therefore, as soon as it arrives at an already active node, it stops. However, this assumption is wrong. For example, it's possible to take a snapshot of an inactive node, which results in an active overlay over an inactive backing file. The active overlay is probably also the root node of an inactive BlockBackend (blk->disable_perm == true). In this case, bdrv_invalidate_cache_all() does not need to do anything to activate the overlay node, but it still needs to recurse into the children and the parents to make sure that after returning success, really everything is activated. Signed-off-by: Kevin Wolf <kwolf@redhat.com> --- block.c | 50 ++++++++++++++++++++++++-------------------------- 1 file changed, 24 insertions(+), 26 deletions(-)