diff mbox series

xen/pvhsim: fix cpu onlining

Message ID 20191023121209.4814-1-jgross@suse.com (mailing list archive)
State Superseded
Headers show
Series xen/pvhsim: fix cpu onlining | expand

Commit Message

Jürgen Groß Oct. 23, 2019, 12:12 p.m. UTC
Since commit 8d3c326f6756d1 ("xen: let vcpu_create() select processor")
the initial processor for all pv-shim vcpus will be 0, as no other cpus
are online when the vcpus are created. Before that commit the vcpus
would have processors set not being online yet, which worked just by
chance.

When the pv-shim vcpu becomes active it will have a hard affinity
not matching its initial processor assignment leading to failing
ASSERT()s or other problems depending on the selected scheduler.

Fix that by doing the affinity setting after onlining the cpu but
before taking the vcpu up. For vcpu 0 this is still in
sched_setup_dom0_vcpus(), for the other vcpus setting the affinity
there can be dropped.

Fixes: 8d3c326f6756d1 ("xen: let vcpu_create() select processor")
Reported-by: Sergey Dyasli <sergey.dyasli@citrix.com>
Tested-by: Sergey Dyasli <sergey.dyasli@citrix.com>
Signed-off-by: Juergen Gross <jgross@suse.com>
---
 xen/arch/x86/pv/shim.c |  2 ++
 xen/common/schedule.c  | 11 +++++------
 2 files changed, 7 insertions(+), 6 deletions(-)

Comments

Roger Pau Monné Oct. 23, 2019, 12:55 p.m. UTC | #1
On Wed, Oct 23, 2019 at 02:12:09PM +0200, Juergen Gross wrote:
> Since commit 8d3c326f6756d1 ("xen: let vcpu_create() select processor")
> the initial processor for all pv-shim vcpus will be 0, as no other cpus
> are online when the vcpus are created. Before that commit the vcpus
> would have processors set not being online yet, which worked just by
> chance.
> 
> When the pv-shim vcpu becomes active it will have a hard affinity
> not matching its initial processor assignment leading to failing
> ASSERT()s or other problems depending on the selected scheduler.
> 
> Fix that by doing the affinity setting after onlining the cpu but
> before taking the vcpu up. For vcpu 0 this is still in
> sched_setup_dom0_vcpus(), for the other vcpus setting the affinity
> there can be dropped.
> 
> Fixes: 8d3c326f6756d1 ("xen: let vcpu_create() select processor")
> Reported-by: Sergey Dyasli <sergey.dyasli@citrix.com>
> Tested-by: Sergey Dyasli <sergey.dyasli@citrix.com>
> Signed-off-by: Juergen Gross <jgross@suse.com>

Reviewed-by: Roger Pau Monné <roger.pau@citrix.com>

Thanks.
Jürgen Groß Oct. 23, 2019, 1:46 p.m. UTC | #2
On 23.10.19 14:55, Roger Pau Monné wrote:
> On Wed, Oct 23, 2019 at 02:12:09PM +0200, Juergen Gross wrote:
>> Since commit 8d3c326f6756d1 ("xen: let vcpu_create() select processor")
>> the initial processor for all pv-shim vcpus will be 0, as no other cpus
>> are online when the vcpus are created. Before that commit the vcpus
>> would have processors set not being online yet, which worked just by
>> chance.
>>
>> When the pv-shim vcpu becomes active it will have a hard affinity
>> not matching its initial processor assignment leading to failing
>> ASSERT()s or other problems depending on the selected scheduler.
>>
>> Fix that by doing the affinity setting after onlining the cpu but
>> before taking the vcpu up. For vcpu 0 this is still in
>> sched_setup_dom0_vcpus(), for the other vcpus setting the affinity
>> there can be dropped.
>>
>> Fixes: 8d3c326f6756d1 ("xen: let vcpu_create() select processor")
>> Reported-by: Sergey Dyasli <sergey.dyasli@citrix.com>
>> Tested-by: Sergey Dyasli <sergey.dyasli@citrix.com>
>> Signed-off-by: Juergen Gross <jgross@suse.com>
> 
> Reviewed-by: Roger Pau Monné <roger.pau@citrix.com>

And just for the protocol:

Release-acked-by: Juergen Gross <jgross@suse.com>


Juergen
Jan Beulich Oct. 23, 2019, 1:53 p.m. UTC | #3
On 23.10.2019 14:55, Roger Pau Monné wrote:
> On Wed, Oct 23, 2019 at 02:12:09PM +0200, Juergen Gross wrote:
>> Since commit 8d3c326f6756d1 ("xen: let vcpu_create() select processor")
>> the initial processor for all pv-shim vcpus will be 0, as no other cpus
>> are online when the vcpus are created. Before that commit the vcpus
>> would have processors set not being online yet, which worked just by
>> chance.
>>
>> When the pv-shim vcpu becomes active it will have a hard affinity
>> not matching its initial processor assignment leading to failing
>> ASSERT()s or other problems depending on the selected scheduler.
>>
>> Fix that by doing the affinity setting after onlining the cpu but
>> before taking the vcpu up. For vcpu 0 this is still in
>> sched_setup_dom0_vcpus(), for the other vcpus setting the affinity
>> there can be dropped.
>>
>> Fixes: 8d3c326f6756d1 ("xen: let vcpu_create() select processor")
>> Reported-by: Sergey Dyasli <sergey.dyasli@citrix.com>
>> Tested-by: Sergey Dyasli <sergey.dyasli@citrix.com>
>> Signed-off-by: Juergen Gross <jgross@suse.com>
> 
> Reviewed-by: Roger Pau Monné <roger.pau@citrix.com>

Acked-by: Jan Beulich <jbeulich@suse.com>

I have to admit though that I miss a comment on the pv_shim
conditional in schedule.c - such a special case shouldn't
really be there, but since it's needed it should be explained.
I realize though that the patch here only moves the special
case, i.e. the lack of comment is pre-existing.

Jan
diff mbox series

Patch

diff --git a/xen/arch/x86/pv/shim.c b/xen/arch/x86/pv/shim.c
index 5edbcd9ac5..4329eaaefe 100644
--- a/xen/arch/x86/pv/shim.c
+++ b/xen/arch/x86/pv/shim.c
@@ -837,6 +837,8 @@  long pv_shim_cpu_up(void *data)
                     v->vcpu_id, rc);
             return rc;
         }
+
+        vcpu_set_hard_affinity(v, cpumask_of(v->vcpu_id));
     }
 
     wake = test_and_clear_bit(_VPF_down, &v->pause_flags);
diff --git a/xen/common/schedule.c b/xen/common/schedule.c
index c327c40b92..326f4d3601 100644
--- a/xen/common/schedule.c
+++ b/xen/common/schedule.c
@@ -3102,13 +3102,12 @@  void __init sched_setup_dom0_vcpus(struct domain *d)
     for ( i = 1; i < d->max_vcpus; i++ )
         vcpu_create(d, i);
 
-    for_each_sched_unit ( d, unit )
+    if ( pv_shim )
+        sched_set_affinity(d->vcpu[0]->sched_unit,
+                           cpumask_of(0), cpumask_of(0));
+    else
     {
-        unsigned int id = unit->unit_id;
-
-        if ( pv_shim )
-            sched_set_affinity(unit, cpumask_of(id), cpumask_of(id));
-        else
+        for_each_sched_unit ( d, unit )
         {
             if ( !opt_dom0_vcpus_pin && !dom0_affinity_relaxed )
                 sched_set_affinity(unit, &dom0_cpus, NULL);