
[v3,30/47] xen/sched: add support for multiple vcpus per sched unit where missing

Message ID 20190914085251.18816-31-jgross@suse.com (mailing list archive)
State Superseded
Series xen: add core scheduling support

Commit Message

Jürgen Groß Sept. 14, 2019, 8:52 a.m. UTC
In several places support for multiple vcpus per sched unit is still
missing. Add that missing support (with the exception of initial
allocation) and the missing helpers for it.

Signed-off-by: Juergen Gross <jgross@suse.com>
---
RFC V2:
- fix vcpu_runstate_helper()
V1:
- add special handling for idle unit in unit_runnable() and
  unit_runnable_state()
V2:
- handle affinity_broken correctly (Jan Beulich)
V3:
- type for cpu ->unsigned int (Jan Beulich)
---
 xen/common/domain.c        |  5 +++-
 xen/common/schedule.c      | 37 +++++++++++++-------------
 xen/include/xen/sched-if.h | 65 +++++++++++++++++++++++++++++++++++++++-------
 3 files changed, 79 insertions(+), 28 deletions(-)

Comments

Jan Beulich Sept. 23, 2019, 3:41 p.m. UTC | #1
On 14.09.2019 10:52, Juergen Gross wrote:
> @@ -266,15 +267,16 @@ static inline void vcpu_runstate_change(
>  static inline void sched_unit_runstate_change(struct sched_unit *unit,
>      bool running, s_time_t new_entry_time)
>  {
> -    struct vcpu *v = unit->vcpu_list;
> +    struct vcpu *v;
>  
> -    if ( running )
> -        vcpu_runstate_change(v, v->new_state, new_entry_time);
> -    else
> -        vcpu_runstate_change(v,
> -            ((v->pause_flags & VPF_blocked) ? RUNSTATE_blocked :
> -             (vcpu_runnable(v) ? RUNSTATE_runnable : RUNSTATE_offline)),
> -            new_entry_time);
> +    for_each_sched_unit_vcpu ( unit, v )
> +        if ( running )
> +            vcpu_runstate_change(v, v->new_state, new_entry_time);
> +        else
> +            vcpu_runstate_change(v,
> +                ((v->pause_flags & VPF_blocked) ? RUNSTATE_blocked :
> +                 (vcpu_runnable(v) ? RUNSTATE_runnable : RUNSTATE_offline)),
> +                new_entry_time);
>  }

As mentioned on v2 already, I'm having some difficulty seeing why a
function like this one (and some of the sched-if.h changes here)
couldn't be introduced with this loop you add now right away.

Seeing this change I'm also puzzled why ->new_state is used only in
case "running" is true. Is there anything speaking against setting
that field uniformly, and simply consuming it here in all cases?

> @@ -1031,10 +1033,9 @@ int cpu_disable_scheduler(unsigned int cpu)
>              if ( cpumask_empty(&online_affinity) &&
>                   cpumask_test_cpu(cpu, unit->cpu_hard_affinity) )
>              {
> -                /* TODO: multiple vcpus per unit. */
> -                if ( unit->vcpu_list->affinity_broken )
> +                if ( sched_check_affinity_broken(unit) )
>                  {
> -                    /* The vcpu is temporarily pinned, can't move it. */
> +                    /* The unit is temporarily pinned, can't move it. */
>                      unit_schedule_unlock_irqrestore(lock, flags, unit);

Along these lines, wouldn't this change (and further related ones)
belong into the patch introducing sched_check_affinity_broken()?

> @@ -1851,7 +1852,7 @@ void sched_context_switched(struct vcpu *vprev, struct vcpu *vnext)
>              while ( atomic_read(&next->rendezvous_out_cnt) )
>                  cpu_relax();
>      }
> -    else if ( vprev != vnext )
> +    else if ( vprev != vnext && sched_granularity == 1 )
>          context_saved(vprev);
>  }

Would you mind helping me with understanding why this call is
needed with a granularity of 1 only?

> --- a/xen/include/xen/sched-if.h
> +++ b/xen/include/xen/sched-if.h
> @@ -68,12 +68,32 @@ static inline bool is_idle_unit(const struct sched_unit *unit)
>  
>  static inline bool is_unit_online(const struct sched_unit *unit)
>  {
> -    return is_vcpu_online(unit->vcpu_list);
> +    struct vcpu *v;

const?

> +    for_each_sched_unit_vcpu ( unit, v )
> +        if ( is_vcpu_online(v) )
> +            return true;
> +
> +    return false;
> +}
> +
> +static inline unsigned int unit_running(const struct sched_unit *unit)
> +{
> +    return unit->runstate_cnt[RUNSTATE_running];
>  }

Is there really going to be a user needing the return value to be a
count, not just a boolean?

>  static inline bool unit_runnable(const struct sched_unit *unit)
>  {
> -    return vcpu_runnable(unit->vcpu_list);
> +    struct vcpu *v;

const?

> +    if ( is_idle_unit(unit) )
> +        return true;
> +
> +    for_each_sched_unit_vcpu ( unit, v )
> +        if ( vcpu_runnable(v) )
> +            return true;

Isn't the loop going to yield true anyway for idle units? If so, is
there a particular reason for the special casing of idle units up
front here?

Jan
Jürgen Groß Sept. 24, 2019, 2:41 p.m. UTC | #2
On 23.09.19 17:41, Jan Beulich wrote:
> On 14.09.2019 10:52, Juergen Gross wrote:
>> @@ -266,15 +267,16 @@ static inline void vcpu_runstate_change(
>>   static inline void sched_unit_runstate_change(struct sched_unit *unit,
>>       bool running, s_time_t new_entry_time)
>>   {
>> -    struct vcpu *v = unit->vcpu_list;
>> +    struct vcpu *v;
>>   
>> -    if ( running )
>> -        vcpu_runstate_change(v, v->new_state, new_entry_time);
>> -    else
>> -        vcpu_runstate_change(v,
>> -            ((v->pause_flags & VPF_blocked) ? RUNSTATE_blocked :
>> -             (vcpu_runnable(v) ? RUNSTATE_runnable : RUNSTATE_offline)),
>> -            new_entry_time);
>> +    for_each_sched_unit_vcpu ( unit, v )
>> +        if ( running )
>> +            vcpu_runstate_change(v, v->new_state, new_entry_time);
>> +        else
>> +            vcpu_runstate_change(v,
>> +                ((v->pause_flags & VPF_blocked) ? RUNSTATE_blocked :
>> +                 (vcpu_runnable(v) ? RUNSTATE_runnable : RUNSTATE_offline)),
>> +                new_entry_time);
>>   }
> 
> As mentioned on v2 already, I'm having some difficulty seeing why a
> function like this one (and some of the sched-if.h changes here)
> couldn't be introduced with this loop you add now right away.

I'll have a look (as promised before).

> 
> Seeing this change I'm also puzzled why ->new_state is used only in
> case "running" is true. Is there anything speaking against setting
> that field uniformly, and simply consuming it here in all cases?

There are multiple places where a non-running vcpu changes state,
while a vcpu is put into "running" only by scheduling it. So setting
new_state would need to be done in multiple places, in just the same
way I'm doing the state change here.
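
For illustration, here is the derived state from the "else" branch
above, pulled out as a standalone helper (a sketch only - this helper
doesn't exist in the patch):

    /* State of a vcpu that is not running: derived from its current
     * flags at the time of the change, not stored beforehand. */
    static inline int vcpu_stopped_runstate(const struct vcpu *v)
    {
        return (v->pause_flags & VPF_blocked)
               ? RUNSTATE_blocked
               : (vcpu_runnable(v) ? RUNSTATE_runnable
                                   : RUNSTATE_offline);
    }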

> 
>> @@ -1031,10 +1033,9 @@ int cpu_disable_scheduler(unsigned int cpu)
>>               if ( cpumask_empty(&online_affinity) &&
>>                    cpumask_test_cpu(cpu, unit->cpu_hard_affinity) )
>>               {
>> -                /* TODO: multiple vcpus per unit. */
>> -                if ( unit->vcpu_list->affinity_broken )
>> +                if ( sched_check_affinity_broken(unit) )
>>                   {
>> -                    /* The vcpu is temporarily pinned, can't move it. */
>> +                    /* The unit is temporarily pinned, can't move it. */
>>                       unit_schedule_unlock_irqrestore(lock, flags, unit);
> 
> Along these lines, wouldn't this change (and further related ones)
> belong into the patch introducing sched_check_affinity_broken()?

I already agreed on that.

> 
>> @@ -1851,7 +1852,7 @@ void sched_context_switched(struct vcpu *vprev, struct vcpu *vnext)
>>               while ( atomic_read(&next->rendezvous_out_cnt) )
>>                   cpu_relax();
>>       }
>> -    else if ( vprev != vnext )
>> +    else if ( vprev != vnext && sched_granularity == 1 )
>>           context_saved(vprev);
>>   }
> 
> Would you mind helping me with understanding why this call is
> needed with a granularity of 1 only?

Otherwise it is done just a few lines up (granularity 1 will result
in rendezvous_out_cnt being zero).
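
For reference, a condensed sketch of that code path (simplified and
reconstructed, not a verbatim quote of the series):

    void sched_context_switched(struct vcpu *vprev, struct vcpu *vnext)
    {
        struct sched_unit *next = vnext->sched_unit;

        if ( atomic_read(&next->rendezvous_out_cnt) )
        {
            /* Granularity > 1: the last cpu leaving the rendezvous
             * calls context_saved(), all others just wait. */
            if ( atomic_dec_return(&next->rendezvous_out_cnt) == 1 )
            {
                if ( vprev != vnext )
                    context_saved(vprev);
                atomic_dec(&next->rendezvous_out_cnt);
            }
            else
                while ( atomic_read(&next->rendezvous_out_cnt) )
                    cpu_relax();
        }
        else if ( vprev != vnext && sched_granularity == 1 )
            context_saved(vprev);
    }

With granularity 1 the rendezvous counters are never armed, so the
final "else if" is the only place where context_saved() can run.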

> 
>> --- a/xen/include/xen/sched-if.h
>> +++ b/xen/include/xen/sched-if.h
>> @@ -68,12 +68,32 @@ static inline bool is_idle_unit(const struct sched_unit *unit)
>>   
>>   static inline bool is_unit_online(const struct sched_unit *unit)
>>   {
>> -    return is_vcpu_online(unit->vcpu_list);
>> +    struct vcpu *v;
> 
> const?

Yes.

> 
>> +    for_each_sched_unit_vcpu ( unit, v )
>> +        if ( is_vcpu_online(v) )
>> +            return true;
>> +
>> +    return false;
>> +}
>> +
>> +static inline unsigned int unit_running(const struct sched_unit *unit)
>> +{
>> +    return unit->runstate_cnt[RUNSTATE_running];
>>   }
> 
> Is there really going to be a user needing the return value to be a
> count, not just a boolean?

Yes. See patch 35.

> 
>>   static inline bool unit_runnable(const struct sched_unit *unit)
>>   {
>> -    return vcpu_runnable(unit->vcpu_list);
>> +    struct vcpu *v;
> 
> const?

Yes.

> 
>> +    if ( is_idle_unit(unit) )
>> +        return true;
>> +
>> +    for_each_sched_unit_vcpu ( unit, v )
>> +        if ( vcpu_runnable(v) )
>> +            return true;
> 
> Isn't the loop going to yield true anyway for idle units? If so, is
> there a particular reason for the special casing of idle units up
> front here?

Didn't I explain that before? for_each_sched_unit_vcpu() for an idle
unit might end prematurely when one of the vcpus is running in another
unit (idle_vcpu->sched_unit != idle_unit).
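
For reference, the construct being referred to has roughly this shape
(reconstructed from the "(v)->sched_unit == (i)" condition discussed
further down in the thread, so treat it as a sketch):

    #define for_each_sched_unit_vcpu(i, v)                    \
        for ( (v) = (i)->vcpu_list;                           \
              (v) != NULL && (v)->sched_unit == (i);          \
              (v) = (v)->next_in_list )

An idle vcpu lent out to a "real" unit has its ->sched_unit pointing
at that unit, so iterating over the idle unit stops there and skips
that vcpu and all later ones.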


Juergen
Jan Beulich Sept. 24, 2019, 3 p.m. UTC | #3
On 24.09.2019 16:41, Jürgen Groß wrote:
> On 23.09.19 17:41, Jan Beulich wrote:
>> On 14.09.2019 10:52, Juergen Gross wrote:
>>> @@ -1851,7 +1852,7 @@ void sched_context_switched(struct vcpu *vprev, struct vcpu *vnext)
>>>               while ( atomic_read(&next->rendezvous_out_cnt) )
>>>                   cpu_relax();
>>>       }
>>> -    else if ( vprev != vnext )
>>> +    else if ( vprev != vnext && sched_granularity == 1 )
>>>           context_saved(vprev);
>>>   }
>>
>> Would you mind helping me with understanding why this call is
>> needed with a granularity of 1 only?
> 
> Otherwise it is done just a few lines up (granularity 1 will result
> in rendezvous_out_cnt being zero).

I.e. if rendezvous_out_cnt is zero and vprev != vnext but
sched_granularity > 1 the call isn't needed? Why? At the end of
the series vcpu_context_saved() gets called in all cases; what's
conditional upon granularity being 1 there is the call to
unit_context_saved().

>>> +    if ( is_idle_unit(unit) )
>>> +        return true;
>>> +
>>> +    for_each_sched_unit_vcpu ( unit, v )
>>> +        if ( vcpu_runnable(v) )
>>> +            return true;
>>
>> Isn't the loop going to yield true anyway for idle units? If so, is
>> there a particular reason for the special casing of idle units up
>> front here?
> 
> Didn't I explain that before?

Quite possible; a good hint that a code comment wouldn't hurt.

> for_each_sched_unit_vcpu() for an idle
> unit might end prematurely when one of the vcpus is running in another
> unit (idle_vcpu->sched_unit != idle_unit).

Oh, that (v)->sched_unit == (i) in the construct is clearly unexpected.
Is this really still needed by the end of the series? I realize that
_some_ check is needed, but could this perhaps be arranged in a way
that you'd still hit all vCPU-s when using it on an idle unit, no
matter whether they're in use as a substitute in a "real" unit?

As to that construct - why is the parameter named "i" rather than "u"?
And why "e" in for_each_sched_unit()?

Jan
Jürgen Groß Sept. 24, 2019, 3:09 p.m. UTC | #4
On 24.09.19 17:00, Jan Beulich wrote:
> On 24.09.2019 16:41, Jürgen Groß wrote:
>> On 23.09.19 17:41, Jan Beulich wrote:
>>> On 14.09.2019 10:52, Juergen Gross wrote:
>>>> @@ -1851,7 +1852,7 @@ void sched_context_switched(struct vcpu *vprev, struct vcpu *vnext)
>>>>                while ( atomic_read(&next->rendezvous_out_cnt) )
>>>>                    cpu_relax();
>>>>        }
>>>> -    else if ( vprev != vnext )
>>>> +    else if ( vprev != vnext && sched_granularity == 1 )
>>>>            context_saved(vprev);
>>>>    }
>>>
>>> Would you mind helping me with understanding why this call is
>>> needed with a granularity of 1 only?
>>
>> Otherwise it is done just a few lines up (granularity 1 will result
>> in rendezvous_out_cnt being zero).
> 
> I.e. if rendezvous_out_cnt is zero and vprev != vnext but
> sched_granularity > 1 the call isn't needed? Why? At the end of

I can add an ASSERT() here. This should never happen.

> the series vcpu_context_saved() gets called in all cases; what's
> conditional upon granularity being 1 there is the call to
> unit_context_saved().

vcpu_context_saved() at the end of the series is testing vprev != vnext.
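
For context, a sketch of that end-of-series shape (assumed here, not
quoted from the series):

    static void vcpu_context_saved(struct vcpu *vprev, struct vcpu *vnext)
    {
        /* Clear the running flag only after the context is saved. */
        smp_wmb();

        if ( vprev != vnext )
            vprev->is_running = 0;
    }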

> 
>>>> +    if ( is_idle_unit(unit) )
>>>> +        return true;
>>>> +
>>>> +    for_each_sched_unit_vcpu ( unit, v )
>>>> +        if ( vcpu_runnable(v) )
>>>> +            return true;
>>>
>>> Isn't the loop going to yield true anyway for idle units? If so, is
>>> there a particular reason for the special casing of idle units up
>>> front here?
>>
>> Didn't I explain that before?
> 
> Quite possible; a good hint that a code comment wouldn't hurt.

Okay.

> 
>> for_each_sched_unit_vcpu() for an idle
>> unit might end prematurely when one of the vcpus is running in another
>> unit (idle_vcpu->sched_unit != idle_unit).
> 
> Oh, that (v)->sched_unit == (i) in the construct is clearly unexpected.
> Is this really still needed by the end of the series? I realize that
> _some_ check is needed, but could this perhaps be arranged in a way
> that you'd still hit all vCPU-s when using it on an idle unit, no
> matter whether they're in use as a substitute in a "real" unit?

I could do that by having another linked list in struct vcpu. This way
I can avoid it.

> As to that construct - why is the parameter named "i" rather than "u"?
> And why "e" in for_each_sched_unit()?

"i" like "item" (somehow this survived the big rename). Can change it.
"e" like "element". I think this is another relic. Can change it, too.


Juergen
Jan Beulich Sept. 24, 2019, 3:22 p.m. UTC | #5
On 24.09.2019 17:09, Jürgen Groß wrote:
> On 24.09.19 17:00, Jan Beulich wrote:
>> On 24.09.2019 16:41, Jürgen Groß wrote:
>>> On 23.09.19 17:41, Jan Beulich wrote:
>>>> On 14.09.2019 10:52, Juergen Gross wrote:
>>>>> @@ -1851,7 +1852,7 @@ void sched_context_switched(struct vcpu *vprev, struct vcpu *vnext)
>>>>>                while ( atomic_read(&next->rendezvous_out_cnt) )
>>>>>                    cpu_relax();
>>>>>        }
>>>>> -    else if ( vprev != vnext )
>>>>> +    else if ( vprev != vnext && sched_granularity == 1 )
>>>>>            context_saved(vprev);
>>>>>    }
>>>>
>>>> Would you mind helping me with understanding why this call is
>>>> needed with a granularity of 1 only?
>>>
>>> Otherwise it is done just a few lines up (granularity 1 will result
>>> in rendezvous_out_cnt being zero).
>>
>> I.e. if rendezvous_out_cnt is zero and vprev != vnext but
>> sched_granularity > 1 the call isn't needed? Why? At the end of
> 
> I can add an ASSERT() here. This should never happen.

Ah yes, this would address my question.

>> the series vcpu_context_saved() gets called in all cases; what's
>> conditional upon granularity being 1 there is the call to
>> unit_context_saved().
> 
> vcpu_context_saved() at the end of the series is testing vprev != vnext.

But that part of the conditional was not in question.

>>> for_each_sched_unit_vcpu() for an idle
>>> unit might end prematurely when one of the vcpus is running in another
>>> unit (idle_vcpu->sched_unit != idle_unit).
>>
>> Oh, that (v)->sched_unit == (i) in the construct is clearly unexpected.
>> Is this really still needed by the end of the series? I realize that
>> _some_ check is needed, but could this perhaps be arranged in a way
>> that you'd still hit all vCPU-s when using it on an idle unit, no
>> matter whether they're in use as a substitute in a "real" unit?
> 
> I could do that by having another linked list in struct vcpu. This way
> I can avoid it.

Oh, no, not another list just for this purpose. I was rather thinking
of e.g. a comparison of IDs.

>> As to that construct - why is the parameter named "i" rather than "u"?
>> And why "e" in for_each_sched_unit()?
> 
> "i" like "item" (somehow this survived the big rename). Can change it.
> "e" like "element". I think this is another relic. Can change it, too.

I'd appreciate this, as it would get this more in line with the other
similar macros.

Jan
Jürgen Groß Sept. 25, 2019, 12:40 p.m. UTC | #6
On 24.09.19 17:22, Jan Beulich wrote:
> On 24.09.2019 17:09, Jürgen Groß wrote:
>> On 24.09.19 17:00, Jan Beulich wrote:
>>> On 24.09.2019 16:41, Jürgen Groß wrote:
>>>> for_each_sched_unit_vcpu() for an idle
>>>> unit might end prematurely when one of the vcpus is running in another
>>>> unit (idle_vcpu->sched_unit != idle_unit).
>>>
>>> Oh, that (v)->sched_unit == (i) in the construct is clearly unexpected.
>>> Is this really still needed by the end of the series? I realize that
>>> _some_ check is needed, but could this perhaps be arranged in a way
>>> that you'd still hit all vCPU-s when using it on an idle unit, no
>>> matter whether they're in use as a substitute in a "real" unit?
>>
>> I could do that by having another linked list in struct vcpu. This way
>> I can avoid it.
> 
> Oh, no, not another list just for this purpose. I was rather thinking
> of e.g. a comparison of IDs.

That would result either in something like:

(v)->vcpu_id < (u)->unit_id + (u)->res->cpupool->granularity

requiring struct sched_resource to be made public, as keyhandler.c
needs for_each_sched_unit_vcpu(), plus being quite expensive, or:

!(u)->next_in_list || (v)->vcpu_id < (u)->next_in_list->unit_id

which seems to be more expensive than the current variant, too.
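
Spelled out as a full iterator, that second variant would look roughly
like this (a sketch only):

    #define for_each_sched_unit_vcpu(u, v)                             \
        for ( (v) = (u)->vcpu_list;                                    \
              (v) != NULL &&                                           \
              (!(u)->next_in_list ||                                   \
               (v)->vcpu_id < (u)->next_in_list->unit_id);             \
              (v) = (v)->next_in_list )

i.e. a vcpu belongs to the unit as long as there is no next unit in
the list, or its vcpu_id is below the id of that next unit.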


Juergen
Jan Beulich Sept. 25, 2019, 1:07 p.m. UTC | #7
On 25.09.2019 14:40, Jürgen Groß  wrote:
> On 24.09.19 17:22, Jan Beulich wrote:
>> On 24.09.2019 17:09, Jürgen Groß wrote:
>>> On 24.09.19 17:00, Jan Beulich wrote:
>>>> On 24.09.2019 16:41, Jürgen Groß wrote:
>>>>> for_each_sched_unit_vcpu() for an idle
>>>>> unit might end prematurely when one of the vcpus is running in another
>>>>> unit (idle_vcpu->sched_unit != idle_unit).
>>>>
>>>> Oh, that (v)->sched_unit == (i) in the construct is clearly unexpected.
>>>> Is this really still needed by the end of the series? I realize that
>>>> _some_ check is needed, but could this perhaps be arranged in a way
>>>> that you'd still hit all vCPU-s when using it on an idle unit, no
>>>> matter whether they're in use as a substitute in a "real" unit?
>>>
>>> I could do that by having another linked list in struct vcpu. This way
>>> I can avoid it.
>>
>> Oh, no, not another list just for this purpose. I was rather thinking
>> of e.g. a comparison of IDs.
> 
> That would result either in something like:
> 
> (v)->vcpu_id < (u)->unit_id + (u)->res->cpupool->granularity
> 
> requiring struct sched_resource to be made public, as keyhandler.c
> needs for_each_sched_unit_vcpu(), plus being quite expensive,

I agree this is not a good option.

> or:
> 
> !(u)->next_in_list || (v)->vcpu_id < (u)->next_in_list->unit_id
> 
> which seems to be more expensive than the current variant, too.

It's not that much more expensive, and it eliminates unexpected
(as I would call it) behavior, so I think I'd go this route. Let's
see if others, in particular Dario or George, have an opinion
either way.

Jan
Dario Faggioli Sept. 26, 2019, 1:53 p.m. UTC | #8
On Wed, 2019-09-25 at 15:07 +0200, Jan Beulich wrote:
> On 25.09.2019 14:40, Jürgen Groß  wrote:
> > On 24.09.19 17:22, Jan Beulich wrote:
> > > On 24.09.2019 17:09, Jürgen Groß wrote:
> > > > On 24.09.19 17:00, Jan Beulich wrote:
> > > > > On 24.09.2019 16:41, Jürgen Groß wrote:
> > > > > > for_each_sched_unit_vcpu() for an idle
> > > > > > unit might end prematurely when one of the vcpus is running
> > > > > > in another
> > > > > > unit (idle_vcpu->sched_unit != idle_unit).
> > > > > 
> > > > > Oh, that (v)->sched_unit == (i) in the construct is clearly
> > > > > unexpected.
> > > > > Is this really still needed by the end of the series? I
> > > > > realize that
> > > > > _some_ check is needed, but could this perhaps be arranged in
> > > > > a way
> > > > > that you'd still hit all vCPU-s when using it on an idle
> > > > > unit, no
> > > > > matter whether they're in use as a substitute in a "real"
> > > > > unit?
> > > > 
> > > > I could do that by having another linked list in struct vcpu.
> > > > This way
> > > > I can avoid it.
> > > 
> > > Oh, no, not another list just for this purpose. I was rather
> > > thinking
> > > of e.g. a comparison of IDs.
> > 
> > That would result either in something like:
> > 
> > (v)->vcpu_id < (u)->unit_id + (u)->res->cpupool->granularity
> > 
> > requiring struct sched_resource to be made public, as keyhandler.c
> > needs for_each_sched_unit_vcpu(), plus being quite expensive,
> 
> I agree this is not a good option.
> 
> > or:
> > 
> > !(u)->next_in_list || (v)->vcpu_id < (u)->next_in_list->unit_id
> > 
> > which seems to be more expensive than the current variant, too.
> 
> It's not that much more expensive, and it eliminates unexpected
> (as I would call it) behavior, so I think I'd go this route. 
>
So, I honestly like the way it's currently done in Juergen's patches.

However, I'm not sure I understand what issue Jan thinks it has, and
in what sense the code/behavior is regarded as "unexpected".

Can you help me see the problem? Maybe, if I realize it, I'd change my
preference...

Thanks and Regards
Jan Beulich Sept. 26, 2019, 2:24 p.m. UTC | #9
On 26.09.2019 15:53, Dario Faggioli wrote:
> On Wed, 2019-09-25 at 15:07 +0200, Jan Beulich wrote:
>> On 25.09.2019 14:40, Jürgen Groß  wrote:
>>> On 24.09.19 17:22, Jan Beulich wrote:
>>>> On 24.09.2019 17:09, Jürgen Groß wrote:
>>>>> On 24.09.19 17:00, Jan Beulich wrote:
>>>>>> On 24.09.2019 16:41, Jürgen Groß wrote:
>>>>>>> for_each_sched_unit_vcpu() for an idle
>>>>>>> unit might end prematurely when one of the vcpus is running
>>>>>>> in another
>>>>>>> unit (idle_vcpu->sched_unit != idle_unit).
>>>>>>
>>>>>> Oh, that (v)->sched_unit == (i) in the construct is clearly
>>>>>> unexpected.
>>>>>> Is this really still needed by the end of the series? I
>>>>>> realize that
>>>>>> _some_ check is needed, but could this perhaps be arranged in
>>>>>> a way
>>>>>> that you'd still hit all vCPU-s when using it on an idle
>>>>>> unit, no
>>>>>> matter whether they're in use as a substitute in a "real"
>>>>>> unit?
>>>>>
>>>>> I could do that by having another linked list in struct vcpu.
>>>>> This way
>>>>> I can avoid it.
>>>>
>>>> Oh, no, not another list just for this purpose. I was rather
>>>> thinking
>>>> of e.g. a comparison of IDs.
>>>
>>> That would result either in something like:
>>>
>>> (v)->vcpu_id < (u)->unit_id + (u)->res->cpupool->granularity
>>>
>>> requiring struct sched_resource to be made public, as keyhandler.c
>>> needs for_each_sched_unit_vcpu(), plus being quite expensive,
>>
>> I agree this is not a good option.
>>
>>> or:
>>>
>>> !(u)->next_in_list || (v)->vcpu_id < (u)->next_in_list->unit_id
>>>
>>> which seems to be more expensive than the current variant, too.
>>
>> It's not that much more expensive, and it eliminates unexpected
>> (as I would call it) behavior, so I think I'd go this route. 
>>
> So, I honestly like the way it's currently done in Juergen's patches.
> 
> However, I'm not sure I understand what issue Jan thinks it has, and
> in what sense the code/behavior is regarded as "unexpected".
> 
> Can you help me see the problem? Maybe, if I realize it, I'd change my
> preference...

The unexpected / surprising behavior is described at the top (i.e.
still visible in context), but I'll quote it again here:

"for_each_sched_unit_vcpu() for an idle unit might end premature
 when one of the vcpus is running in another unit
 (idle_vcpu->sched_unit != idle_unit)"

This started out with me asking about an apparently (but as
Jürgen has clarified not truly) unnecessary special casing of
idle vCPU-s/units/domain ahead of a use of this construct.

Jan
Jürgen Groß Sept. 26, 2019, 2:40 p.m. UTC | #10
On 26.09.19 15:53, Dario Faggioli wrote:
> On Wed, 2019-09-25 at 15:07 +0200, Jan Beulich wrote:
>> On 25.09.2019 14:40, Jürgen Groß  wrote:
>>> On 24.09.19 17:22, Jan Beulich wrote:
>>>> On 24.09.2019 17:09, Jürgen Groß wrote:
>>>>> On 24.09.19 17:00, Jan Beulich wrote:
>>>>>> On 24.09.2019 16:41, Jürgen Groß wrote:
>>>>>>> for_each_sched_unit_vcpu() for an idle
>>>>>>> unit might end prematurely when one of the vcpus is running
>>>>>>> in another
>>>>>>> unit (idle_vcpu->sched_unit != idle_unit).
>>>>>>
>>>>>> Oh, that (v)->sched_unit == (i) in the construct is clearly
>>>>>> unexpected.
>>>>>> Is this really still needed by the end of the series? I
>>>>>> realize that
>>>>>> _some_ check is needed, but could this perhaps be arranged in
>>>>>> a way
>>>>>> that you'd still hit all vCPU-s when using it on an idle
>>>>>> unit, no
>>>>>> matter whether they're in use as a substitute in a "real"
>>>>>> unit?
>>>>>
>>>>> I could do that by having another linked list in struct vcpu.
>>>>> This way
>>>>> I can avoid it.
>>>>
>>>> Oh, no, not another list just for this purpose. I was rather
>>>> thinking
>>>> of e.g. a comparison of IDs.
>>>
>>> That would result either in something like:
>>>
>>> (v)->vcpu_id < (u)->unit_id + (u)->res->cpupool->granularity
>>>
>>> requiring struct sched_resource to be made public, as keyhandler.c
>>> needs for_each_sched_unit_vcpu(), plus being quite expensive,
>>
>> I agree this is not a good option.
>>
>>> or:
>>>
>>> !(u)->next_in_list || (v)->vcpu_id < (u)->next_in_list->unit_id
>>>
>>> which seems to be more expensive than the current variant, too.
>>
>> It's not that much more expensive, and it eliminates unexpected
>> (as I would call it) behavior, so I think I'd go this route.
>>
> So, I honestly like the way it's currently done in Juergen's patches.
> 
> However, I'm not sure I understand what issue Jan thinks it has, and
> in what sense the code/behavior is regarded as "unexpected".
> 
> Can you help me see the problem? Maybe, if I realize it, I'd change my
> preference...

I have meanwhile changed it, and I think the new solution removes a
latent problem. Otherwise one would have to be very careful not to use
for_each_sched_unit_vcpu() for idle units, as this might occasionally
yield wrong results.
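
To make the latent problem concrete, a hypothetical dump loop over an
idle unit (dump_vcpu() is made up for the example):

    /* Core with two threads; idle vcpu 1 currently substitutes in a
     * guest unit, i.e. idle_vcpu[1]->sched_unit != idle_unit. */
    for_each_sched_unit_vcpu ( idle_unit, v )   /* old condition */
        dump_vcpu(v);   /* visits idle vcpu 0 only: the loop ends as
                         * soon as it reaches idle vcpu 1 */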


Juergen
Dario Faggioli Sept. 26, 2019, 4:41 p.m. UTC | #11
On Thu, 2019-09-26 at 16:40 +0200, Jürgen Groß wrote:
> On 26.09.19 15:53, Dario Faggioli wrote:
> > However, I'm not sure I understand what issue Jan thinks it has,
> > and in what sense the code/behavior is regarded as "unexpected".
> > 
> > Can you help me see the problem? Maybe, if I realize it, I'd change
> > my
> > preference...
> 
> I have meanwhile changed it, and I think the new solution removes a
> latent problem. 
>
Ok, I'll comment directly on the new shape of it then, I guess.

Regards

Patch

diff --git a/xen/common/domain.c b/xen/common/domain.c
index fa4023936b..ea6aee3858 100644
--- a/xen/common/domain.c
+++ b/xen/common/domain.c
@@ -1259,7 +1259,10 @@  int vcpu_reset(struct vcpu *v)
     v->async_exception_mask = 0;
     memset(v->async_exception_state, 0, sizeof(v->async_exception_state));
 #endif
-    v->affinity_broken = 0;
+    if ( v->affinity_broken & VCPU_AFFINITY_OVERRIDE )
+        vcpu_temporary_affinity(v, NR_CPUS, VCPU_AFFINITY_OVERRIDE);
+    if ( v->affinity_broken & VCPU_AFFINITY_WAIT )
+        vcpu_temporary_affinity(v, NR_CPUS, VCPU_AFFINITY_WAIT);
     clear_bit(_VPF_blocked, &v->pause_flags);
     clear_bit(_VPF_in_reset, &v->pause_flags);
 
diff --git a/xen/common/schedule.c b/xen/common/schedule.c
index 03bcf796ae..a79065c826 100644
--- a/xen/common/schedule.c
+++ b/xen/common/schedule.c
@@ -243,8 +243,9 @@  static inline void vcpu_runstate_change(
     s_time_t delta;
     struct sched_unit *unit = v->sched_unit;
 
-    ASSERT(v->runstate.state != new_state);
     ASSERT(spin_is_locked(get_sched_res(v->processor)->schedule_lock));
+    if ( v->runstate.state == new_state )
+        return;
 
     vcpu_urgent_count_update(v);
 
@@ -266,15 +267,16 @@  static inline void vcpu_runstate_change(
 static inline void sched_unit_runstate_change(struct sched_unit *unit,
     bool running, s_time_t new_entry_time)
 {
-    struct vcpu *v = unit->vcpu_list;
+    struct vcpu *v;
 
-    if ( running )
-        vcpu_runstate_change(v, v->new_state, new_entry_time);
-    else
-        vcpu_runstate_change(v,
-            ((v->pause_flags & VPF_blocked) ? RUNSTATE_blocked :
-             (vcpu_runnable(v) ? RUNSTATE_runnable : RUNSTATE_offline)),
-            new_entry_time);
+    for_each_sched_unit_vcpu ( unit, v )
+        if ( running )
+            vcpu_runstate_change(v, v->new_state, new_entry_time);
+        else
+            vcpu_runstate_change(v,
+                ((v->pause_flags & VPF_blocked) ? RUNSTATE_blocked :
+                 (vcpu_runnable(v) ? RUNSTATE_runnable : RUNSTATE_offline)),
+                new_entry_time);
 }
 
 void vcpu_runstate_get(struct vcpu *v, struct vcpu_runstate_info *runstate)
@@ -1031,10 +1033,9 @@  int cpu_disable_scheduler(unsigned int cpu)
             if ( cpumask_empty(&online_affinity) &&
                  cpumask_test_cpu(cpu, unit->cpu_hard_affinity) )
             {
-                /* TODO: multiple vcpus per unit. */
-                if ( unit->vcpu_list->affinity_broken )
+                if ( sched_check_affinity_broken(unit) )
                 {
-                    /* The vcpu is temporarily pinned, can't move it. */
+                    /* The unit is temporarily pinned, can't move it. */
                     unit_schedule_unlock_irqrestore(lock, flags, unit);
                     ret = -EADDRINUSE;
                     break;
@@ -1392,17 +1393,17 @@  int vcpu_temporary_affinity(struct vcpu *v, unsigned int cpu, uint8_t reason)
             ret = 0;
             v->affinity_broken &= ~reason;
         }
-        if ( !ret && !v->affinity_broken )
+        if ( !ret && !sched_check_affinity_broken(unit) )
             sched_set_affinity(v, unit->cpu_hard_affinity_saved, NULL);
     }
     else if ( cpu < nr_cpu_ids )
     {
         if ( (v->affinity_broken & reason) ||
-             (v->affinity_broken && v->processor != cpu) )
+             (sched_check_affinity_broken(unit) && v->processor != cpu) )
             ret = -EBUSY;
         else if ( cpumask_test_cpu(cpu, VCPU2ONLINE(v)) )
         {
-            if ( !v->affinity_broken )
+            if ( !sched_check_affinity_broken(unit) )
             {
                 cpumask_copy(unit->cpu_hard_affinity_saved,
                              unit->cpu_hard_affinity);
@@ -1722,14 +1723,14 @@  static void sched_switch_units(struct sched_resource *sd,
              (next->vcpu_list->runstate.state == RUNSTATE_runnable) ?
              (now - next->state_entry_time) : 0, prev->next_time);
 
-    ASSERT(prev->vcpu_list->runstate.state == RUNSTATE_running);
+    ASSERT(unit_running(prev));
 
     TRACE_4D(TRC_SCHED_SWITCH, prev->domain->domain_id, prev->unit_id,
              next->domain->domain_id, next->unit_id);
 
     sched_unit_runstate_change(prev, false, now);
 
-    ASSERT(next->vcpu_list->runstate.state != RUNSTATE_running);
+    ASSERT(!unit_running(next));
     sched_unit_runstate_change(next, true, now);
 
     /*
@@ -1851,7 +1852,7 @@  void sched_context_switched(struct vcpu *vprev, struct vcpu *vnext)
             while ( atomic_read(&next->rendezvous_out_cnt) )
                 cpu_relax();
     }
-    else if ( vprev != vnext )
+    else if ( vprev != vnext && sched_granularity == 1 )
         context_saved(vprev);
 }
 
diff --git a/xen/include/xen/sched-if.h b/xen/include/xen/sched-if.h
index 25ba6f25c9..6a4dbac935 100644
--- a/xen/include/xen/sched-if.h
+++ b/xen/include/xen/sched-if.h
@@ -68,12 +68,32 @@  static inline bool is_idle_unit(const struct sched_unit *unit)
 
 static inline bool is_unit_online(const struct sched_unit *unit)
 {
-    return is_vcpu_online(unit->vcpu_list);
+    struct vcpu *v;
+
+    for_each_sched_unit_vcpu ( unit, v )
+        if ( is_vcpu_online(v) )
+            return true;
+
+    return false;
+}
+
+static inline unsigned int unit_running(const struct sched_unit *unit)
+{
+    return unit->runstate_cnt[RUNSTATE_running];
 }
 
 static inline bool unit_runnable(const struct sched_unit *unit)
 {
-    return vcpu_runnable(unit->vcpu_list);
+    struct vcpu *v;
+
+    if ( is_idle_unit(unit) )
+        return true;
+
+    for_each_sched_unit_vcpu ( unit, v )
+        if ( vcpu_runnable(v) )
+            return true;
+
+    return false;
 }
 
 static inline bool unit_runnable_state(const struct sched_unit *unit)
@@ -102,7 +122,16 @@  static inline bool unit_runnable_state(const struct sched_unit *unit)
 static inline void sched_set_res(struct sched_unit *unit,
                                  struct sched_resource *res)
 {
-    unit->vcpu_list->processor = res->master_cpu;
+    unsigned int cpu = cpumask_first(res->cpus);
+    struct vcpu *v;
+
+    for_each_sched_unit_vcpu ( unit, v )
+    {
+        ASSERT(cpu < nr_cpu_ids);
+        v->processor = cpu;
+        cpu = cpumask_next(cpu, res->cpus);
+    }
+
     unit->res = res;
 }
 
@@ -114,25 +143,37 @@  static inline unsigned int sched_unit_cpu(const struct sched_unit *unit)
 static inline void sched_set_pause_flags(struct sched_unit *unit,
                                          unsigned int bit)
 {
-    __set_bit(bit, &unit->vcpu_list->pause_flags);
+    struct vcpu *v;
+
+    for_each_sched_unit_vcpu ( unit, v )
+        __set_bit(bit, &v->pause_flags);
 }
 
 static inline void sched_clear_pause_flags(struct sched_unit *unit,
                                            unsigned int bit)
 {
-    __clear_bit(bit, &unit->vcpu_list->pause_flags);
+    struct vcpu *v;
+
+    for_each_sched_unit_vcpu ( unit, v )
+        __clear_bit(bit, &v->pause_flags);
 }
 
 static inline void sched_set_pause_flags_atomic(struct sched_unit *unit,
                                                 unsigned int bit)
 {
-    set_bit(bit, &unit->vcpu_list->pause_flags);
+    struct vcpu *v;
+
+    for_each_sched_unit_vcpu ( unit, v )
+        set_bit(bit, &v->pause_flags);
 }
 
 static inline void sched_clear_pause_flags_atomic(struct sched_unit *unit,
                                                   unsigned int bit)
 {
-    clear_bit(bit, &unit->vcpu_list->pause_flags);
+    struct vcpu *v;
+
+    for_each_sched_unit_vcpu ( unit, v )
+        clear_bit(bit, &v->pause_flags);
 }
 
 static inline struct sched_unit *sched_idle_unit(unsigned int cpu)
@@ -458,12 +499,18 @@  static inline int sched_adjust_cpupool(const struct scheduler *s,
 
 static inline void sched_unit_pause_nosync(struct sched_unit *unit)
 {
-    vcpu_pause_nosync(unit->vcpu_list);
+    struct vcpu *v;
+
+    for_each_sched_unit_vcpu ( unit, v )
+        vcpu_pause_nosync(v);
 }
 
 static inline void sched_unit_unpause(struct sched_unit *unit)
 {
-    vcpu_unpause(unit->vcpu_list);
+    struct vcpu *v;
+
+    for_each_sched_unit_vcpu ( unit, v )
+        vcpu_unpause(v);
 }
 
 #define REGISTER_SCHEDULER(x) static const struct scheduler *x##_entry \