diff mbox

cpufreq: schedutil: Fix sugov_start versus sugov_update_shared race

Message ID 1499363600-10732-1-git-send-email-markivx@codeaurora.org (mailing list archive)
State Mainlined
Delegated to: Rafael Wysocki
Headers show

Commit Message

Vikram Mulukutla July 6, 2017, 5:53 p.m. UTC
With a shared policy in place, when one of the CPUs in the policy is
hotplugged out and then brought back online, sugov_stop and
sugov_start are called in order.

sugov_stop removes utilization hooks for each CPU in the policy and
does nothing else in the for_each_cpu loop. sugov_start on the other
hand iterates through the CPUs in the policy and re-initializes the
per-cpu structure _and_ adds the utilization hook. This implies that
the scheduler is allowed to invoke a CPU's utilization update hook
when the rest of the per-cpu structures have yet to be re-inited.

Apart from some strange values in tracepoints this doesn't cause a
problem, but if we do end up accessing a pointer from the per-cpu
sugov_cpu structure somewhere in the sugov_update_shared path,
we will likely see crashes since the memset for another CPU in the
policy is free to race with sugov_update_shared from the CPU that is
ready to go. So let's fix this now to first init all per-cpu
structures, and then add the per-cpu utilization update hooks all at
once.

Signed-off-by: Vikram Mulukutla <markivx@codeaurora.org>
---
 kernel/sched/cpufreq_schedutil.c | 5 +++++
 1 file changed, 5 insertions(+)

Comments

Rafael J. Wysocki July 6, 2017, 8:48 p.m. UTC | #1
On Thursday, July 06, 2017 10:53:20 AM Vikram Mulukutla wrote:
> With a shared policy in place, when one of the CPUs in the policy is
> hotplugged out and then brought back online, sugov_stop and
> sugov_start are called in order.
> 
> sugov_stop removes utilization hooks for each CPU in the policy and
> does nothing else in the for_each_cpu loop. sugov_start on the other
> hand iterates through the CPUs in the policy and re-initializes the
> per-cpu structure _and_ adds the utilization hook. This implies that
> the scheduler is allowed to invoke a CPU's utilization update hook
> when the rest of the per-cpu structures have yet to be re-inited.
> 
> Apart from some strange values in tracepoints this doesn't cause a
> problem, but if we do end up accessing a pointer from the per-cpu
> sugov_cpu structure somewhere in the sugov_update_shared path,
> we will likely see crashes since the memset for another CPU in the
> policy is free to race with sugov_update_shared from the CPU that is
> ready to go. So let's fix this now to first init all per-cpu
> structures, and then add the per-cpu utilization update hooks all at
> once.
> 
> Signed-off-by: Vikram Mulukutla <markivx@codeaurora.org>

OK

Any objections anyone?

> ---
>  kernel/sched/cpufreq_schedutil.c | 5 +++++
>  1 file changed, 5 insertions(+)
> 
> diff --git a/kernel/sched/cpufreq_schedutil.c b/kernel/sched/cpufreq_schedutil.c
> index 076a2e3..29a3970 100644
> --- a/kernel/sched/cpufreq_schedutil.c
> +++ b/kernel/sched/cpufreq_schedutil.c
> @@ -610,6 +610,11 @@ static int sugov_start(struct cpufreq_policy *policy)
>  		sg_cpu->sg_policy = sg_policy;
>  		sg_cpu->flags = SCHED_CPUFREQ_RT;
>  		sg_cpu->iowait_boost_max = policy->cpuinfo.max_freq;
> +	}
> +
> +	for_each_cpu(cpu, policy->cpus) {
> +		struct sugov_cpu *sg_cpu = &per_cpu(sugov_cpu, cpu);
> +
>  		cpufreq_add_update_util_hook(cpu, &sg_cpu->update_util,
>  					     policy_is_shared(policy) ?
>  							sugov_update_shared :
>
Viresh Kumar July 7, 2017, 3:25 a.m. UTC | #2
On 06-07-17, 10:53, Vikram Mulukutla wrote:
> With a shared policy in place, when one of the CPUs in the policy is
> hotplugged out and then brought back online, sugov_stop and
> sugov_start are called in order.
> 
> sugov_stop removes utilization hooks for each CPU in the policy and
> does nothing else in the for_each_cpu loop. sugov_start on the other
> hand iterates through the CPUs in the policy and re-initializes the
> per-cpu structure _and_ adds the utilization hook. This implies that
> the scheduler is allowed to invoke a CPU's utilization update hook
> when the rest of the per-cpu structures have yet to be re-inited.
> 
> Apart from some strange values in tracepoints this doesn't cause a
> problem, but if we do end up accessing a pointer from the per-cpu
> sugov_cpu structure somewhere in the sugov_update_shared path,
> we will likely see crashes since the memset for another CPU in the
> policy is free to race with sugov_update_shared from the CPU that is
> ready to go. So let's fix this now to first init all per-cpu
> structures, and then add the per-cpu utilization update hooks all at
> once.
> 
> Signed-off-by: Vikram Mulukutla <markivx@codeaurora.org>
> ---
>  kernel/sched/cpufreq_schedutil.c | 5 +++++
>  1 file changed, 5 insertions(+)
> 
> diff --git a/kernel/sched/cpufreq_schedutil.c b/kernel/sched/cpufreq_schedutil.c
> index 076a2e3..29a3970 100644
> --- a/kernel/sched/cpufreq_schedutil.c
> +++ b/kernel/sched/cpufreq_schedutil.c
> @@ -610,6 +610,11 @@ static int sugov_start(struct cpufreq_policy *policy)
>  		sg_cpu->sg_policy = sg_policy;
>  		sg_cpu->flags = SCHED_CPUFREQ_RT;
>  		sg_cpu->iowait_boost_max = policy->cpuinfo.max_freq;
> +	}
> +
> +	for_each_cpu(cpu, policy->cpus) {
> +		struct sugov_cpu *sg_cpu = &per_cpu(sugov_cpu, cpu);
> +
>  		cpufreq_add_update_util_hook(cpu, &sg_cpu->update_util,
>  					     policy_is_shared(policy) ?
>  							sugov_update_shared :

Acked-by: Viresh Kumar <viresh.kumar@linaro.org>
Rafael J. Wysocki July 12, 2017, 9:33 p.m. UTC | #3
On Friday, July 07, 2017 08:55:09 AM Viresh Kumar wrote:
> On 06-07-17, 10:53, Vikram Mulukutla wrote:
> > With a shared policy in place, when one of the CPUs in the policy is
> > hotplugged out and then brought back online, sugov_stop and
> > sugov_start are called in order.
> > 
> > sugov_stop removes utilization hooks for each CPU in the policy and
> > does nothing else in the for_each_cpu loop. sugov_start on the other
> > hand iterates through the CPUs in the policy and re-initializes the
> > per-cpu structure _and_ adds the utilization hook. This implies that
> > the scheduler is allowed to invoke a CPU's utilization update hook
> > when the rest of the per-cpu structures have yet to be re-inited.
> > 
> > Apart from some strange values in tracepoints this doesn't cause a
> > problem, but if we do end up accessing a pointer from the per-cpu
> > sugov_cpu structure somewhere in the sugov_update_shared path,
> > we will likely see crashes since the memset for another CPU in the
> > policy is free to race with sugov_update_shared from the CPU that is
> > ready to go. So let's fix this now to first init all per-cpu
> > structures, and then add the per-cpu utilization update hooks all at
> > once.
> > 
> > Signed-off-by: Vikram Mulukutla <markivx@codeaurora.org>
> > ---
> >  kernel/sched/cpufreq_schedutil.c | 5 +++++
> >  1 file changed, 5 insertions(+)
> > 
> > diff --git a/kernel/sched/cpufreq_schedutil.c b/kernel/sched/cpufreq_schedutil.c
> > index 076a2e3..29a3970 100644
> > --- a/kernel/sched/cpufreq_schedutil.c
> > +++ b/kernel/sched/cpufreq_schedutil.c
> > @@ -610,6 +610,11 @@ static int sugov_start(struct cpufreq_policy *policy)
> >  		sg_cpu->sg_policy = sg_policy;
> >  		sg_cpu->flags = SCHED_CPUFREQ_RT;
> >  		sg_cpu->iowait_boost_max = policy->cpuinfo.max_freq;
> > +	}
> > +
> > +	for_each_cpu(cpu, policy->cpus) {
> > +		struct sugov_cpu *sg_cpu = &per_cpu(sugov_cpu, cpu);
> > +
> >  		cpufreq_add_update_util_hook(cpu, &sg_cpu->update_util,
> >  					     policy_is_shared(policy) ?
> >  							sugov_update_shared :
> 
> Acked-by: Viresh Kumar <viresh.kumar@linaro.org>

Applied, thanks!
diff mbox

Patch

diff --git a/kernel/sched/cpufreq_schedutil.c b/kernel/sched/cpufreq_schedutil.c
index 076a2e3..29a3970 100644
--- a/kernel/sched/cpufreq_schedutil.c
+++ b/kernel/sched/cpufreq_schedutil.c
@@ -610,6 +610,11 @@  static int sugov_start(struct cpufreq_policy *policy)
 		sg_cpu->sg_policy = sg_policy;
 		sg_cpu->flags = SCHED_CPUFREQ_RT;
 		sg_cpu->iowait_boost_max = policy->cpuinfo.max_freq;
+	}
+
+	for_each_cpu(cpu, policy->cpus) {
+		struct sugov_cpu *sg_cpu = &per_cpu(sugov_cpu, cpu);
+
 		cpufreq_add_update_util_hook(cpu, &sg_cpu->update_util,
 					     policy_is_shared(policy) ?
 							sugov_update_shared :