[v2] sched/fair: Disable DL server on rcu_torture_disable_rt_throttle()

Message ID 078dd26a-9118-48f7-b9f9-c5476b168f6a@nvidia.com (mailing list archive)
State New

Commit Message

Joel Fernandes March 6, 2025, 1:25 a.m. UTC
Oops, forgot to CC rcu@.


-------- Forwarded Message --------
Subject: [PATCH v2] sched/fair: Disable DL server on
rcu_torture_disable_rt_throttle()
Date: Wed, 5 Mar 2025 20:10:13 -0500
From: Joel Fernandes <joelagnelf@nvidia.com>
To: Ingo Molnar <mingo@redhat.com>, Peter Zijlstra <peterz@infradead.org>, Juri
Lelli <juri.lelli@redhat.com>, Vincent Guittot <vincent.guittot@linaro.org>,
Dietmar Eggemann <dietmar.eggemann@arm.com>, Steven Rostedt
<rostedt@goodmis.org>, Ben Segall <bsegall@google.com>, Mel Gorman
<mgorman@suse.de>, Valentin Schneider <vschneid@redhat.com>
CC: Joel Fernandes <joelagnelf@nvidia.com>, stable@vger.kernel.org, Paul E.
McKenney <paulmck@kernel.org>, linux-kernel@vger.kernel.org

Currently, RCU boost testing in rcutorture is broken because it relies on
having RT throttling disabled. As a result, the test always passes (or
only rarely fails). This happens because RT throttling was recently
replaced by the DL server, which boosts CFS tasks even when rcutorture has
tried to disable throttling (see rcu_torture_disable_rt_throttle()). The
DL server does not consider the sysctl_sched_rt_runtime variable, so RT
tasks can still be preempted by CFS tasks.

Therefore, this patch prevents the DL server from starting when rcutorture
sets sysctl_sched_rt_runtime to -1.
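
For illustration only, the gating described above can be modeled in
userspace roughly as follows. This is a sketch, not kernel code:
rt_bandwidth_enabled() mirrors the kernel helper of the same name, while
struct model_server, model_enqueue() and model_dequeue() are hypothetical
names that stand in for the fair server and the enqueue/dequeue paths
touched by this patch.

```c
#include <assert.h>
#include <stdbool.h>

/* Stand-in for the kernel sysctl; -1 means RT throttling is disabled. */
static int sysctl_sched_rt_runtime = 950000;

/* Mirrors the kernel helper of the same name. */
static inline bool rt_bandwidth_enabled(void)
{
	return sysctl_sched_rt_runtime >= 0;
}

/* Hypothetical, heavily simplified stand-in for the fair server. */
struct model_server {
	bool active;
};

/* First fair task arrives: start the server only if throttling is enabled. */
static void model_enqueue(struct model_server *s, unsigned int *h_nr_queued)
{
	unsigned int before = (*h_nr_queued)++;

	if (!before && rt_bandwidth_enabled())
		s->active = true;
}

/* Last fair task leaves: stop the server only if it was actually started. */
static void model_dequeue(struct model_server *s, unsigned int *h_nr_queued)
{
	assert(*h_nr_queued > 0);	/* model invariant, not a kernel check */
	(*h_nr_queued)--;

	if (!*h_nr_queued && s->active)
		s->active = false;
}
```

In this model, enqueueing a task while sysctl_sched_rt_runtime is -1 leaves
the server inactive, and the matching dequeue does not try to stop a server
that was never started.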

With this patch, boosting in TREE09 fails reliably if RCU_BOOST=n.

Steven also mentioned that this could fix RT use cases where users do not
want the DL server to interfere.

Cc: stable@vger.kernel.org
Cc: Paul E. McKenney <paulmck@kernel.org>
Cc: Steven Rostedt <rostedt@goodmis.org>
Fixes: cea5a3472ac4 ("sched/fair: Cleanup fair_server")
Signed-off-by: Joel Fernandes <joelagnelf@nvidia.com>
---
v1->v2:
	Updated Fixes tag (Steven)
	Moved the stoppage of DL server to fair (Juri)

 kernel/sched/fair.c | 14 ++++++++------
 1 file changed, 8 insertions(+), 6 deletions(-)

Patch

diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c
index 1c0ef435a7aa..d7ba333393f2 100644
--- a/kernel/sched/fair.c
+++ b/kernel/sched/fair.c
@@ -1242,7 +1242,7 @@ static void update_curr(struct cfs_rq *cfs_rq)
 		 *    against fair_server such that it can account for this time
 		 *    and possibly avoid running this period.
 		 */
-		if (dl_server_active(&rq->fair_server))
+		if (dl_server_active(&rq->fair_server) && rt_bandwidth_enabled())
 			dl_server_update(&rq->fair_server, delta_exec);
 	}
 
@@ -5957,7 +5957,7 @@ static bool throttle_cfs_rq(struct cfs_rq *cfs_rq)
 	sub_nr_running(rq, queued_delta);
 
 	/* Stop the fair server if throttling resulted in no runnable tasks */
-	if (rq_h_nr_queued && !rq->cfs.h_nr_queued)
+	if (rq_h_nr_queued && !rq->cfs.h_nr_queued && dl_server_active(&rq->fair_server))
 		dl_server_stop(&rq->fair_server);
 done:
 	/*
@@ -6056,7 +6056,7 @@ void unthrottle_cfs_rq(struct cfs_rq *cfs_rq)
 	}
 
 	/* Start the fair server if un-throttling resulted in new runnable tasks */
-	if (!rq_h_nr_queued && rq->cfs.h_nr_queued)
+	if (!rq_h_nr_queued && rq->cfs.h_nr_queued && rt_bandwidth_enabled())
 		dl_server_start(&rq->fair_server);
 
 	/* At this point se is NULL and we are at root level*/
@@ -7005,9 +7005,11 @@ enqueue_task_fair(struct rq *rq, struct task_struct *p, int flags)
 
 	if (!rq_h_nr_queued && rq->cfs.h_nr_queued) {
 		/* Account for idle runtime */
-		if (!rq->nr_running)
+		if (!rq->nr_running && rt_bandwidth_enabled())
 			dl_server_update_idle_time(rq, rq->curr);
-		dl_server_start(&rq->fair_server);
+
+		if (rt_bandwidth_enabled())
+			dl_server_start(&rq->fair_server);
 	}
 
 	/* At this point se is NULL and we are at root level*/
@@ -7134,7 +7136,7 @@ static int dequeue_entities(struct rq *rq, struct sched_entity *se, int flags)
 
 	sub_nr_running(rq, h_nr_queued);
 
-	if (rq_h_nr_queued && !rq->cfs.h_nr_queued)
+	if (rq_h_nr_queued && !rq->cfs.h_nr_queued && dl_server_active(&rq->fair_server))
 		dl_server_stop(&rq->fair_server);
 
 	/* balance early to pull high priority tasks */