[40/41] drm/i915/gt: Enable ring scheduling for gen5-7


Message ID

20210125140136.10494-40-chris@chris-wilson.co.uk (mailing list archive)

State

New, archived

Headers

DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org C26AC230FD
From: Chris Wilson <chris@chris-wilson.co.uk>
To: intel-gfx@lists.freedesktop.org
Date: Mon, 25 Jan 2021 14:01:35 +0000
Message-Id: <20210125140136.10494-40-chris@chris-wilson.co.uk>
In-Reply-To: <20210125140136.10494-1-chris@chris-wilson.co.uk>
References: <20210125140136.10494-1-chris@chris-wilson.co.uk>
MIME-Version: 1.0
Subject: [Intel-gfx] [PATCH 40/41] drm/i915/gt: Enable ring scheduling for
 gen5-7
Precedence: list
Cc: thomas.hellstrom@intel.com, Chris Wilson <chris@chris-wilson.co.uk>
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit
Errors-To: intel-gfx-bounces@lists.freedesktop.org
Sender: "Intel-gfx" <intel-gfx-bounces@lists.freedesktop.org>

Series

[01/41] drm/i915/selftests: Check for engine-reset errors in the middle of workarounds

Commit Message

Chris Wilson Jan. 25, 2021, 2:01 p.m. UTC

Switch over from FIFO global submission to the priority-sorted
topological scheduler. At the cost of more busy work on the CPU to
keep the GPU supplied with the next packet of requests, this allows us
to reorder requests around submission stalls and so achieve low latency
under load while maintaining fairness between clients.
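
To make the difference concrete, here is a minimal toy model in C, not
the i915 implementation. All names (toy_request, pick_fifo, pick_sorted)
are hypothetical, and the model assumes each request has at most one
dependency and ignores priority inheritance and fairness. It only shows
how a FIFO queue is blocked by a stalled head request while a
priority-sorted pick that respects dependencies can run other work.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical toy request; this is not struct i915_request. */
struct toy_request {
	int prio;	/* higher value is preferred */
	int dep;	/* index of an earlier request we wait on, or -1 */
	bool stalled;	/* waiting on something outside this queue */
	bool done;	/* already executed */
};

/* A request may run once it is not stalled and its dependency is done. */
static bool is_ready(const struct toy_request *rq, int i)
{
	/* Dependencies must point at earlier submissions (keeps a DAG). */
	assert(rq[i].dep < i);

	if (rq[i].done || rq[i].stalled)
		return false;

	return rq[i].dep < 0 || rq[rq[i].dep].done;
}

/* FIFO: only the oldest incomplete request may run; a stall blocks all. */
static int pick_fifo(const struct toy_request *rq, int count)
{
	for (int i = 0; i < count; i++) {
		if (rq[i].done)
			continue;
		return is_ready(rq, i) ? i : -1;
	}
	return -1;
}

/* Sorted: highest-priority ready request, oldest first on a tie. */
static int pick_sorted(const struct toy_request *rq, int count)
{
	int best = -1;

	for (int i = 0; i < count; i++) {
		if (!is_ready(rq, i))
			continue;
		if (best < 0 || rq[i].prio > rq[best].prio)
			best = i;
	}
	return best;
}
```

With a stalled request at the head of the queue, pick_fifo() in this
model reports nothing to run, whereas pick_sorted() still finds ready
work behind it. The extra scan on every pick is a small-scale version of
the additional CPU busy work mentioned above.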

The downside is that we enable interrupts on all requests (unlike with
execlists where we have an interrupt for context switches). This means
that instead of receiving an interrupt only when we are waiting for
completion, we are processing them all the time, with noticeable
overhead of CPU time absorbed by the interrupt handler. The effect is
most pronounced on CPU-throughput limited renderers like uxa, where
performance can be degraded by 20% in the worst case. Nevertheless, this
is a pathological example of an obsolete userspace driver. (There are
also cases where uxa performs better by 20%, which is an interesting
quirk...) The glxgears-not-a-benchmark (CPU throughput bound) is one
such example of a performance hit, only affecting uxa.

The expectation is that allowing request reordering will give a much
smoother UX that more than compensates for the reduced throughput under
high submission load (but low GPU load).

This also enables the timer based RPS for better powersaving, with the
exception of Valleyview whose PCU doesn't take kindly to our
interference.
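
As a rough illustration of that selection, the following C sketch
mirrors the condition changed in the intel_rps_enable() hunk of this
patch. The enum and function names are hypothetical, and the driver
state is reduced to plain parameters. Treat it as a reading aid rather
than driver code.

```c
#include <stdbool.h>

/* Hypothetical labels for how RPS gets driven in this toy model. */
enum toy_rps_mode {
	TOY_RPS_NONE,		/* neither path selected here */
	TOY_RPS_TIMER,		/* timer-based sampling of busy stats */
	TOY_RPS_INTERRUPTS,	/* PM interrupt thresholds */
};

/*
 * Assumed inputs: whether the engines provide busy stats, whether the
 * platform is Valleyview, and the graphics generation number.
 */
static enum toy_rps_mode toy_rps_select(bool has_busy_stats,
					bool is_valleyview,
					int gen)
{
	/* Valleyview is excluded from the timer even with busy stats. */
	if (has_busy_stats && !is_valleyview)
		return TOY_RPS_TIMER;

	if (gen >= 6)
		return TOY_RPS_INTERRUPTS;

	return TOY_RPS_NONE;
}
```

In this model a gen7 Valleyview part with busy stats still falls back to
the interrupt path, which matches the exception described above.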

References: 0f46832fab77 ("drm/i915: Mask USER interrupts on gen6 (until required)")
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
---
 drivers/gpu/drm/i915/gem/selftests/i915_gem_context.c | 2 +-
 drivers/gpu/drm/i915/gt/intel_engine_cs.c             | 2 ++
 drivers/gpu/drm/i915/gt/intel_rps.c                   | 6 ++----
 3 files changed, 5 insertions(+), 5 deletions(-)

diff --git a/drivers/gpu/drm/i915/gem/selftests/i915_gem_context.c b/drivers/gpu/drm/i915/gem/selftests/i915_gem_context.c
index d3f87dc4eda3..2246b5c308dc 100644
--- a/drivers/gpu/drm/i915/gem/selftests/i915_gem_context.c
+++ b/drivers/gpu/drm/i915/gem/selftests/i915_gem_context.c
@@ -94,7 +94,7 @@  static int live_nop_switch(void *arg)
 			rq = i915_request_get(this);
 			i915_request_add(this);
 		}
-		if (i915_request_wait(rq, 0, HZ / 5) < 0) {
+		if (i915_request_wait(rq, 0, HZ) < 0) {
 			pr_err("Failed to populated %d contexts\n", nctx);
 			intel_gt_set_wedged(&i915->gt);
 			i915_request_put(rq);
diff --git a/drivers/gpu/drm/i915/gt/intel_engine_cs.c b/drivers/gpu/drm/i915/gt/intel_engine_cs.c
index 936820b240dd..99d910f2c172 100644
--- a/drivers/gpu/drm/i915/gt/intel_engine_cs.c
+++ b/drivers/gpu/drm/i915/gt/intel_engine_cs.c
@@ -868,6 +868,8 @@  int intel_engines_init(struct intel_gt *gt)
 		setup = intel_guc_submission_setup;
 	else if (HAS_EXECLISTS(gt->i915))
 		setup = intel_execlists_submission_setup;
+	else if (INTEL_GEN(gt->i915) >= 5)
+		setup = intel_ring_scheduler_setup;
 	else
 		setup = intel_ring_submission_setup;
 
diff --git a/drivers/gpu/drm/i915/gt/intel_rps.c b/drivers/gpu/drm/i915/gt/intel_rps.c
index 900c20a6d073..2c78d61e7ea9 100644
--- a/drivers/gpu/drm/i915/gt/intel_rps.c
+++ b/drivers/gpu/drm/i915/gt/intel_rps.c
@@ -1081,9 +1081,7 @@  static bool gen6_rps_enable(struct intel_rps *rps)
 	intel_uncore_write_fw(uncore, GEN6_RP_DOWN_TIMEOUT, 50000);
 	intel_uncore_write_fw(uncore, GEN6_RP_IDLE_HYSTERSIS, 10);
 
-	rps->pm_events = (GEN6_PM_RP_UP_THRESHOLD |
-			  GEN6_PM_RP_DOWN_THRESHOLD |
-			  GEN6_PM_RP_DOWN_TIMEOUT);
+	rps->pm_events = GEN6_PM_RP_UP_THRESHOLD | GEN6_PM_RP_DOWN_THRESHOLD;
 
 	return rps_reset(rps);
 }
@@ -1391,7 +1389,7 @@  void intel_rps_enable(struct intel_rps *rps)
 	GEM_BUG_ON(rps->efficient_freq < rps->min_freq);
 	GEM_BUG_ON(rps->efficient_freq > rps->max_freq);
 
-	if (has_busy_stats(rps))
+	if (has_busy_stats(rps) && !IS_VALLEYVIEW(i915))
 		intel_rps_set_timer(rps);
 	else if (INTEL_GEN(i915) >= 6)
 		intel_rps_set_interrupts(rps);
