[22/22] semaphore-no-stats

Message ID	20190204132214.9459-23-chris@chris-wilson.co.uk (mailing list archive)
State	New, archived
Headers

Return-Path: <intel-gfx-bounces@lists.freedesktop.org>
From: Chris Wilson <chris@chris-wilson.co.uk>
To: intel-gfx@lists.freedesktop.org
Date: Mon, 4 Feb 2019 13:22:14 +0000
Message-Id: <20190204132214.9459-23-chris@chris-wilson.co.uk>
In-Reply-To: <20190204132214.9459-1-chris@chris-wilson.co.uk>
References: <20190204132214.9459-1-chris@chris-wilson.co.uk>
MIME-Version: 1.0
Subject: [Intel-gfx] [PATCH 22/22] semaphore-no-stats
Precedence: list
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: base64
Errors-To: intel-gfx-bounces@lists.freedesktop.org
Sender: "Intel-gfx" <intel-gfx-bounces@lists.freedesktop.org>

Series

[01/22] drm/i915/execlists: Suppress mere WAIT preemption
[02/22] drm/i915/execlists: Suppress redundant preemption
[03/22] drm/i915/selftests: Exercise some AB...BA preemption chains
[04/22] drm/i915: Trim NEWCLIENT boosting
[05/22] drm/i915: Show support for accurate sw PMU busyness tracking
[06/22] drm/i915: Revoke mmaps and prevent access to fence registers across reset
[07/22] drm/i915: Force the GPU reset upon wedging
[08/22] drm/i915: Uninterruptibly drain the timelines on unwedging
[09/22] drm/i915: Wait for old resets before applying debugfs/i915_wedged
[10/22] drm/i915: Serialise resets with wedging
[11/22] drm/i915: Don't claim an unstarted request was guilty
[12/22] drm/i915: Generalise GPU activity tracking
[13/22] drm/i915: Release the active tracker tree upon idling
[14/22] drm/i915: Allocate active tracking nodes from a slabcache
[15/22] drm/i915: Make request allocation caches global
[16/22] drm/i915: Add timeline barrier support
[17/22] drm/i915: Pull i915_gem_active into the i915_active family
[18/22] drm/i915: Keep timeline HWSP allocated until idle across the system
[19/22] drm/i915/execlists: Refactor out can_merge_rq()
[20/22] drm/i915: Use HW semaphores for inter-engine synchronisation on gen8+
[21/22] drm/i915: Prioritise non-busywait semaphore workloads
[22/22] semaphore-no-stats

Comments

Tvrtko Ursulin Feb. 5, 2019, 10:03 a.m. UTC | #1

On 04/02/2019 13:22, Chris Wilson wrote:
> SW PMU reports semaphore time as busy, HW PMU reports semaphore time as
> idle. Who is correct?

[It's not really HW PMU, it's a different implementation of the SW PMU. :)]

As an additional data point, HW tracking of accumulated total context 
runtime as stored in the PPHWSP also reports semaphore spin time 
(polling mode) as context running.

So overall, from the point of view of busy being the opposite of idle,
it is kind of correct, regardless of whether the engine is doing
something useful or not: it is unavailable to other contexts due to
some action of the currently executing context.

In this light we could view busy as the aggregate of busy and semaphore
wait. (MI_WAIT_EVENT is an open question.) But there is indeed an
inconsistency on platforms which cannot do context tracking.

Therefore option a): add semaphore wait time to busy when reporting
busy on those platforms.

Advantage - the PMU sampling timer is already running on these
platforms, so the additional cost is small.

From the point of view of wanting to make busy mean "useful" work, that
seems much harder.

Option b) could be to subtract semaphore wait time from busy on the
other set of platforms.

Disadvantage - this would mean running the PMU sampling timer when it
does not need to run today.

So I am leaning towards option a). Engine busy time semantics would 
therefore be defined as engine not being idle = occupied by a context 
doing something.
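
To make the two options concrete, here is a minimal sketch of the accounting, not i915 code. The struct, its field names and both helper functions are hypothetical; it only assumes that each engine exposes a busy counter and a sampled semaphore-wait counter in nanoseconds, and that we know whether busy comes from context tracking (which already counts semaphore spin as busy) or from the sampling timer (which counts it as idle).

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical per-engine counters, all in nanoseconds. */
struct engine_counters {
	uint64_t busy_ns;      /* busy time as the platform natively reports it */
	uint64_t sema_ns;      /* time sampled as waiting on a semaphore */
	bool context_tracking; /* true: busy_ns comes from context in/out tracking */
};

/*
 * Option a): busy means "engine not idle".
 * Context tracking already includes semaphore spin, so only the
 * sampling platforms need the semaphore time added on top.
 */
static uint64_t busy_option_a(const struct engine_counters *c)
{
	if (c->context_tracking)
		return c->busy_ns;

	return c->busy_ns + c->sema_ns;
}

/*
 * Option b): busy means "useful work".
 * Sampling platforms already exclude semaphore spin, so only the
 * context tracking platforms need it subtracted. Clamp at zero in
 * case the sampled semaphore time overshoots the tracked busy time.
 */
static uint64_t busy_option_b(const struct engine_counters *c)
{
	if (!c->context_tracking)
		return c->busy_ns;

	return c->busy_ns > c->sema_ns ? c->busy_ns - c->sema_ns : 0;
}
```

With either helper the two kinds of platform report the same value for the same workload; the difference is which side pays for it. Option b) needs sema_ns on the context tracking platforms, which is exactly where the sampling timer is not running today.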

Regards,

Tvrtko

> ---
>   drivers/gpu/drm/i915/intel_lrc.c | 1 -
>   1 file changed, 1 deletion(-)
> 
> diff --git a/drivers/gpu/drm/i915/intel_lrc.c b/drivers/gpu/drm/i915/intel_lrc.c
> index ae90ce034252..d00b268ed6ee 100644
> --- a/drivers/gpu/drm/i915/intel_lrc.c
> +++ b/drivers/gpu/drm/i915/intel_lrc.c
> @@ -2308,7 +2308,6 @@ void intel_execlists_set_default_submission(struct intel_engine_cs *engine)
>   	engine->unpark = NULL;
>   
>   	engine->flags |= I915_ENGINE_HAS_SEMAPHORES;
> -	engine->flags |= I915_ENGINE_SUPPORTS_STATS;
>   	if (engine->i915->preempt_context)
>   		engine->flags |= I915_ENGINE_HAS_PREEMPTION;
>   }
>

Chris Wilson Feb. 5, 2019, 10:07 a.m. UTC | #2

Quoting Tvrtko Ursulin (2019-02-05 10:03:04)
> 
> On 04/02/2019 13:22, Chris Wilson wrote:
> > SW PMU reports semaphore time as busy, HW PMU reports semaphore time as
> > idle. Who is correct?
> 
> [It's not really HW PMU, it's a different implementation of the SW PMU. :)]
> 
> As an additional data point, HW tracking of accumulated total context 
> runtime as stored in the PPHWSP also reports semaphore spin time 
> (polling mode) as context running.
> 
> So overall, from the point of view of busy being the opposite of idle,
> it is kind of correct, regardless of whether the engine is doing
> something useful or not: it is unavailable to other contexts due to
> some action of the currently executing context.
> 
> In this light we could view busy as the aggregate of busy and semaphore
> wait. (MI_WAIT_EVENT is an open question.) But there is indeed an
> inconsistency on platforms which cannot do context tracking.
> 
> Therefore option a): add semaphore wait time to busy when reporting
> busy on those platforms.
> 
> Advantage - the PMU sampling timer is already running on these
> platforms, so the additional cost is small.
> 
> From the point of view of wanting to make busy mean "useful" work, that
> seems much harder.
> 
> Option b) could be to subtract semaphore wait time from busy on the
> other set of platforms.
> 
> Disadvantage - this would mean running the PMU sampling timer when it
> does not need to run today.
> 
> So I am leaning towards option a). Engine busy time semantics would 
> therefore be defined as engine not being idle = occupied by a context 
> doing something.

(a) is fine by me. The disadvantage is that if clients care about the
spinning they need to account for it themselves... Or they opt out of
semaphores (but they really want a global switch rather than per-context
for accurate system tracking). Is this the compelling reason to have a
context param?
-Chris

diff --git a/drivers/gpu/drm/i915/intel_lrc.c b/drivers/gpu/drm/i915/intel_lrc.c
index ae90ce034252..d00b268ed6ee 100644
--- a/drivers/gpu/drm/i915/intel_lrc.c
+++ b/drivers/gpu/drm/i915/intel_lrc.c
@@ -2308,7 +2308,6 @@  void intel_execlists_set_default_submission(struct intel_engine_cs *engine)
 	engine->unpark = NULL;
 
 	engine->flags |= I915_ENGINE_HAS_SEMAPHORES;
-	engine->flags |= I915_ENGINE_SUPPORTS_STATS;
 	if (engine->i915->preempt_context)
 		engine->flags |= I915_ENGINE_HAS_PREEMPTION;
 }
