[08/41] drm/i915: Improve DFS for priority inheritance

Message ID	20210125140136.10494-8-chris@chris-wilson.co.uk (mailing list archive)
State	New, archived
Headers	show Return-Path: <SRS0=x+Hj=G4=lists.freedesktop.org=intel-gfx-bounces@kernel.org> DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org B248323102 From: Chris Wilson <chris@chris-wilson.co.uk> To: intel-gfx@lists.freedesktop.org Date: Mon, 25 Jan 2021 14:01:03 +0000 Message-Id: <20210125140136.10494-8-chris@chris-wilson.co.uk> In-Reply-To: <20210125140136.10494-1-chris@chris-wilson.co.uk> References: <20210125140136.10494-1-chris@chris-wilson.co.uk> MIME-Version: 1.0 Subject: [Intel-gfx] [PATCH 08/41] drm/i915: Improve DFS for priority inheritance Precedence: list Cc: thomas.hellstrom@intel.com, Chris Wilson <chris@chris-wilson.co.uk> Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Errors-To: intel-gfx-bounces@lists.freedesktop.org Sender: "Intel-gfx" <intel-gfx-bounces@lists.freedesktop.org>
Series	[01/41] drm/i915/selftests: Check for engine-reset errors in the middle of workarounds \| expand [01/41] drm/i915/selftests: Check for engine-reset errors in the middle of workarounds [02/41] drm/i915/gt: Move the defer_request waiter active assertion [03/41] drm/i915: Replace engine->schedule() with a known request operation [04/41] drm/i915: Teach the i915_dependency to use a double-lock [05/41] drm/i915: Restructure priority inheritance [06/41] drm/i915/selftests: Measure set-priority duration [07/41] drm/i915/selftests: Exercise priority inheritance around an engine loop [08/41] drm/i915: Improve DFS for priority inheritance [09/41] drm/i915/selftests: Exercise relative mmio paths to non-privileged registers [10/41] drm/i915/selftests: Exercise cross-process context isolation [11/41] drm/i915: Extract request submission from execlists [12/41] drm/i915: Extract request rewinding from execlists [13/41] drm/i915: Extract request suspension from the execlists [14/41] drm/i915: Extract the ability to defer and rerun a request later [15/41] drm/i915: Fix the iterative dfs for defering requests [16/41] drm/i915: Move common active lists from engine to i915_scheduler [17/41] drm/i915: Move scheduler queue [18/41] drm/i915: Move tasklet from execlists to sched [19/41] drm/i915/gt: Show scheduler queues when dumping state [20/41] drm/i915: Replace priolist rbtree with a skiplist [21/41] drm/i915: Wrap cmpxchg64 with try_cmpxchg64() helper [22/41] drm/i915: Fair low-latency scheduling [23/41] drm/i915/gt: Specify a deadline for the heartbeat [24/41] drm/i915: Extend the priority boosting for the display with a deadline [25/41] drm/i915/gt: Support virtual engine queues [26/41] drm/i915: Move saturated workload detection back to the context [27/41] drm/i915: Bump default timeslicing quantum to 5ms [28/41] drm/i915/gt: Wrap intel_timeline.has_initial_breadcrumb [29/41] drm/i915/gt: Track timeline GGTT offset separately from subpage offset [30/41] drm/i915/gt: Add timeline "mode" [31/41] drm/i915/gt: Use indices for writing into relative timelines [32/41] drm/i915/selftests: Exercise relative timeline modes [33/41] drm/i915/gt: Use ppHWSP for unshared non-semaphore related timelines [34/41] Restore "drm/i915: drop engine_pin/unpin_breadcrumbs_irq" [35/41] drm/i915/gt: Couple tasklet scheduling for all CS interrupts [36/41] drm/i915/gt: Support creation of 'internal' rings [37/41] drm/i915/gt: Use client timeline address for seqno writes [38/41] drm/i915/gt: Infrastructure for ring scheduling [39/41] drm/i915/gt: Implement ring scheduler for gen4-7 [40/41] drm/i915/gt: Enable ring scheduling for gen5-7 [41/41] drm/i915: Support secure dispatch on gen6/gen7

Chris Wilson Jan. 25, 2021, 2:01 p.m. UTC

The core of the scheduling algorithm is that we compute the topological
order of the fence DAG. Knowing that we have a DAG, we should be able to
use a DFS to compute the topological sort in linear time. However,
during the conversion of the recursive algorithm into an iterative one,
the memoization of how far we had progressed down a branch was
forgotten. The result was that instead of running in linear time, it was
running in geometric time and could easily run for a few hundred
milliseconds given a wide enough graph, not the microseconds as required.

Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
---
 drivers/gpu/drm/i915/i915_scheduler.c | 58 ++++++++++++++++-----------
 1 file changed, 34 insertions(+), 24 deletions(-)

Tvrtko Ursulin Jan. 26, 2021, 4:22 p.m. UTC | #1

On 25/01/2021 14:01, Chris Wilson wrote:
> The core of the scheduling algorithm is that we compute the topological
> order of the fence DAG. Knowing that we have a DAG, we should be able to
> use a DFS to compute the topological sort in linear time. However,
> during the conversion of the recursive algorithm into an iterative one,
> the memoization of how far we had progressed down a branch was
> forgotten. The result was that instead of running in linear time, it was
> running in geometric time and could easily run for a few hundred
> milliseconds given a wide enough graph, not the microseconds as required.
> 
> Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
> ---
>   drivers/gpu/drm/i915/i915_scheduler.c | 58 ++++++++++++++++-----------
>   1 file changed, 34 insertions(+), 24 deletions(-)
> 
> diff --git a/drivers/gpu/drm/i915/i915_scheduler.c b/drivers/gpu/drm/i915/i915_scheduler.c
> index 4802c9b1081d..9139a91f0aa3 100644
> --- a/drivers/gpu/drm/i915/i915_scheduler.c
> +++ b/drivers/gpu/drm/i915/i915_scheduler.c
> @@ -234,6 +234,26 @@ void __i915_priolist_free(struct i915_priolist *p)
>   	kmem_cache_free(global.slab_priorities, p);
>   }
>   
> +static struct i915_request *
> +stack_push(struct i915_request *rq,
> +	   struct i915_request *stack,
> +	   struct list_head *pos)
> +{
> +	stack->sched.dfs.prev = pos;
> +	rq->sched.dfs.next = (struct list_head *)stack;
> +	return rq;
> +}
> +
> +static struct i915_request *
> +stack_pop(struct i915_request *rq,
> +	  struct list_head **pos)
> +{
> +	rq = (struct i915_request *)rq->sched.dfs.next;
> +	if (rq)
> +		*pos = rq->sched.dfs.prev;
> +	return rq;
> +}
> +
>   static inline bool need_preempt(int prio, int active)
>   {
>   	/*
> @@ -298,11 +318,10 @@ static void ipi_priority(struct i915_request *rq, int prio)
>   static void __i915_request_set_priority(struct i915_request *rq, int prio)
>   {
>   	struct intel_engine_cs *engine = rq->engine;
> -	struct i915_request *rn;
> +	struct list_head *pos = &rq->sched.signalers_list;
>   	struct list_head *plist;
> -	LIST_HEAD(dfs);
>   
> -	list_add(&rq->sched.dfs, &dfs);
> +	plist = i915_sched_lookup_priolist(engine, prio);
>   
>   	/*
>   	 * Recursively bump all dependent priorities to match the new request.
> @@ -322,40 +341,31 @@ static void __i915_request_set_priority(struct i915_request *rq, int prio)
>   	 * end result is a topological list of requests in reverse order, the
>   	 * last element in the list is the request we must execute first.
>   	 */
> -	list_for_each_entry(rq, &dfs, sched.dfs) {
> -		struct i915_dependency *p;
> -
> -		/* Also release any children on this engine that are ready */
> -		GEM_BUG_ON(rq->engine != engine);
> -
> -		for_each_signaler(p, rq) {
> +	rq->sched.dfs.next = NULL;
> +	do {
> +		list_for_each_continue(pos, &rq->sched.signalers_list) {
> +			struct i915_dependency *p =
> +				list_entry(pos, typeof(*p), signal_link);
>   			struct i915_request *s =
>   				container_of(p->signaler, typeof(*s), sched);
>   
> -			GEM_BUG_ON(s == rq);
> -
>   			if (rq_prio(s) >= prio)
>   				continue;
>   
>   			if (__i915_request_is_complete(s))
>   				continue;
>   
> -			if (s->engine != rq->engine) {
> +			if (s->engine != engine) {
>   				ipi_priority(s, prio);
>   				continue;
>   			}
>   
> -			list_move_tail(&s->sched.dfs, &dfs);
> +			/* Remember our position along this branch */
> +			rq = stack_push(s, rq, pos);
> +			pos = &rq->sched.signalers_list;
>   		}
> -	}
>   
> -	plist = i915_sched_lookup_priolist(engine, prio);
> -
> -	/* Fifo and depth-first replacement ensure our deps execute first */
> -	list_for_each_entry_safe_reverse(rq, rn, &dfs, sched.dfs) {
> -		GEM_BUG_ON(rq->engine != engine);
> -
> -		INIT_LIST_HEAD(&rq->sched.dfs);
> +		RQ_TRACE(rq, "set-priority:%d\n", prio);
>   		WRITE_ONCE(rq->sched.attr.priority, prio);
>   
>   		/*
> @@ -369,12 +379,13 @@ static void __i915_request_set_priority(struct i915_request *rq, int prio)
>   		if (!i915_request_is_ready(rq))
>   			continue;
>   
> +		GEM_BUG_ON(rq->engine != engine);
>   		if (i915_request_in_priority_queue(rq))
>   			list_move_tail(&rq->sched.link, plist);
>   
>   		/* Defer (tasklet) submission until after all updates. */
>   		kick_submission(engine, rq, prio);
> -	}
> +	} while ((rq = stack_pop(rq, &pos)));
>   }
>   
>   void i915_request_set_priority(struct i915_request *rq, int prio)
> @@ -444,7 +455,6 @@ void i915_sched_node_init(struct i915_sched_node *node)
>   	INIT_LIST_HEAD(&node->signalers_list);
>   	INIT_LIST_HEAD(&node->waiters_list);
>   	INIT_LIST_HEAD(&node->link);
> -	INIT_LIST_HEAD(&node->dfs);
>   
>   	node->ipi_link = NULL;
>   
> 

Pen and paper was needed here but it looks good.

Reviewed-by: Tvrtko Ursulin <tvrtko.ursulin@intel.com>

Regards,

Tvrtko

Chris Wilson Jan. 26, 2021, 4:26 p.m. UTC | #2

Quoting Tvrtko Ursulin (2021-01-26 16:22:58)
> 
> 
> On 25/01/2021 14:01, Chris Wilson wrote:
> > The core of the scheduling algorithm is that we compute the topological
> > order of the fence DAG. Knowing that we have a DAG, we should be able to
> > use a DFS to compute the topological sort in linear time. However,
> > during the conversion of the recursive algorithm into an iterative one,
> > the memoization of how far we had progressed down a branch was
> > forgotten. The result was that instead of running in linear time, it was
> > running in geometric time and could easily run for a few hundred
> > milliseconds given a wide enough graph, not the microseconds as required.
> > 
> > Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
> > ---
> >   drivers/gpu/drm/i915/i915_scheduler.c | 58 ++++++++++++++++-----------
> >   1 file changed, 34 insertions(+), 24 deletions(-)
> > 
> > diff --git a/drivers/gpu/drm/i915/i915_scheduler.c b/drivers/gpu/drm/i915/i915_scheduler.c
> > index 4802c9b1081d..9139a91f0aa3 100644
> > --- a/drivers/gpu/drm/i915/i915_scheduler.c
> > +++ b/drivers/gpu/drm/i915/i915_scheduler.c
> > @@ -234,6 +234,26 @@ void __i915_priolist_free(struct i915_priolist *p)
> >       kmem_cache_free(global.slab_priorities, p);
> >   }
> >   
> > +static struct i915_request *
> > +stack_push(struct i915_request *rq,
> > +        struct i915_request *stack,
> > +        struct list_head *pos)
> > +{
> > +     stack->sched.dfs.prev = pos;
> > +     rq->sched.dfs.next = (struct list_head *)stack;
> > +     return rq;
> > +}
> > +
> > +static struct i915_request *
> > +stack_pop(struct i915_request *rq,
> > +       struct list_head **pos)
> > +{
> > +     rq = (struct i915_request *)rq->sched.dfs.next;
> > +     if (rq)
> > +             *pos = rq->sched.dfs.prev;
> > +     return rq;
> > +}
> > +
> >   static inline bool need_preempt(int prio, int active)
> >   {
> >       /*
> > @@ -298,11 +318,10 @@ static void ipi_priority(struct i915_request *rq, int prio)
> >   static void __i915_request_set_priority(struct i915_request *rq, int prio)
> >   {
> >       struct intel_engine_cs *engine = rq->engine;
> > -     struct i915_request *rn;
> > +     struct list_head *pos = &rq->sched.signalers_list;
> >       struct list_head *plist;
> > -     LIST_HEAD(dfs);
> >   
> > -     list_add(&rq->sched.dfs, &dfs);
> > +     plist = i915_sched_lookup_priolist(engine, prio);
> >   
> >       /*
> >        * Recursively bump all dependent priorities to match the new request.
> > @@ -322,40 +341,31 @@ static void __i915_request_set_priority(struct i915_request *rq, int prio)
> >        * end result is a topological list of requests in reverse order, the
> >        * last element in the list is the request we must execute first.
> >        */
> > -     list_for_each_entry(rq, &dfs, sched.dfs) {
> > -             struct i915_dependency *p;
> > -
> > -             /* Also release any children on this engine that are ready */
> > -             GEM_BUG_ON(rq->engine != engine);
> > -
> > -             for_each_signaler(p, rq) {
> > +     rq->sched.dfs.next = NULL;
> > +     do {
> > +             list_for_each_continue(pos, &rq->sched.signalers_list) {
> > +                     struct i915_dependency *p =
> > +                             list_entry(pos, typeof(*p), signal_link);
> >                       struct i915_request *s =
> >                               container_of(p->signaler, typeof(*s), sched);
> >   
> > -                     GEM_BUG_ON(s == rq);
> > -
> >                       if (rq_prio(s) >= prio)
> >                               continue;
> >   
> >                       if (__i915_request_is_complete(s))
> >                               continue;
> >   
> > -                     if (s->engine != rq->engine) {
> > +                     if (s->engine != engine) {
> >                               ipi_priority(s, prio);
> >                               continue;
> >                       }
> >   
> > -                     list_move_tail(&s->sched.dfs, &dfs);
> > +                     /* Remember our position along this branch */
> > +                     rq = stack_push(s, rq, pos);
> > +                     pos = &rq->sched.signalers_list;
> >               }
> > -     }
> >   
> > -     plist = i915_sched_lookup_priolist(engine, prio);
> > -
> > -     /* Fifo and depth-first replacement ensure our deps execute first */
> > -     list_for_each_entry_safe_reverse(rq, rn, &dfs, sched.dfs) {
> > -             GEM_BUG_ON(rq->engine != engine);
> > -
> > -             INIT_LIST_HEAD(&rq->sched.dfs);
> > +             RQ_TRACE(rq, "set-priority:%d\n", prio);
> >               WRITE_ONCE(rq->sched.attr.priority, prio);
> >   
> >               /*
> > @@ -369,12 +379,13 @@ static void __i915_request_set_priority(struct i915_request *rq, int prio)
> >               if (!i915_request_is_ready(rq))
> >                       continue;
> >   
> > +             GEM_BUG_ON(rq->engine != engine);
> >               if (i915_request_in_priority_queue(rq))
> >                       list_move_tail(&rq->sched.link, plist);
> >   
> >               /* Defer (tasklet) submission until after all updates. */
> >               kick_submission(engine, rq, prio);
> > -     }
> > +     } while ((rq = stack_pop(rq, &pos)));
> >   }
> >   
> >   void i915_request_set_priority(struct i915_request *rq, int prio)
> > @@ -444,7 +455,6 @@ void i915_sched_node_init(struct i915_sched_node *node)
> >       INIT_LIST_HEAD(&node->signalers_list);
> >       INIT_LIST_HEAD(&node->waiters_list);
> >       INIT_LIST_HEAD(&node->link);
> > -     INIT_LIST_HEAD(&node->dfs);
> >   
> >       node->ipi_link = NULL;
> >   
> > 
> 
> Pen and paper was needed here but it looks good.

If you highlight the areas that need more commentary, I guess
a theory-of-operation for stack_push/stack_pop?
-Chris

Tvrtko Ursulin Jan. 26, 2021, 4:42 p.m. UTC | #3

On 26/01/2021 16:26, Chris Wilson wrote:
> Quoting Tvrtko Ursulin (2021-01-26 16:22:58)
>>
>>
>> On 25/01/2021 14:01, Chris Wilson wrote:
>>> The core of the scheduling algorithm is that we compute the topological
>>> order of the fence DAG. Knowing that we have a DAG, we should be able to
>>> use a DFS to compute the topological sort in linear time. However,
>>> during the conversion of the recursive algorithm into an iterative one,
>>> the memoization of how far we had progressed down a branch was
>>> forgotten. The result was that instead of running in linear time, it was
>>> running in geometric time and could easily run for a few hundred
>>> milliseconds given a wide enough graph, not the microseconds as required.
>>>
>>> Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
>>> ---
>>>    drivers/gpu/drm/i915/i915_scheduler.c | 58 ++++++++++++++++-----------
>>>    1 file changed, 34 insertions(+), 24 deletions(-)
>>>
>>> diff --git a/drivers/gpu/drm/i915/i915_scheduler.c b/drivers/gpu/drm/i915/i915_scheduler.c
>>> index 4802c9b1081d..9139a91f0aa3 100644
>>> --- a/drivers/gpu/drm/i915/i915_scheduler.c
>>> +++ b/drivers/gpu/drm/i915/i915_scheduler.c
>>> @@ -234,6 +234,26 @@ void __i915_priolist_free(struct i915_priolist *p)
>>>        kmem_cache_free(global.slab_priorities, p);
>>>    }
>>>    
>>> +static struct i915_request *
>>> +stack_push(struct i915_request *rq,
>>> +        struct i915_request *stack,
>>> +        struct list_head *pos)
>>> +{
>>> +     stack->sched.dfs.prev = pos;
>>> +     rq->sched.dfs.next = (struct list_head *)stack;
>>> +     return rq;
>>> +}
>>> +
>>> +static struct i915_request *
>>> +stack_pop(struct i915_request *rq,
>>> +       struct list_head **pos)
>>> +{
>>> +     rq = (struct i915_request *)rq->sched.dfs.next;
>>> +     if (rq)
>>> +             *pos = rq->sched.dfs.prev;
>>> +     return rq;
>>> +}
>>> +
>>>    static inline bool need_preempt(int prio, int active)
>>>    {
>>>        /*
>>> @@ -298,11 +318,10 @@ static void ipi_priority(struct i915_request *rq, int prio)
>>>    static void __i915_request_set_priority(struct i915_request *rq, int prio)
>>>    {
>>>        struct intel_engine_cs *engine = rq->engine;
>>> -     struct i915_request *rn;
>>> +     struct list_head *pos = &rq->sched.signalers_list;
>>>        struct list_head *plist;
>>> -     LIST_HEAD(dfs);
>>>    
>>> -     list_add(&rq->sched.dfs, &dfs);
>>> +     plist = i915_sched_lookup_priolist(engine, prio);
>>>    
>>>        /*
>>>         * Recursively bump all dependent priorities to match the new request.
>>> @@ -322,40 +341,31 @@ static void __i915_request_set_priority(struct i915_request *rq, int prio)
>>>         * end result is a topological list of requests in reverse order, the
>>>         * last element in the list is the request we must execute first.
>>>         */
>>> -     list_for_each_entry(rq, &dfs, sched.dfs) {
>>> -             struct i915_dependency *p;
>>> -
>>> -             /* Also release any children on this engine that are ready */
>>> -             GEM_BUG_ON(rq->engine != engine);
>>> -
>>> -             for_each_signaler(p, rq) {
>>> +     rq->sched.dfs.next = NULL;
>>> +     do {
>>> +             list_for_each_continue(pos, &rq->sched.signalers_list) {
>>> +                     struct i915_dependency *p =
>>> +                             list_entry(pos, typeof(*p), signal_link);
>>>                        struct i915_request *s =
>>>                                container_of(p->signaler, typeof(*s), sched);
>>>    
>>> -                     GEM_BUG_ON(s == rq);
>>> -
>>>                        if (rq_prio(s) >= prio)
>>>                                continue;
>>>    
>>>                        if (__i915_request_is_complete(s))
>>>                                continue;
>>>    
>>> -                     if (s->engine != rq->engine) {
>>> +                     if (s->engine != engine) {
>>>                                ipi_priority(s, prio);
>>>                                continue;
>>>                        }
>>>    
>>> -                     list_move_tail(&s->sched.dfs, &dfs);
>>> +                     /* Remember our position along this branch */
>>> +                     rq = stack_push(s, rq, pos);
>>> +                     pos = &rq->sched.signalers_list;
>>>                }
>>> -     }
>>>    
>>> -     plist = i915_sched_lookup_priolist(engine, prio);
>>> -
>>> -     /* Fifo and depth-first replacement ensure our deps execute first */
>>> -     list_for_each_entry_safe_reverse(rq, rn, &dfs, sched.dfs) {
>>> -             GEM_BUG_ON(rq->engine != engine);
>>> -
>>> -             INIT_LIST_HEAD(&rq->sched.dfs);
>>> +             RQ_TRACE(rq, "set-priority:%d\n", prio);
>>>                WRITE_ONCE(rq->sched.attr.priority, prio);
>>>    
>>>                /*
>>> @@ -369,12 +379,13 @@ static void __i915_request_set_priority(struct i915_request *rq, int prio)
>>>                if (!i915_request_is_ready(rq))
>>>                        continue;
>>>    
>>> +             GEM_BUG_ON(rq->engine != engine);
>>>                if (i915_request_in_priority_queue(rq))
>>>                        list_move_tail(&rq->sched.link, plist);
>>>    
>>>                /* Defer (tasklet) submission until after all updates. */
>>>                kick_submission(engine, rq, prio);
>>> -     }
>>> +     } while ((rq = stack_pop(rq, &pos)));
>>>    }
>>>    
>>>    void i915_request_set_priority(struct i915_request *rq, int prio)
>>> @@ -444,7 +455,6 @@ void i915_sched_node_init(struct i915_sched_node *node)
>>>        INIT_LIST_HEAD(&node->signalers_list);
>>>        INIT_LIST_HEAD(&node->waiters_list);
>>>        INIT_LIST_HEAD(&node->link);
>>> -     INIT_LIST_HEAD(&node->dfs);
>>>    
>>>        node->ipi_link = NULL;
>>>    
>>>
>>
>> Pen and paper was needed here but it looks good.
> 
> If you highlight the areas that need more commentary, I guess
> a theory-of-operation for stack_push/stack_pop?

At some point I wanted to suggest you change dfs.list_head abuse to 
explicit rq and list head pointer to better represent how there are two 
pieces of information tracked in there.

In terms of commentary don't know really. Perhaps it could be made 
clearer just with some code re-structure, for instance maybe a new data 
structure like i915_request_stack would work like:

struct i915_request_stack {
	struct i915_request *prev;
	struct list_head *pos;
};

And then push and pop operate on three distinct data types for clarity, 
request stack being embedded in request. I haven't really thought it 
through to be sure it works so just maybe.

Regards,

Tvrtko

Tvrtko Ursulin Jan. 26, 2021, 4:51 p.m. UTC | #4

On 26/01/2021 16:42, Tvrtko Ursulin wrote:
> 
> On 26/01/2021 16:26, Chris Wilson wrote:
>> Quoting Tvrtko Ursulin (2021-01-26 16:22:58)
>>>
>>>
>>> On 25/01/2021 14:01, Chris Wilson wrote:
>>>> The core of the scheduling algorithm is that we compute the topological
>>>> order of the fence DAG. Knowing that we have a DAG, we should be 
>>>> able to
>>>> use a DFS to compute the topological sort in linear time. However,
>>>> during the conversion of the recursive algorithm into an iterative one,
>>>> the memoization of how far we had progressed down a branch was
>>>> forgotten. The result was that instead of running in linear time, it 
>>>> was
>>>> running in geometric time and could easily run for a few hundred
>>>> milliseconds given a wide enough graph, not the microseconds as 
>>>> required.
>>>>
>>>> Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
>>>> ---
>>>>    drivers/gpu/drm/i915/i915_scheduler.c | 58 
>>>> ++++++++++++++++-----------
>>>>    1 file changed, 34 insertions(+), 24 deletions(-)
>>>>
>>>> diff --git a/drivers/gpu/drm/i915/i915_scheduler.c 
>>>> b/drivers/gpu/drm/i915/i915_scheduler.c
>>>> index 4802c9b1081d..9139a91f0aa3 100644
>>>> --- a/drivers/gpu/drm/i915/i915_scheduler.c
>>>> +++ b/drivers/gpu/drm/i915/i915_scheduler.c
>>>> @@ -234,6 +234,26 @@ void __i915_priolist_free(struct i915_priolist *p)
>>>>        kmem_cache_free(global.slab_priorities, p);
>>>>    }
>>>> +static struct i915_request *
>>>> +stack_push(struct i915_request *rq,
>>>> +        struct i915_request *stack,
>>>> +        struct list_head *pos)
>>>> +{
>>>> +     stack->sched.dfs.prev = pos;
>>>> +     rq->sched.dfs.next = (struct list_head *)stack;
>>>> +     return rq;
>>>> +}
>>>> +
>>>> +static struct i915_request *
>>>> +stack_pop(struct i915_request *rq,
>>>> +       struct list_head **pos)
>>>> +{
>>>> +     rq = (struct i915_request *)rq->sched.dfs.next;
>>>> +     if (rq)
>>>> +             *pos = rq->sched.dfs.prev;
>>>> +     return rq;
>>>> +}
>>>> +
>>>>    static inline bool need_preempt(int prio, int active)
>>>>    {
>>>>        /*
>>>> @@ -298,11 +318,10 @@ static void ipi_priority(struct i915_request 
>>>> *rq, int prio)
>>>>    static void __i915_request_set_priority(struct i915_request *rq, 
>>>> int prio)
>>>>    {
>>>>        struct intel_engine_cs *engine = rq->engine;
>>>> -     struct i915_request *rn;
>>>> +     struct list_head *pos = &rq->sched.signalers_list;
>>>>        struct list_head *plist;
>>>> -     LIST_HEAD(dfs);
>>>> -     list_add(&rq->sched.dfs, &dfs);
>>>> +     plist = i915_sched_lookup_priolist(engine, prio);
>>>>        /*
>>>>         * Recursively bump all dependent priorities to match the new 
>>>> request.
>>>> @@ -322,40 +341,31 @@ static void __i915_request_set_priority(struct 
>>>> i915_request *rq, int prio)
>>>>         * end result is a topological list of requests in reverse 
>>>> order, the
>>>>         * last element in the list is the request we must execute 
>>>> first.
>>>>         */
>>>> -     list_for_each_entry(rq, &dfs, sched.dfs) {
>>>> -             struct i915_dependency *p;
>>>> -
>>>> -             /* Also release any children on this engine that are 
>>>> ready */
>>>> -             GEM_BUG_ON(rq->engine != engine);
>>>> -
>>>> -             for_each_signaler(p, rq) {
>>>> +     rq->sched.dfs.next = NULL;
>>>> +     do {
>>>> +             list_for_each_continue(pos, &rq->sched.signalers_list) {
>>>> +                     struct i915_dependency *p =
>>>> +                             list_entry(pos, typeof(*p), signal_link);
>>>>                        struct i915_request *s =
>>>>                                container_of(p->signaler, typeof(*s), 
>>>> sched);
>>>> -                     GEM_BUG_ON(s == rq);
>>>> -
>>>>                        if (rq_prio(s) >= prio)
>>>>                                continue;
>>>>                        if (__i915_request_is_complete(s))
>>>>                                continue;
>>>> -                     if (s->engine != rq->engine) {
>>>> +                     if (s->engine != engine) {
>>>>                                ipi_priority(s, prio);
>>>>                                continue;
>>>>                        }
>>>> -                     list_move_tail(&s->sched.dfs, &dfs);
>>>> +                     /* Remember our position along this branch */
>>>> +                     rq = stack_push(s, rq, pos);
>>>> +                     pos = &rq->sched.signalers_list;
>>>>                }
>>>> -     }
>>>> -     plist = i915_sched_lookup_priolist(engine, prio);
>>>> -
>>>> -     /* Fifo and depth-first replacement ensure our deps execute 
>>>> first */
>>>> -     list_for_each_entry_safe_reverse(rq, rn, &dfs, sched.dfs) {
>>>> -             GEM_BUG_ON(rq->engine != engine);
>>>> -
>>>> -             INIT_LIST_HEAD(&rq->sched.dfs);
>>>> +             RQ_TRACE(rq, "set-priority:%d\n", prio);
>>>>                WRITE_ONCE(rq->sched.attr.priority, prio);
>>>>                /*
>>>> @@ -369,12 +379,13 @@ static void __i915_request_set_priority(struct 
>>>> i915_request *rq, int prio)
>>>>                if (!i915_request_is_ready(rq))
>>>>                        continue;
>>>> +             GEM_BUG_ON(rq->engine != engine);
>>>>                if (i915_request_in_priority_queue(rq))
>>>>                        list_move_tail(&rq->sched.link, plist);
>>>>                /* Defer (tasklet) submission until after all 
>>>> updates. */
>>>>                kick_submission(engine, rq, prio);
>>>> -     }
>>>> +     } while ((rq = stack_pop(rq, &pos)));
>>>>    }
>>>>    void i915_request_set_priority(struct i915_request *rq, int prio)
>>>> @@ -444,7 +455,6 @@ void i915_sched_node_init(struct i915_sched_node 
>>>> *node)
>>>>        INIT_LIST_HEAD(&node->signalers_list);
>>>>        INIT_LIST_HEAD(&node->waiters_list);
>>>>        INIT_LIST_HEAD(&node->link);
>>>> -     INIT_LIST_HEAD(&node->dfs);
>>>>        node->ipi_link = NULL;
>>>>
>>>
>>> Pen and paper was needed here but it looks good.
>>
>> If you highlight the areas that need more commentary, I guess
>> a theory-of-operation for stack_push/stack_pop?
> 
> At some point I wanted to suggest you change dfs.list_head abuse to 
> explicit rq and list head pointer to better represent how there are two 
> pieces of information tracked in there.
> 
> In terms of commentary don't know really. Perhaps it could be made 
> clearer just with some code re-structure, for instance maybe a new data 
> structure like i915_request_stack would work like:
> 
> struct i915_request_stack {
>      struct i915_request *prev;
>      struct list_head *pos;
> };
> 
> And then push and pop operate on three distinct data types for clarity, 
> request stack being embedded in request. I haven't really thought it 
> through to be sure it works so just maybe.

Ah I remember why I did not suggest this, to avoid wasting one pointer 
because of:

struct list_head {
         struct list_head *next, *prev;
};

There isn't anything for just one.

Regards,

Tvrtko

Chris Wilson Jan. 26, 2021, 4:51 p.m. UTC | #5

Quoting Tvrtko Ursulin (2021-01-26 16:42:37)
> 
> On 26/01/2021 16:26, Chris Wilson wrote:
> > Quoting Tvrtko Ursulin (2021-01-26 16:22:58)
> >>
> >>
> >> On 25/01/2021 14:01, Chris Wilson wrote:
> >>> The core of the scheduling algorithm is that we compute the topological
> >>> order of the fence DAG. Knowing that we have a DAG, we should be able to
> >>> use a DFS to compute the topological sort in linear time. However,
> >>> during the conversion of the recursive algorithm into an iterative one,
> >>> the memoization of how far we had progressed down a branch was
> >>> forgotten. The result was that instead of running in linear time, it was
> >>> running in geometric time and could easily run for a few hundred
> >>> milliseconds given a wide enough graph, not the microseconds as required.
> >>>
> >>> Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
> >>> ---
> >>>    drivers/gpu/drm/i915/i915_scheduler.c | 58 ++++++++++++++++-----------
> >>>    1 file changed, 34 insertions(+), 24 deletions(-)
> >>>
> >>> diff --git a/drivers/gpu/drm/i915/i915_scheduler.c b/drivers/gpu/drm/i915/i915_scheduler.c
> >>> index 4802c9b1081d..9139a91f0aa3 100644
> >>> --- a/drivers/gpu/drm/i915/i915_scheduler.c
> >>> +++ b/drivers/gpu/drm/i915/i915_scheduler.c
> >>> @@ -234,6 +234,26 @@ void __i915_priolist_free(struct i915_priolist *p)
> >>>        kmem_cache_free(global.slab_priorities, p);
> >>>    }
> >>>    
> >>> +static struct i915_request *
> >>> +stack_push(struct i915_request *rq,
> >>> +        struct i915_request *stack,
> >>> +        struct list_head *pos)
> >>> +{
> >>> +     stack->sched.dfs.prev = pos;
> >>> +     rq->sched.dfs.next = (struct list_head *)stack;
> >>> +     return rq;
> >>> +}
> >>> +
> >>> +static struct i915_request *
> >>> +stack_pop(struct i915_request *rq,
> >>> +       struct list_head **pos)
> >>> +{
> >>> +     rq = (struct i915_request *)rq->sched.dfs.next;
> >>> +     if (rq)
> >>> +             *pos = rq->sched.dfs.prev;
> >>> +     return rq;
> >>> +}
> >>> +
> >>>    static inline bool need_preempt(int prio, int active)
> >>>    {
> >>>        /*
> >>> @@ -298,11 +318,10 @@ static void ipi_priority(struct i915_request *rq, int prio)
> >>>    static void __i915_request_set_priority(struct i915_request *rq, int prio)
> >>>    {
> >>>        struct intel_engine_cs *engine = rq->engine;
> >>> -     struct i915_request *rn;
> >>> +     struct list_head *pos = &rq->sched.signalers_list;
> >>>        struct list_head *plist;
> >>> -     LIST_HEAD(dfs);
> >>>    
> >>> -     list_add(&rq->sched.dfs, &dfs);
> >>> +     plist = i915_sched_lookup_priolist(engine, prio);
> >>>    
> >>>        /*
> >>>         * Recursively bump all dependent priorities to match the new request.
> >>> @@ -322,40 +341,31 @@ static void __i915_request_set_priority(struct i915_request *rq, int prio)
> >>>         * end result is a topological list of requests in reverse order, the
> >>>         * last element in the list is the request we must execute first.
> >>>         */
> >>> -     list_for_each_entry(rq, &dfs, sched.dfs) {
> >>> -             struct i915_dependency *p;
> >>> -
> >>> -             /* Also release any children on this engine that are ready */
> >>> -             GEM_BUG_ON(rq->engine != engine);
> >>> -
> >>> -             for_each_signaler(p, rq) {
> >>> +     rq->sched.dfs.next = NULL;
> >>> +     do {
> >>> +             list_for_each_continue(pos, &rq->sched.signalers_list) {
> >>> +                     struct i915_dependency *p =
> >>> +                             list_entry(pos, typeof(*p), signal_link);
> >>>                        struct i915_request *s =
> >>>                                container_of(p->signaler, typeof(*s), sched);
> >>>    
> >>> -                     GEM_BUG_ON(s == rq);
> >>> -
> >>>                        if (rq_prio(s) >= prio)
> >>>                                continue;
> >>>    
> >>>                        if (__i915_request_is_complete(s))
> >>>                                continue;
> >>>    
> >>> -                     if (s->engine != rq->engine) {
> >>> +                     if (s->engine != engine) {
> >>>                                ipi_priority(s, prio);
> >>>                                continue;
> >>>                        }
> >>>    
> >>> -                     list_move_tail(&s->sched.dfs, &dfs);
> >>> +                     /* Remember our position along this branch */
> >>> +                     rq = stack_push(s, rq, pos);
> >>> +                     pos = &rq->sched.signalers_list;
> >>>                }
> >>> -     }
> >>>    
> >>> -     plist = i915_sched_lookup_priolist(engine, prio);
> >>> -
> >>> -     /* Fifo and depth-first replacement ensure our deps execute first */
> >>> -     list_for_each_entry_safe_reverse(rq, rn, &dfs, sched.dfs) {
> >>> -             GEM_BUG_ON(rq->engine != engine);
> >>> -
> >>> -             INIT_LIST_HEAD(&rq->sched.dfs);
> >>> +             RQ_TRACE(rq, "set-priority:%d\n", prio);
> >>>                WRITE_ONCE(rq->sched.attr.priority, prio);
> >>>    
> >>>                /*
> >>> @@ -369,12 +379,13 @@ static void __i915_request_set_priority(struct i915_request *rq, int prio)
> >>>                if (!i915_request_is_ready(rq))
> >>>                        continue;
> >>>    
> >>> +             GEM_BUG_ON(rq->engine != engine);
> >>>                if (i915_request_in_priority_queue(rq))
> >>>                        list_move_tail(&rq->sched.link, plist);
> >>>    
> >>>                /* Defer (tasklet) submission until after all updates. */
> >>>                kick_submission(engine, rq, prio);
> >>> -     }
> >>> +     } while ((rq = stack_pop(rq, &pos)));
> >>>    }
> >>>    
> >>>    void i915_request_set_priority(struct i915_request *rq, int prio)
> >>> @@ -444,7 +455,6 @@ void i915_sched_node_init(struct i915_sched_node *node)
> >>>        INIT_LIST_HEAD(&node->signalers_list);
> >>>        INIT_LIST_HEAD(&node->waiters_list);
> >>>        INIT_LIST_HEAD(&node->link);
> >>> -     INIT_LIST_HEAD(&node->dfs);
> >>>    
> >>>        node->ipi_link = NULL;
> >>>    
> >>>
> >>
> >> Pen and paper was needed here but it looks good.
> > 
> > If you highlight the areas that need more commentary, I guess
> > a theory-of-operation for stack_push/stack_pop?
> 
> At some point I wanted to suggest you change dfs.list_head abuse to 
> explicit rq and list head pointer to better represent how there are two 
> pieces of information tracked in there.

Ok. While writing it I thought some places continued to use it as a
struct list_head, but it appears that this is the only user.
I'll give it a whirl.
-Chris

[08/41] drm/i915: Improve DFS for priority inheritance

Commit Message

Comments

Patch