mbox series

[0/4] Allow error capture without a request / on reset failure

Message ID 20230112025311.2577084-1-John.C.Harrison@Intel.com (mailing list archive)
Headers show
Series Allow error capture without a request / on reset failure | expand

Message

John Harrison Jan. 12, 2023, 2:53 a.m. UTC
From: John Harrison <John.C.Harrison@Intel.com>

It is technically possible to get a hung context without a valid
request. In such a situation, try to provide as much information in
the error capture as possible rather than just aborting and capturing
nothing.

Similarly, in the case of an engine reset failure the GuC is not able
to report the guilty context. So try a manual search instead of
reporting nothing.

Signed-off-by: John Harrison <John.C.Harrison@Intel.com>


John Harrison (4):
  drm/i915: Allow error capture without a request
  drm/i915: Allow error capture of a pending request
  drm/i915/guc: Look for a guilty context when an engine reset fails
  drm/i915/guc: Add a debug print on GuC triggered reset

 .../gpu/drm/i915/gt/uc/intel_guc_submission.c | 21 ++++++-
 drivers/gpu/drm/i915/i915_gpu_error.c         | 59 +++++++++++++------
 2 files changed, 60 insertions(+), 20 deletions(-)