mbox series

[CI,0/5] Refactor to expand subslice mask

Message ID 20190524154022.13575-1-stuart.summers@intel.com (mailing list archive)
Headers show
Series Refactor to expand subslice mask | expand

Message

Summers, Stuart May 24, 2019, 3:40 p.m. UTC
This patch series contains a few code clean-up patches, followed
by a patch which changes the storage of the subslice mask to better
match the userspace access through the I915_QUERY_TOPOLOGY_INFO
ioctl. The index into the subslice_mask array is then calculated:
  slice * subslice stride + subslice index / 8

v2: fix i915_pm_sseu test failure
v3: no changes to patches in the series, just resending to pick up
    in CI correctly
v4: rebase
v5: fix header test
v6: address review comments from Jari
    address minor checkpatch warning in existing code
    use eu_stride for EU div-by-8
v7: another rebase
v8: address review comments from Tvrtko and Daniele
v9: address review comments from Daniele
v10: add reviewed-by on last patch with minor suggested change,
     rebase, and repost for CI

Stuart Summers (5):
  drm/i915: Use local variable for SSEU info in GETPARAM ioctl
  drm/i915: Add macro for SSEU stride calculation
  drm/i915: Move calculation of subslices per slice to new function
  drm/i915: Refactor sseu helper functions
  drm/i915: Expand subslice mask

 drivers/gpu/drm/i915/gt/intel_engine_cs.c    |  24 ++-
 drivers/gpu/drm/i915/gt/intel_engine_types.h |  30 ++--
 drivers/gpu/drm/i915/gt/intel_hangcheck.c    |   3 +-
 drivers/gpu/drm/i915/gt/intel_sseu.c         |  62 +++++++
 drivers/gpu/drm/i915/gt/intel_sseu.h         |  35 +++-
 drivers/gpu/drm/i915/gt/intel_workarounds.c  |   2 +-
 drivers/gpu/drm/i915/i915_debugfs.c          |  46 ++---
 drivers/gpu/drm/i915/i915_drv.c              |  15 +-
 drivers/gpu/drm/i915/i915_gpu_error.c        |   5 +-
 drivers/gpu/drm/i915/i915_query.c            |  15 +-
 drivers/gpu/drm/i915/intel_device_info.c     | 176 +++++++++++--------
 drivers/gpu/drm/i915/intel_device_info.h     |  47 -----
 12 files changed, 280 insertions(+), 180 deletions(-)

Comments

Navare, Manasi May 28, 2019, 6:32 p.m. UTC | #1
Pushed to dinq, thanks for the patches and the reviews!

Regards
Manasi

On Fri, May 24, 2019 at 08:40:17AM -0700, Stuart Summers wrote:
> This patch series contains a few code clean-up patches, followed
> by a patch which changes the storage of the subslice mask to better
> match the userspace access through the I915_QUERY_TOPOLOGY_INFO
> ioctl. The index into the subslice_mask array is then calculated:
>   slice * subslice stride + subslice index / 8
> 
> v2: fix i915_pm_sseu test failure
> v3: no changes to patches in the series, just resending to pick up
>     in CI correctly
> v4: rebase
> v5: fix header test
> v6: address review comments from Jari
>     address minor checkpatch warning in existing code
>     use eu_stride for EU div-by-8
> v7: another rebase
> v8: address review comments from Tvrtko and Daniele
> v9: address review comments from Daniele
> v10: add reviewed-by on last patch with minor suggested change,
>      rebase, and repost for CI
> 
> Stuart Summers (5):
>   drm/i915: Use local variable for SSEU info in GETPARAM ioctl
>   drm/i915: Add macro for SSEU stride calculation
>   drm/i915: Move calculation of subslices per slice to new function
>   drm/i915: Refactor sseu helper functions
>   drm/i915: Expand subslice mask
> 
>  drivers/gpu/drm/i915/gt/intel_engine_cs.c    |  24 ++-
>  drivers/gpu/drm/i915/gt/intel_engine_types.h |  30 ++--
>  drivers/gpu/drm/i915/gt/intel_hangcheck.c    |   3 +-
>  drivers/gpu/drm/i915/gt/intel_sseu.c         |  62 +++++++
>  drivers/gpu/drm/i915/gt/intel_sseu.h         |  35 +++-
>  drivers/gpu/drm/i915/gt/intel_workarounds.c  |   2 +-
>  drivers/gpu/drm/i915/i915_debugfs.c          |  46 ++---
>  drivers/gpu/drm/i915/i915_drv.c              |  15 +-
>  drivers/gpu/drm/i915/i915_gpu_error.c        |   5 +-
>  drivers/gpu/drm/i915/i915_query.c            |  15 +-
>  drivers/gpu/drm/i915/intel_device_info.c     | 176 +++++++++++--------
>  drivers/gpu/drm/i915/intel_device_info.h     |  47 -----
>  12 files changed, 280 insertions(+), 180 deletions(-)
> 
> -- 
> 2.21.0.5.gaeb582a983
> 
> _______________________________________________
> Intel-gfx mailing list
> Intel-gfx@lists.freedesktop.org
> https://lists.freedesktop.org/mailman/listinfo/intel-gfx
Summers, Stuart May 28, 2019, 6:33 p.m. UTC | #2
On Tue, 2019-05-28 at 11:32 -0700, Manasi Navare wrote:
> Pushed to dinq, thanks for the patches and the reviews!

Thanks for the push Manasi and the reviews Daniele and others!

-Stuart

> 
> Regards
> Manasi
> 
> On Fri, May 24, 2019 at 08:40:17AM -0700, Stuart Summers wrote:
> > This patch series contains a few code clean-up patches, followed
> > by a patch which changes the storage of the subslice mask to better
> > match the userspace access through the I915_QUERY_TOPOLOGY_INFO
> > ioctl. The index into the subslice_mask array is then calculated:
> >   slice * subslice stride + subslice index / 8
> > 
> > v2: fix i915_pm_sseu test failure
> > v3: no changes to patches in the series, just resending to pick up
> >     in CI correctly
> > v4: rebase
> > v5: fix header test
> > v6: address review comments from Jari
> >     address minor checkpatch warning in existing code
> >     use eu_stride for EU div-by-8
> > v7: another rebase
> > v8: address review comments from Tvrtko and Daniele
> > v9: address review comments from Daniele
> > v10: add reviewed-by on last patch with minor suggested change,
> >      rebase, and repost for CI
> > 
> > Stuart Summers (5):
> >   drm/i915: Use local variable for SSEU info in GETPARAM ioctl
> >   drm/i915: Add macro for SSEU stride calculation
> >   drm/i915: Move calculation of subslices per slice to new function
> >   drm/i915: Refactor sseu helper functions
> >   drm/i915: Expand subslice mask
> > 
> >  drivers/gpu/drm/i915/gt/intel_engine_cs.c    |  24 ++-
> >  drivers/gpu/drm/i915/gt/intel_engine_types.h |  30 ++--
> >  drivers/gpu/drm/i915/gt/intel_hangcheck.c    |   3 +-
> >  drivers/gpu/drm/i915/gt/intel_sseu.c         |  62 +++++++
> >  drivers/gpu/drm/i915/gt/intel_sseu.h         |  35 +++-
> >  drivers/gpu/drm/i915/gt/intel_workarounds.c  |   2 +-
> >  drivers/gpu/drm/i915/i915_debugfs.c          |  46 ++---
> >  drivers/gpu/drm/i915/i915_drv.c              |  15 +-
> >  drivers/gpu/drm/i915/i915_gpu_error.c        |   5 +-
> >  drivers/gpu/drm/i915/i915_query.c            |  15 +-
> >  drivers/gpu/drm/i915/intel_device_info.c     | 176 +++++++++++--
> > ------
> >  drivers/gpu/drm/i915/intel_device_info.h     |  47 -----
> >  12 files changed, 280 insertions(+), 180 deletions(-)
> > 
> > -- 
> > 2.21.0.5.gaeb582a983
> > 
> > _______________________________________________
> > Intel-gfx mailing list
> > Intel-gfx@lists.freedesktop.org
> > https://lists.freedesktop.org/mailman/listinfo/intel-gfx
Saarinen, Jani May 29, 2019, 6:48 a.m. UTC | #3
Hi, 

> -----Original Message-----
> From: Intel-gfx [mailto:intel-gfx-bounces@lists.freedesktop.org] On Behalf Of
> Summers, Stuart
> Sent: tiistai 28. toukokuuta 2019 21.33
> To: Navare, Manasi D <manasi.d.navare@intel.com>
> Cc: intel-gfx@lists.freedesktop.org
> Subject: Re: [Intel-gfx] [CI 0/5] Refactor to expand subslice mask
> 
> On Tue, 2019-05-28 at 11:32 -0700, Manasi Navare wrote:
> > Pushed to dinq, thanks for the patches and the reviews!
> 
> Thanks for the push Manasi and the reviews Daniele and others!
This broke all the ICL systems because CI data was not looked that they did not actually even boot at all.
All ICL's in BAT and whole ICL shards. 


> 
> -Stuart
> 
> >
> > Regards
> > Manasi
> >
> > On Fri, May 24, 2019 at 08:40:17AM -0700, Stuart Summers wrote:
> > > This patch series contains a few code clean-up patches, followed by
> > > a patch which changes the storage of the subslice mask to better
> > > match the userspace access through the I915_QUERY_TOPOLOGY_INFO
> > > ioctl. The index into the subslice_mask array is then calculated:
> > >   slice * subslice stride + subslice index / 8
> > >
> > > v2: fix i915_pm_sseu test failure
> > > v3: no changes to patches in the series, just resending to pick up
> > >     in CI correctly
> > > v4: rebase
> > > v5: fix header test
> > > v6: address review comments from Jari
> > >     address minor checkpatch warning in existing code
> > >     use eu_stride for EU div-by-8
> > > v7: another rebase
> > > v8: address review comments from Tvrtko and Daniele
> > > v9: address review comments from Daniele
> > > v10: add reviewed-by on last patch with minor suggested change,
> > >      rebase, and repost for CI
> > >
> > > Stuart Summers (5):
> > >   drm/i915: Use local variable for SSEU info in GETPARAM ioctl
> > >   drm/i915: Add macro for SSEU stride calculation
> > >   drm/i915: Move calculation of subslices per slice to new function
> > >   drm/i915: Refactor sseu helper functions
> > >   drm/i915: Expand subslice mask
> > >
> > >  drivers/gpu/drm/i915/gt/intel_engine_cs.c    |  24 ++-
> > >  drivers/gpu/drm/i915/gt/intel_engine_types.h |  30 ++--
> > >  drivers/gpu/drm/i915/gt/intel_hangcheck.c    |   3 +-
> > >  drivers/gpu/drm/i915/gt/intel_sseu.c         |  62 +++++++
> > >  drivers/gpu/drm/i915/gt/intel_sseu.h         |  35 +++-
> > >  drivers/gpu/drm/i915/gt/intel_workarounds.c  |   2 +-
> > >  drivers/gpu/drm/i915/i915_debugfs.c          |  46 ++---
> > >  drivers/gpu/drm/i915/i915_drv.c              |  15 +-
> > >  drivers/gpu/drm/i915/i915_gpu_error.c        |   5 +-
> > >  drivers/gpu/drm/i915/i915_query.c            |  15 +-
> > >  drivers/gpu/drm/i915/intel_device_info.c     | 176 +++++++++++--
> > > ------
> > >  drivers/gpu/drm/i915/intel_device_info.h     |  47 -----
> > >  12 files changed, 280 insertions(+), 180 deletions(-)
> > >
> > > --
> > > 2.21.0.5.gaeb582a983
> > >
> > > _______________________________________________
> > > Intel-gfx mailing list
> > > Intel-gfx@lists.freedesktop.org
> > > https://lists.freedesktop.org/mailman/listinfo/intel-gfx
Daniele Ceraolo Spurio May 29, 2019, 2:21 p.m. UTC | #4
On 5/28/19 11:48 PM, Saarinen, Jani wrote:
> Hi,
> 
>> -----Original Message-----
>> From: Intel-gfx [mailto:intel-gfx-bounces@lists.freedesktop.org] On Behalf Of
>> Summers, Stuart
>> Sent: tiistai 28. toukokuuta 2019 21.33
>> To: Navare, Manasi D <manasi.d.navare@intel.com>
>> Cc: intel-gfx@lists.freedesktop.org
>> Subject: Re: [Intel-gfx] [CI 0/5] Refactor to expand subslice mask
>>
>> On Tue, 2019-05-28 at 11:32 -0700, Manasi Navare wrote:
>>> Pushed to dinq, thanks for the patches and the reviews!
>>
>> Thanks for the push Manasi and the reviews Daniele and others!
> This broke all the ICL systems because CI data was not looked that they did not actually even boot at all.
> All ICL's in BAT and whole ICL shards.
> 

Can we change the CI reply for the case where there are extra missing 
machines compared to the reference run from SUCCESS to WARNING or 
something like that, so people have a clearer indication that something 
might have gone wrong?

Daniele

> 
>>
>> -Stuart
>>
>>>
>>> Regards
>>> Manasi
>>>
>>> On Fri, May 24, 2019 at 08:40:17AM -0700, Stuart Summers wrote:
>>>> This patch series contains a few code clean-up patches, followed by
>>>> a patch which changes the storage of the subslice mask to better
>>>> match the userspace access through the I915_QUERY_TOPOLOGY_INFO
>>>> ioctl. The index into the subslice_mask array is then calculated:
>>>>    slice * subslice stride + subslice index / 8
>>>>
>>>> v2: fix i915_pm_sseu test failure
>>>> v3: no changes to patches in the series, just resending to pick up
>>>>      in CI correctly
>>>> v4: rebase
>>>> v5: fix header test
>>>> v6: address review comments from Jari
>>>>      address minor checkpatch warning in existing code
>>>>      use eu_stride for EU div-by-8
>>>> v7: another rebase
>>>> v8: address review comments from Tvrtko and Daniele
>>>> v9: address review comments from Daniele
>>>> v10: add reviewed-by on last patch with minor suggested change,
>>>>       rebase, and repost for CI
>>>>
>>>> Stuart Summers (5):
>>>>    drm/i915: Use local variable for SSEU info in GETPARAM ioctl
>>>>    drm/i915: Add macro for SSEU stride calculation
>>>>    drm/i915: Move calculation of subslices per slice to new function
>>>>    drm/i915: Refactor sseu helper functions
>>>>    drm/i915: Expand subslice mask
>>>>
>>>>   drivers/gpu/drm/i915/gt/intel_engine_cs.c    |  24 ++-
>>>>   drivers/gpu/drm/i915/gt/intel_engine_types.h |  30 ++--
>>>>   drivers/gpu/drm/i915/gt/intel_hangcheck.c    |   3 +-
>>>>   drivers/gpu/drm/i915/gt/intel_sseu.c         |  62 +++++++
>>>>   drivers/gpu/drm/i915/gt/intel_sseu.h         |  35 +++-
>>>>   drivers/gpu/drm/i915/gt/intel_workarounds.c  |   2 +-
>>>>   drivers/gpu/drm/i915/i915_debugfs.c          |  46 ++---
>>>>   drivers/gpu/drm/i915/i915_drv.c              |  15 +-
>>>>   drivers/gpu/drm/i915/i915_gpu_error.c        |   5 +-
>>>>   drivers/gpu/drm/i915/i915_query.c            |  15 +-
>>>>   drivers/gpu/drm/i915/intel_device_info.c     | 176 +++++++++++--
>>>> ------
>>>>   drivers/gpu/drm/i915/intel_device_info.h     |  47 -----
>>>>   12 files changed, 280 insertions(+), 180 deletions(-)
>>>>
>>>> --
>>>> 2.21.0.5.gaeb582a983
>>>>
>>>> _______________________________________________
>>>> Intel-gfx mailing list
>>>> Intel-gfx@lists.freedesktop.org
>>>> https://lists.freedesktop.org/mailman/listinfo/intel-gfx
> _______________________________________________
> Intel-gfx mailing list
> Intel-gfx@lists.freedesktop.org
> https://lists.freedesktop.org/mailman/listinfo/intel-gfx
>
Jani Nikula May 29, 2019, 2:29 p.m. UTC | #5
On Wed, 29 May 2019, "Saarinen, Jani" <jani.saarinen@intel.com> wrote:
> Hi, 
>
>> -----Original Message-----
>> From: Intel-gfx [mailto:intel-gfx-bounces@lists.freedesktop.org] On Behalf Of
>> Summers, Stuart
>> Sent: tiistai 28. toukokuuta 2019 21.33
>> To: Navare, Manasi D <manasi.d.navare@intel.com>
>> Cc: intel-gfx@lists.freedesktop.org
>> Subject: Re: [Intel-gfx] [CI 0/5] Refactor to expand subslice mask
>> 
>> On Tue, 2019-05-28 at 11:32 -0700, Manasi Navare wrote:
>> > Pushed to dinq, thanks for the patches and the reviews!
>> 
>> Thanks for the push Manasi and the reviews Daniele and others!
> This broke all the ICL systems because CI data was not looked that they did not actually even boot at all.
> All ICL's in BAT and whole ICL shards. 

Commit 1ac159e23c2c ("drm/i915: Expand subslice mask") has now been
reverted and we can carry on.

As discussed, I don't really expect people to dig into the logs after
seeing "Fi.CI.IGT: success". For one I didn't expect patches to be able
to take down machines with just a fairly silent "Participating hosts (10
-> 9)" message. But that discussion is for another forum, another time.

BR,
Jani.


>
>
>> 
>> -Stuart
>> 
>> >
>> > Regards
>> > Manasi
>> >
>> > On Fri, May 24, 2019 at 08:40:17AM -0700, Stuart Summers wrote:
>> > > This patch series contains a few code clean-up patches, followed by
>> > > a patch which changes the storage of the subslice mask to better
>> > > match the userspace access through the I915_QUERY_TOPOLOGY_INFO
>> > > ioctl. The index into the subslice_mask array is then calculated:
>> > >   slice * subslice stride + subslice index / 8
>> > >
>> > > v2: fix i915_pm_sseu test failure
>> > > v3: no changes to patches in the series, just resending to pick up
>> > >     in CI correctly
>> > > v4: rebase
>> > > v5: fix header test
>> > > v6: address review comments from Jari
>> > >     address minor checkpatch warning in existing code
>> > >     use eu_stride for EU div-by-8
>> > > v7: another rebase
>> > > v8: address review comments from Tvrtko and Daniele
>> > > v9: address review comments from Daniele
>> > > v10: add reviewed-by on last patch with minor suggested change,
>> > >      rebase, and repost for CI
>> > >
>> > > Stuart Summers (5):
>> > >   drm/i915: Use local variable for SSEU info in GETPARAM ioctl
>> > >   drm/i915: Add macro for SSEU stride calculation
>> > >   drm/i915: Move calculation of subslices per slice to new function
>> > >   drm/i915: Refactor sseu helper functions
>> > >   drm/i915: Expand subslice mask
>> > >
>> > >  drivers/gpu/drm/i915/gt/intel_engine_cs.c    |  24 ++-
>> > >  drivers/gpu/drm/i915/gt/intel_engine_types.h |  30 ++--
>> > >  drivers/gpu/drm/i915/gt/intel_hangcheck.c    |   3 +-
>> > >  drivers/gpu/drm/i915/gt/intel_sseu.c         |  62 +++++++
>> > >  drivers/gpu/drm/i915/gt/intel_sseu.h         |  35 +++-
>> > >  drivers/gpu/drm/i915/gt/intel_workarounds.c  |   2 +-
>> > >  drivers/gpu/drm/i915/i915_debugfs.c          |  46 ++---
>> > >  drivers/gpu/drm/i915/i915_drv.c              |  15 +-
>> > >  drivers/gpu/drm/i915/i915_gpu_error.c        |   5 +-
>> > >  drivers/gpu/drm/i915/i915_query.c            |  15 +-
>> > >  drivers/gpu/drm/i915/intel_device_info.c     | 176 +++++++++++--
>> > > ------
>> > >  drivers/gpu/drm/i915/intel_device_info.h     |  47 -----
>> > >  12 files changed, 280 insertions(+), 180 deletions(-)
>> > >
>> > > --
>> > > 2.21.0.5.gaeb582a983
>> > >
>> > > _______________________________________________
>> > > Intel-gfx mailing list
>> > > Intel-gfx@lists.freedesktop.org
>> > > https://lists.freedesktop.org/mailman/listinfo/intel-gfx
> _______________________________________________
> Intel-gfx mailing list
> Intel-gfx@lists.freedesktop.org
> https://lists.freedesktop.org/mailman/listinfo/intel-gfx
Summers, Stuart May 29, 2019, 4:02 p.m. UTC | #6
On Wed, 2019-05-29 at 07:21 -0700, Daniele Ceraolo Spurio wrote:
> 
> On 5/28/19 11:48 PM, Saarinen, Jani wrote:
> > Hi,
> > 
> > > -----Original Message-----
> > > From: Intel-gfx [mailto:intel-gfx-bounces@lists.freedesktop.org]
> > > On Behalf Of
> > > Summers, Stuart
> > > Sent: tiistai 28. toukokuuta 2019 21.33
> > > To: Navare, Manasi D <manasi.d.navare@intel.com>
> > > Cc: intel-gfx@lists.freedesktop.org
> > > Subject: Re: [Intel-gfx] [CI 0/5] Refactor to expand subslice
> > > mask
> > > 
> > > On Tue, 2019-05-28 at 11:32 -0700, Manasi Navare wrote:
> > > > Pushed to dinq, thanks for the patches and the reviews!
> > > 
> > > Thanks for the push Manasi and the reviews Daniele and others!
> > 
> > This broke all the ICL systems because CI data was not looked that
> > they did not actually even boot at all.
> > All ICL's in BAT and whole ICL shards.
> > 
> 
> Can we change the CI reply for the case where there are extra
> missing 
> machines compared to the reference run from SUCCESS to WARNING or 
> something like that, so people have a clearer indication that
> something 
> might have gone wrong?

I agree here. I'm sure with time and experience these types of things
will get easier to parse, but this was very unobvious to me when
posting. I have no problem reworking, but would really appreciate a
solution to this from the CI side to ensure we don't hit this type of
thing in the future.

Thanks,
Stuart

> 
> Daniele
> 
> > 
> > > 
> > > -Stuart
> > > 
> > > > 
> > > > Regards
> > > > Manasi
> > > > 
> > > > On Fri, May 24, 2019 at 08:40:17AM -0700, Stuart Summers wrote:
> > > > > This patch series contains a few code clean-up patches,
> > > > > followed by
> > > > > a patch which changes the storage of the subslice mask to
> > > > > better
> > > > > match the userspace access through the
> > > > > I915_QUERY_TOPOLOGY_INFO
> > > > > ioctl. The index into the subslice_mask array is then
> > > > > calculated:
> > > > >    slice * subslice stride + subslice index / 8
> > > > > 
> > > > > v2: fix i915_pm_sseu test failure
> > > > > v3: no changes to patches in the series, just resending to
> > > > > pick up
> > > > >      in CI correctly
> > > > > v4: rebase
> > > > > v5: fix header test
> > > > > v6: address review comments from Jari
> > > > >      address minor checkpatch warning in existing code
> > > > >      use eu_stride for EU div-by-8
> > > > > v7: another rebase
> > > > > v8: address review comments from Tvrtko and Daniele
> > > > > v9: address review comments from Daniele
> > > > > v10: add reviewed-by on last patch with minor suggested
> > > > > change,
> > > > >       rebase, and repost for CI
> > > > > 
> > > > > Stuart Summers (5):
> > > > >    drm/i915: Use local variable for SSEU info in GETPARAM
> > > > > ioctl
> > > > >    drm/i915: Add macro for SSEU stride calculation
> > > > >    drm/i915: Move calculation of subslices per slice to new
> > > > > function
> > > > >    drm/i915: Refactor sseu helper functions
> > > > >    drm/i915: Expand subslice mask
> > > > > 
> > > > >   drivers/gpu/drm/i915/gt/intel_engine_cs.c    |  24 ++-
> > > > >   drivers/gpu/drm/i915/gt/intel_engine_types.h |  30 ++--
> > > > >   drivers/gpu/drm/i915/gt/intel_hangcheck.c    |   3 +-
> > > > >   drivers/gpu/drm/i915/gt/intel_sseu.c         |  62 +++++++
> > > > >   drivers/gpu/drm/i915/gt/intel_sseu.h         |  35 +++-
> > > > >   drivers/gpu/drm/i915/gt/intel_workarounds.c  |   2 +-
> > > > >   drivers/gpu/drm/i915/i915_debugfs.c          |  46 ++---
> > > > >   drivers/gpu/drm/i915/i915_drv.c              |  15 +-
> > > > >   drivers/gpu/drm/i915/i915_gpu_error.c        |   5 +-
> > > > >   drivers/gpu/drm/i915/i915_query.c            |  15 +-
> > > > >   drivers/gpu/drm/i915/intel_device_info.c     | 176
> > > > > +++++++++++--
> > > > > ------
> > > > >   drivers/gpu/drm/i915/intel_device_info.h     |  47 -----
> > > > >   12 files changed, 280 insertions(+), 180 deletions(-)
> > > > > 
> > > > > --
> > > > > 2.21.0.5.gaeb582a983
> > > > > 
> > > > > _______________________________________________
> > > > > Intel-gfx mailing list
> > > > > Intel-gfx@lists.freedesktop.org
> > > > > https://lists.freedesktop.org/mailman/listinfo/intel-gfx
> > 
> > _______________________________________________
> > Intel-gfx mailing list
> > Intel-gfx@lists.freedesktop.org
> > https://lists.freedesktop.org/mailman/listinfo/intel-gfx
> >
Saarinen, Jani May 30, 2019, 8:29 a.m. UTC | #7
Hi, 


> -----Original Message-----
> From: Summers, Stuart
> Sent: keskiviikko 29. toukokuuta 2019 19.02
> To: Saarinen, Jani <jani.saarinen@intel.com>; Ceraolo Spurio, Daniele
> <daniele.ceraolospurio@intel.com>; Navare, Manasi D
> <manasi.d.navare@intel.com>
> Cc: intel-gfx@lists.freedesktop.org
> Subject: Re: [Intel-gfx] [CI 0/5] Refactor to expand subslice mask
> 
> On Wed, 2019-05-29 at 07:21 -0700, Daniele Ceraolo Spurio wrote:
> >
> > On 5/28/19 11:48 PM, Saarinen, Jani wrote:
> > > Hi,
> > >
> > > > -----Original Message-----
> > > > From: Intel-gfx [mailto:intel-gfx-bounces@lists.freedesktop.org]
> > > > On Behalf Of
> > > > Summers, Stuart
> > > > Sent: tiistai 28. toukokuuta 2019 21.33
> > > > To: Navare, Manasi D <manasi.d.navare@intel.com>
> > > > Cc: intel-gfx@lists.freedesktop.org
> > > > Subject: Re: [Intel-gfx] [CI 0/5] Refactor to expand subslice
> > > > mask
> > > >
> > > > On Tue, 2019-05-28 at 11:32 -0700, Manasi Navare wrote:
> > > > > Pushed to dinq, thanks for the patches and the reviews!
> > > >
> > > > Thanks for the push Manasi and the reviews Daniele and others!
> > >
> > > This broke all the ICL systems because CI data was not looked that
> > > they did not actually even boot at all.
> > > All ICL's in BAT and whole ICL shards.
> > >
> >
> > Can we change the CI reply for the case where there are extra
> > missing
> > machines compared to the reference run from SUCCESS to WARNING or
> > something like that, so people have a clearer indication that
> > something
> > might have gone wrong?
> 
> I agree here. I'm sure with time and experience these types of things
> will get easier to parse, but this was very unobvious to me when
> posting. I have no problem reworking, but would really appreciate a
> solution to this from the CI side to ensure we don't hit this type of
> thing in the future.
Sure, CI team already discussed on this. 
But going forward. Can you fix this still today that ICL's systems are green not orange on ci-grid.
So would be good to get to the state that was on CI_DRM_6158. 
Reference eg. : https://intel-gfx-ci.01.org/tree/drm-tip/fi-icl-u2.html so clearly after module reload we fail.
<3> [415.887946] [drm:_wa_add [i915]] *ERROR* Discarding overwritten w/a for reg 7034 (mask: ffffffff, value: 80000280)

Br,
Jani
> 
> Thanks,
> Stuart
> 
> >
> > Daniele
> >
> > >
> > > >
> > > > -Stuart
> > > >
> > > > >
> > > > > Regards
> > > > > Manasi
> > > > >
> > > > > On Fri, May 24, 2019 at 08:40:17AM -0700, Stuart Summers wrote:
> > > > > > This patch series contains a few code clean-up patches,
> > > > > > followed by
> > > > > > a patch which changes the storage of the subslice mask to
> > > > > > better
> > > > > > match the userspace access through the
> > > > > > I915_QUERY_TOPOLOGY_INFO
> > > > > > ioctl. The index into the subslice_mask array is then
> > > > > > calculated:
> > > > > >    slice * subslice stride + subslice index / 8
> > > > > >
> > > > > > v2: fix i915_pm_sseu test failure
> > > > > > v3: no changes to patches in the series, just resending to
> > > > > > pick up
> > > > > >      in CI correctly
> > > > > > v4: rebase
> > > > > > v5: fix header test
> > > > > > v6: address review comments from Jari
> > > > > >      address minor checkpatch warning in existing code
> > > > > >      use eu_stride for EU div-by-8
> > > > > > v7: another rebase
> > > > > > v8: address review comments from Tvrtko and Daniele
> > > > > > v9: address review comments from Daniele
> > > > > > v10: add reviewed-by on last patch with minor suggested
> > > > > > change,
> > > > > >       rebase, and repost for CI
> > > > > >
> > > > > > Stuart Summers (5):
> > > > > >    drm/i915: Use local variable for SSEU info in GETPARAM
> > > > > > ioctl
> > > > > >    drm/i915: Add macro for SSEU stride calculation
> > > > > >    drm/i915: Move calculation of subslices per slice to new
> > > > > > function
> > > > > >    drm/i915: Refactor sseu helper functions
> > > > > >    drm/i915: Expand subslice mask
> > > > > >
> > > > > >   drivers/gpu/drm/i915/gt/intel_engine_cs.c    |  24 ++-
> > > > > >   drivers/gpu/drm/i915/gt/intel_engine_types.h |  30 ++--
> > > > > >   drivers/gpu/drm/i915/gt/intel_hangcheck.c    |   3 +-
> > > > > >   drivers/gpu/drm/i915/gt/intel_sseu.c         |  62 +++++++
> > > > > >   drivers/gpu/drm/i915/gt/intel_sseu.h         |  35 +++-
> > > > > >   drivers/gpu/drm/i915/gt/intel_workarounds.c  |   2 +-
> > > > > >   drivers/gpu/drm/i915/i915_debugfs.c          |  46 ++---
> > > > > >   drivers/gpu/drm/i915/i915_drv.c              |  15 +-
> > > > > >   drivers/gpu/drm/i915/i915_gpu_error.c        |   5 +-
> > > > > >   drivers/gpu/drm/i915/i915_query.c            |  15 +-
> > > > > >   drivers/gpu/drm/i915/intel_device_info.c     | 176
> > > > > > +++++++++++--
> > > > > > ------
> > > > > >   drivers/gpu/drm/i915/intel_device_info.h     |  47 -----
> > > > > >   12 files changed, 280 insertions(+), 180 deletions(-)
> > > > > >
> > > > > > --
> > > > > > 2.21.0.5.gaeb582a983
> > > > > >
> > > > > > _______________________________________________
> > > > > > Intel-gfx mailing list
> > > > > > Intel-gfx@lists.freedesktop.org
> > > > > > https://lists.freedesktop.org/mailman/listinfo/intel-gfx
> > >
> > > _______________________________________________
> > > Intel-gfx mailing list
> > > Intel-gfx@lists.freedesktop.org
> > > https://lists.freedesktop.org/mailman/listinfo/intel-gfx
> > >
Summers, Stuart May 30, 2019, 2:46 p.m. UTC | #8
On Thu, 2019-05-30 at 09:29 +0100, Saarinen, Jani wrote:
> Hi, 
> 
> 
> > -----Original Message-----
> > From: Summers, Stuart
> > Sent: keskiviikko 29. toukokuuta 2019 19.02
> > To: Saarinen, Jani <jani.saarinen@intel.com>; Ceraolo Spurio,
> > Daniele
> > <daniele.ceraolospurio@intel.com>; Navare, Manasi D
> > <manasi.d.navare@intel.com>
> > Cc: intel-gfx@lists.freedesktop.org
> > Subject: Re: [Intel-gfx] [CI 0/5] Refactor to expand subslice mask
> > 
> > On Wed, 2019-05-29 at 07:21 -0700, Daniele Ceraolo Spurio wrote:
> > > 
> > > On 5/28/19 11:48 PM, Saarinen, Jani wrote:
> > > > Hi,
> > > > 
> > > > > -----Original Message-----
> > > > > From: Intel-gfx [mailto:
> > > > > intel-gfx-bounces@lists.freedesktop.org]
> > > > > On Behalf Of
> > > > > Summers, Stuart
> > > > > Sent: tiistai 28. toukokuuta 2019 21.33
> > > > > To: Navare, Manasi D <manasi.d.navare@intel.com>
> > > > > Cc: intel-gfx@lists.freedesktop.org
> > > > > Subject: Re: [Intel-gfx] [CI 0/5] Refactor to expand subslice
> > > > > mask
> > > > > 
> > > > > On Tue, 2019-05-28 at 11:32 -0700, Manasi Navare wrote:
> > > > > > Pushed to dinq, thanks for the patches and the reviews!
> > > > > 
> > > > > Thanks for the push Manasi and the reviews Daniele and
> > > > > others!sk: ffffffff, value: 80000280)
> > > > 
> > > > This broke all the ICL systems because CI data was not looked
> > > > that
> > > > they did not actually even boot at all.
> > > > All ICL's in BAT and whole ICL shards.
> > > > 
> > > 
> > > Can we change the CI reply for the case where there are extra
> > > missing
> > > machines compared to the reference run from SUCCESS to WARNING or
> > > something like that, so people have a clearer indication that
> > > something
> > > might have gone wrong?
> > 
> > I agree here. I'm sure with time and experience these types of
> > things
> > will get easier to parse, but this was very unobvious to me when
> > posting. I have no problem reworking, but would really appreciate a
> > solution to this from the CI side to ensure we don't hit this type
> > of
> > thing in the future.
> 
> Sure, CI team already discussed on this. 
> But going forward. Can you fix this still today that ICL's systems
> are green not orange on ci-grid.
> So would be good to get to the state that was on CI_DRM_6158. 
> Reference eg. : 
> https://intel-gfx-ci.01.org/tree/drm-tip/fi-icl-u2.html so clearly
> after module reload we fail.
> <3> [415.887946] [drm:_wa_add [i915]] *ERROR* Discarding overwritten
> w/a for reg 7034 (mask: ffffffff, value: 80000280)

Hi Jani,

I will likely not have time to get to this today or maybe even tomorrow
unfortunately. I'll try to look at this as part of my rework of the
SSEU revert from yesterday.

Thanks,
Stuart

> 
> Br,
> Jani
> > 
> > Thanks,
> > Stuart
> > 
> > > 
> > > Daniele
> > > 
> > > > 
> > > > > 
> > > > > -Stuart
> > > > > 
> > > > > > 
> > > > > > Regards
> > > > > > Manasi
> > > > > > 
> > > > > > On Fri, May 24, 2019 at 08:40:17AM -0700, Stuart Summers
> > > > > > wrote:
> > > > > > > This patch series contains a few code clean-up patches,
> > > > > > > followed by
> > > > > > > a patch which changes the storage of the subslice mask to
> > > > > > > better
> > > > > > > match the userspace access through the
> > > > > > > I915_QUERY_TOPOLOGY_INFO
> > > > > > > ioctl. The index into the subslice_mask array is then
> > > > > > > calculated:
> > > > > > >    slice * subslice stride + subslice index / 8
> > > > > > > 
> > > > > > > v2: fix i915_pm_sseu test failure
> > > > > > > v3: no changes to patches in the series, just resending
> > > > > > > to
> > > > > > > pick up
> > > > > > >      in CI correctly
> > > > > > > v4: rebase
> > > > > > > v5: fix header test
> > > > > > > v6: address review comments from Jari
> > > > > > >      address minor checkpatch warning in existing code
> > > > > > >      use eu_stride for EU div-by-8
> > > > > > > v7: another rebase
> > > > > > > v8: address review comments from Tvrtko and Daniele
> > > > > > > v9: address review comments from Daniele
> > > > > > > v10: add reviewed-by on last patch with minor suggested
> > > > > > > change,
> > > > > > >       rebase, and repost for CI
> > > > > > > 
> > > > > > > Stuart Summers (5):
> > > > > > >    drm/i915: Use local variable for SSEU info in GETPARAM
> > > > > > > ioctl
> > > > > > >    drm/i915: Add macro for SSEU stride calculation
> > > > > > >    drm/i915: Move calculation of subslices per slice to
> > > > > > > new
> > > > > > > function
> > > > > > >    drm/i915: Refactor sseu helper functions
> > > > > > >    drm/i915: Expand subslice mask
> > > > > > > 
> > > > > > >   drivers/gpu/drm/i915/gt/intel_engine_cs.c    |  24 ++-
> > > > > > >   drivers/gpu/drm/i915/gt/intel_engine_types.h |  30 ++--
> > > > > > >   drivers/gpu/drm/i915/gt/intel_hangcheck.c    |   3 +-
> > > > > > >   drivers/gpu/drm/i915/gt/intel_sseu.c         |  62
> > > > > > > +++++++
> > > > > > >   drivers/gpu/drm/i915/gt/intel_sseu.h         |  35 +++-
> > > > > > >   drivers/gpu/drm/i915/gt/intel_workarounds.c  |   2 +-
> > > > > > >   drivers/gpu/drm/i915/i915_debugfs.c          |  46 ++
> > > > > > > ---
> > > > > > >   drivers/gpu/drm/i915/i915_drv.c              |  15 +-
> > > > > > >   drivers/gpu/drm/i915/i915_gpu_error.c        |   5 +-
> > > > > > >   drivers/gpu/drm/i915/i915_query.c            |  15 +-
> > > > > > >   drivers/gpu/drm/i915/intel_device_info.c     | 176
> > > > > > > +++++++++++--
> > > > > > > ------
> > > > > > >   drivers/gpu/drm/i915/intel_device_info.h     |  47 ----
> > > > > > > -
> > > > > > >   12 files changed, 280 insertions(+), 180 deletions(-)
> > > > > > > 
> > > > > > > --
> > > > > > > 2.21.0.5.gaeb582a983
> > > > > > > 
> > > > > > > _______________________________________________
> > > > > > > Intel-gfx mailing list
> > > > > > > Intel-gfx@lists.freedesktop.org
> > > > > > > https://lists.freedesktop.org/mailman/listinfo/intel-gfx
> > > > 
> > > > _______________________________________________
> > > > Intel-gfx mailing list
> > > > Intel-gfx@lists.freedesktop.org
> > > > https://lists.freedesktop.org/mailman/listinfo/intel-gfx
> > > >
Daniele Ceraolo Spurio May 30, 2019, 2:55 p.m. UTC | #9
On 5/30/19 1:29 AM, Saarinen, Jani wrote:
> Hi,
> 
> 
>> -----Original Message-----
>> From: Summers, Stuart
>> Sent: keskiviikko 29. toukokuuta 2019 19.02
>> To: Saarinen, Jani <jani.saarinen@intel.com>; Ceraolo Spurio, Daniele
>> <daniele.ceraolospurio@intel.com>; Navare, Manasi D
>> <manasi.d.navare@intel.com>
>> Cc: intel-gfx@lists.freedesktop.org
>> Subject: Re: [Intel-gfx] [CI 0/5] Refactor to expand subslice mask
>>
>> On Wed, 2019-05-29 at 07:21 -0700, Daniele Ceraolo Spurio wrote:
>>>
>>> On 5/28/19 11:48 PM, Saarinen, Jani wrote:
>>>> Hi,
>>>>
>>>>> -----Original Message-----
>>>>> From: Intel-gfx [mailto:intel-gfx-bounces@lists.freedesktop.org]
>>>>> On Behalf Of
>>>>> Summers, Stuart
>>>>> Sent: tiistai 28. toukokuuta 2019 21.33
>>>>> To: Navare, Manasi D <manasi.d.navare@intel.com>
>>>>> Cc: intel-gfx@lists.freedesktop.org
>>>>> Subject: Re: [Intel-gfx] [CI 0/5] Refactor to expand subslice
>>>>> mask
>>>>>
>>>>> On Tue, 2019-05-28 at 11:32 -0700, Manasi Navare wrote:
>>>>>> Pushed to dinq, thanks for the patches and the reviews!
>>>>>
>>>>> Thanks for the push Manasi and the reviews Daniele and others!
>>>>
>>>> This broke all the ICL systems because CI data was not looked that
>>>> they did not actually even boot at all.
>>>> All ICL's in BAT and whole ICL shards.
>>>>
>>>
>>> Can we change the CI reply for the case where there are extra
>>> missing
>>> machines compared to the reference run from SUCCESS to WARNING or
>>> something like that, so people have a clearer indication that
>>> something
>>> might have gone wrong?
>>
>> I agree here. I'm sure with time and experience these types of things
>> will get easier to parse, but this was very unobvious to me when
>> posting. I have no problem reworking, but would really appreciate a
>> solution to this from the CI side to ensure we don't hit this type of
>> thing in the future.
> Sure, CI team already discussed on this.
> But going forward. Can you fix this still today that ICL's systems are green not orange on ci-grid.
> So would be good to get to the state that was on CI_DRM_6158.
> Reference eg. : https://intel-gfx-ci.01.org/tree/drm-tip/fi-icl-u2.html so clearly after module reload we fail.
> <3> [415.887946] [drm:_wa_add [i915]] *ERROR* Discarding overwritten w/a for reg 7034 (mask: ffffffff, value: 80000280)
> 
> Br,
> Jani

This doesn't seem to have anything to do with this series though. Looks 
like we duplicated the WA when merging 'drm/drm-next' into drm-tip:

static void icl_ctx_workarounds_init(struct intel_engine_cs *engine,
				     struct i915_wa_list *wal)
{
	struct drm_i915_private *i915 = engine->i915;

	/* WaDisableBankHangMode:icl */
	wa_write(wal,
		 GEN8_L3CNTLREG,
		 intel_uncore_read(engine->uncore, GEN8_L3CNTLREG) |
		 GEN8_ERRDETBCTRL);

	/* WaDisableBankHangMode:icl */
	wa_write(wal,
		 GEN8_L3CNTLREG,
		 intel_uncore_read(engine->uncore, GEN8_L3CNTLREG) |
		 GEN8_ERRDETBCTRL);


AFAICS the duplication is added by:

commit 7126b65091c417e757b638065ae4fdc2a5dc2f5c
Merge: 3aea8d02f801 14ee642c2ab0
Author: Chris Wilson <chris@chris-wilson.co.uk>
Date:   Thu May 30 12:06:55 2019 +0100

     Merge remote-tracking branch 'drm/drm-next' into drm-tip

Daniele

>>
>> Thanks,
>> Stuart
>>
>>>
>>> Daniele
>>>
>>>>
>>>>>
>>>>> -Stuart
>>>>>
>>>>>>
>>>>>> Regards
>>>>>> Manasi
>>>>>>
>>>>>> On Fri, May 24, 2019 at 08:40:17AM -0700, Stuart Summers wrote:
>>>>>>> This patch series contains a few code clean-up patches,
>>>>>>> followed by
>>>>>>> a patch which changes the storage of the subslice mask to
>>>>>>> better
>>>>>>> match the userspace access through the
>>>>>>> I915_QUERY_TOPOLOGY_INFO
>>>>>>> ioctl. The index into the subslice_mask array is then
>>>>>>> calculated:
>>>>>>>     slice * subslice stride + subslice index / 8
>>>>>>>
>>>>>>> v2: fix i915_pm_sseu test failure
>>>>>>> v3: no changes to patches in the series, just resending to
>>>>>>> pick up
>>>>>>>       in CI correctly
>>>>>>> v4: rebase
>>>>>>> v5: fix header test
>>>>>>> v6: address review comments from Jari
>>>>>>>       address minor checkpatch warning in existing code
>>>>>>>       use eu_stride for EU div-by-8
>>>>>>> v7: another rebase
>>>>>>> v8: address review comments from Tvrtko and Daniele
>>>>>>> v9: address review comments from Daniele
>>>>>>> v10: add reviewed-by on last patch with minor suggested
>>>>>>> change,
>>>>>>>        rebase, and repost for CI
>>>>>>>
>>>>>>> Stuart Summers (5):
>>>>>>>     drm/i915: Use local variable for SSEU info in GETPARAM
>>>>>>> ioctl
>>>>>>>     drm/i915: Add macro for SSEU stride calculation
>>>>>>>     drm/i915: Move calculation of subslices per slice to new
>>>>>>> function
>>>>>>>     drm/i915: Refactor sseu helper functions
>>>>>>>     drm/i915: Expand subslice mask
>>>>>>>
>>>>>>>    drivers/gpu/drm/i915/gt/intel_engine_cs.c    |  24 ++-
>>>>>>>    drivers/gpu/drm/i915/gt/intel_engine_types.h |  30 ++--
>>>>>>>    drivers/gpu/drm/i915/gt/intel_hangcheck.c    |   3 +-
>>>>>>>    drivers/gpu/drm/i915/gt/intel_sseu.c         |  62 +++++++
>>>>>>>    drivers/gpu/drm/i915/gt/intel_sseu.h         |  35 +++-
>>>>>>>    drivers/gpu/drm/i915/gt/intel_workarounds.c  |   2 +-
>>>>>>>    drivers/gpu/drm/i915/i915_debugfs.c          |  46 ++---
>>>>>>>    drivers/gpu/drm/i915/i915_drv.c              |  15 +-
>>>>>>>    drivers/gpu/drm/i915/i915_gpu_error.c        |   5 +-
>>>>>>>    drivers/gpu/drm/i915/i915_query.c            |  15 +-
>>>>>>>    drivers/gpu/drm/i915/intel_device_info.c     | 176
>>>>>>> +++++++++++--
>>>>>>> ------
>>>>>>>    drivers/gpu/drm/i915/intel_device_info.h     |  47 -----
>>>>>>>    12 files changed, 280 insertions(+), 180 deletions(-)
>>>>>>>
>>>>>>> --
>>>>>>> 2.21.0.5.gaeb582a983
>>>>>>>
>>>>>>> _______________________________________________
>>>>>>> Intel-gfx mailing list
>>>>>>> Intel-gfx@lists.freedesktop.org
>>>>>>> https://lists.freedesktop.org/mailman/listinfo/intel-gfx
>>>>
>>>> _______________________________________________
>>>> Intel-gfx mailing list
>>>> Intel-gfx@lists.freedesktop.org
>>>> https://lists.freedesktop.org/mailman/listinfo/intel-gfx
>>>>