[RFC,33/60] drm/i915/lmem: support pwrite

Message ID	20200710115757.290984-34-matthew.auld@intel.com (mailing list archive)
State	New, archived
Series	DG1 LMEM enabling

Matthew Auld July 10, 2020, 11:57 a.m. UTC

We need to add support for pwrite'ing an LMEM object.

Signed-off-by: Matthew Auld <matthew.auld@intel.com>
Signed-off-by: Steve Hampson <steven.t.hampson@intel.com>
Cc: Joonas Lahtinen <joonas.lahtinen@linux.intel.com>
Cc: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>
---
 drivers/gpu/drm/i915/gem/i915_gem_lmem.c | 86 ++++++++++++++++++++++++
 1 file changed, 86 insertions(+)

Dave Airlie July 13, 2020, 5:09 a.m. UTC | #1

On Fri, 10 Jul 2020 at 22:00, Matthew Auld <matthew.auld@intel.com> wrote:
>
> We need to add support for pwrite'ing an LMEM object.

why? DG1 is a discrete GPU, these interfaces were already gross and
overly hacky for integrated, I'd prefer not to drag them across into
discrete land.

same goes for pread.

You have no legacy userspace here, userspace needs change to support
LMEM, it can be fixed to avoid legacy ioctls paths.

Dave.

Matthew Auld July 14, 2020, 2:35 p.m. UTC | #2

On 13/07/2020 06:09, Dave Airlie wrote:
> On Fri, 10 Jul 2020 at 22:00, Matthew Auld <matthew.auld@intel.com> wrote:
>>
>> We need to add support for pwrite'ing an LMEM object.
>
> why? DG1 is a discrete GPU, these interfaces were already gross and
> overly hacky for integrated, I'd prefer not to drag them across into
> discrete land.
>
> same goes for pread.
>
> You have no legacy userspace here, userspace needs change to support
> LMEM, it can be fixed to avoid legacy ioctls paths.

Ok, there have also been similar discussions internally in the past. I
think one of the reasons was around IGT, and how keeping the
pread/pwrite interface meant slightly less pain, also it's not much
effort to implement for LMEM. If this is a NACK, then I guess the other
idea was to somehow fall back to mmap and update IGT accordingly.
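
A minimal sketch of what "fall back to mmap" could look like from the userspace side: map the object, copy into the mapping, and unmap, instead of issuing a pwrite-style ioctl. This is an illustration under stated assumptions, not IGT or i915 code. The helper name `write_via_mmap` is hypothetical, and the file descriptor is treated as directly mappable (a plain file works as a stand-in). A real i915 client would first have to obtain a mappable offset for its GEM handle from the driver; that step is omitted here.

```c
#include <errno.h>
#include <stdint.h>
#include <string.h>
#include <sys/mman.h>
#include <sys/types.h>
#include <unistd.h>

/*
 * Hypothetical helper: write len bytes from buf at byte offset off into the
 * object behind fd by mapping it, rather than calling a pwrite-style ioctl.
 * Returns 0 on success or a negative errno value.
 */
static int write_via_mmap(int fd, const void *buf, size_t len, off_t off)
{
	long page = sysconf(_SC_PAGESIZE);
	off_t map_off = off - (off % page);     /* mmap offsets must be page aligned */
	size_t delta = (size_t)(off - map_off); /* distance from mapping start to off */
	size_t map_len = len + delta;
	uint8_t *ptr;

	if (len == 0)
		return 0;

	ptr = mmap(NULL, map_len, PROT_READ | PROT_WRITE, MAP_SHARED, fd, map_off);
	if (ptr == MAP_FAILED)
		return -errno;

	memcpy(ptr + delta, buf, len);

	if (munmap(ptr, map_len) != 0)
		return -errno;
	return 0;
}
```

A test helper shaped like this keeps the copy in userspace, so the kernel would not need a dedicated write path for the object.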

> 
> Dave.
>

Dave Airlie July 16, 2020, 12:43 a.m. UTC | #3

On Wed, 15 Jul 2020 at 00:35, Matthew Auld <matthew.auld@intel.com> wrote:
>
> On 13/07/2020 06:09, Dave Airlie wrote:
> > On Fri, 10 Jul 2020 at 22:00, Matthew Auld <matthew.auld@intel.com> wrote:
> >>
> >> We need to add support for pwrite'ing an LMEM object.
> >
> > why? DG1 is a discrete GPU, these interfaces were already gross and
> > overly hacky for integrated, I'd prefer not to drag them across into
> > discrete land.
> >
> > same goes for pread.
> >
> > You have no legacy userspace here, userspace needs change to support
> > LMEM, it can be fixed to avoid legacy ioctls paths.
>
> Ok, there have also been similar discussions internally in the past. I
> think one of the reasons was around IGT, and how keeping the
> pread/pwrite interface meant slightly less pain, also it's not much
> effort to implement for LMEM. If this is a NACK, then I guess the other
> idea was to somehow fall back to mmap and update IGT accordingly.

I just don't think we should have internal kernel interfaces for
mapping ram in the kernel address space, seems pointless, makes less
sense with a discrete GPU in the mix, so yes I think NAK for
pread/pwrite at least at this time.

I'd also like to see a hard no relocs policy for DG1 enforced in the kernel.

Dave.
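
One way a "hard no relocs" policy could be expressed is an early validation step that rejects any submission whose objects still carry relocation entries when the device is discrete. The sketch below only illustrates that idea. `struct exec_object` and `validate_no_relocs` are hypothetical stand-ins, not the i915 uAPI structures or driver functions, and the choice of `-EINVAL` is an assumption.

```c
#include <errno.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for a per-object execbuf entry; not the i915 uAPI struct. */
struct exec_object {
	uint32_t handle;
	uint32_t relocation_count;
};

/*
 * Return 0 if the submission is acceptable, or -EINVAL if a discrete device
 * is handed any object that still carries relocation entries.
 */
static int validate_no_relocs(bool is_discrete,
			      const struct exec_object *objs, size_t count)
{
	size_t i;

	if (!is_discrete)
		return 0; /* integrated platforms keep the legacy behaviour */

	for (i = 0; i < count; i++) {
		if (objs[i].relocation_count != 0)
			return -EINVAL;
	}
	return 0;
}
```

Checking up front means userspace sees the failure at submission time instead of silently depending on a path the platform is not meant to support.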

Matthew Auld July 16, 2020, 10:11 a.m. UTC | #4

On 16/07/2020 01:43, Dave Airlie wrote:
> On Wed, 15 Jul 2020 at 00:35, Matthew Auld <matthew.auld@intel.com> wrote:
>>
>> On 13/07/2020 06:09, Dave Airlie wrote:
>>> On Fri, 10 Jul 2020 at 22:00, Matthew Auld <matthew.auld@intel.com> wrote:
>>>>
>>>> We need to add support for pwrite'ing an LMEM object.
>>>
>>> why? DG1 is a discrete GPU, these interfaces were already gross and
>>> overly hacky for integrated, I'd prefer not to drag them across into
>>> discrete land.
>>>
>>> same goes for pread.
>>>
>>> You have no legacy userspace here, userspace needs change to support
>>> LMEM, it can be fixed to avoid legacy ioctls paths.
>>
>> Ok, there have also been similar discussions internally in the past. I
>> think one of the reasons was around IGT, and how keeping the
>> pread/pwrite interface meant slightly less pain, also it's not much
>> effort to implement for LMEM. If this is a NACK, then I guess the other
>> idea was to somehow fall back to mmap and update IGT accordingly.
> 
> I just don't think we should have internal kernel interfaces for
> mapping ram in the kernel address space, seems pointless, makes less
> sense with a discrete GPU in the mix, so yes I think NAK for
> pread/pwrite at least at this time.

Ok.

> 
> I'd also like to see a hard no relocs policy for DG1 enforced in the kernel.

Ok, just checking, is that the case even if we don't require extra code
to support it? We recently dropped the CPU reloc path completely, in
favour of a single GPU reloc path, and so no special code is required to
support LMEM, it should just work. IGT of course makes heavy use of
relocs, so that would need an overhaul.

> 
> Dave.
>

Dave Airlie July 19, 2020, 9:52 p.m. UTC | #5

On Thu, 16 Jul 2020 at 20:11, Matthew Auld <matthew.auld@intel.com> wrote:
>
> On 16/07/2020 01:43, Dave Airlie wrote:
> > On Wed, 15 Jul 2020 at 00:35, Matthew Auld <matthew.auld@intel.com> wrote:
> >>
> >> On 13/07/2020 06:09, Dave Airlie wrote:
> >>> On Fri, 10 Jul 2020 at 22:00, Matthew Auld <matthew.auld@intel.com> wrote:
> >>>>
> >>>> We need to add support for pwrite'ing an LMEM object.
> >>>
> >>> why? DG1 is a discrete GPU, these interfaces were already gross and
> >>> overly hacky for integrated, I'd prefer not to drag them across into
> >>> discrete land.
> >>>
> >>> same goes for pread.
> >>>
> >>> You have no legacy userspace here, userspace needs change to support
> >>> LMEM, it can be fixed to avoid legacy ioctls paths.
> >>
> >> Ok, there have also been similar discussions internally in the past. I
> >> think one of the reasons was around IGT, and how keeping the
> >> pread/pwrite interface meant slightly less pain, also it's not much
> >> effort to implement for LMEM. If this is a NACK, then I guess the other
> >> idea was to somehow fall back to mmap and update IGT accordingly.
> >
> > I just don't think we should have internal kernel interfaces for
> > mapping ram in the kernel address space, seems pointless, makes less
> > sense with a discrete GPU in the mix, so yes I think NAK for
> > pread/pwrite at least at this time.
>
> Ok.
>
> >
> > I'd also like to see a hard no relocs policy for DG1 enforced in the kernel.
>
> Ok, just checking, is that the case even if we don't require extra code
> to support it? We recently dropped the CPU reloc path completely, in
> favour of a single GPU reloc path, and so no special code is required to
> support LMEM, it should just work. IGT of course makes heavy use of
> relocs, so that would need an overhaul.

The GPU reloc path is optimising a path that we simply shouldn't need
or be using.

IGT tests relocs, ripping out relocs should reduce the amount of
testing IGT has to do and reduce CI run times. Why carry the technical
debt deliberately?

I expect the kernel team to be a bit more authoritative inside Intel on
what uAPI gets exposed and why, it seems like every time there is an
attempt to limit the tech debt of carrying forward unnecessary uAPIs
there is some push back for media driver or IGT. Fix stuff and be
harder in pushing back on carrying unneeded interfaces forward so
future products are less mired in pointless debt. DG1 uAPI should
really be a chance to fully review the legacy of integrated graphics +
pre-48-bit VM interfaces and they should all be turned off for DG1 so
that future discrete GPUs can move forward cleaner. I really shouldn't
be the one enforcing this, the i915 team needs to be a bit
authoritative on what is necessary to support.

Dave.

Joonas Lahtinen Aug. 7, 2020, 9:32 a.m. UTC | #6

Quoting Dave Airlie (2020-07-20 00:52:19)
> On Thu, 16 Jul 2020 at 20:11, Matthew Auld <matthew.auld@intel.com> wrote:
> >
> > On 16/07/2020 01:43, Dave Airlie wrote:
> > > On Wed, 15 Jul 2020 at 00:35, Matthew Auld <matthew.auld@intel.com> wrote:
> > >>
> > >> On 13/07/2020 06:09, Dave Airlie wrote:
> > >>> On Fri, 10 Jul 2020 at 22:00, Matthew Auld <matthew.auld@intel.com> wrote:
> > >>>>
> > >>>> We need to add support for pwrite'ing an LMEM object.
> > >>>
> > >>> why? DG1 is a discrete GPU, these interfaces were already gross and
> > >>> overly hacky for integrated, I'd prefer not to drag them across into
> > >>> discrete land.
> > >>>
> > >>> same goes for pread.
> > >>>
> > >>> You have no legacy userspace here, userspace needs change to support
> > >>> LMEM, it can be fixed to avoid legacy ioctls paths.
> > >>
> > >> Ok, there have also been similar discussions internally in the past. I
> > >> think one of the reasons was around IGT, and how keeping the
> > >> pread/pwrite interface meant slightly less pain, also it's not much
> > >> effort to implement for LMEM. If this is a NACK, then I guess the other
> > >> idea was to somehow fall back to mmap and update IGT accordingly.
> > >
> > > I just don't think we should have internal kernel interfaces for
> > > mapping ram in the kernel address space, seems pointless, makes less
> > > sense with a discrete GPU in the mix, so yes I think NAK for
> > > pread/pwrite at least at this time.
> >
> > Ok.
> >
> > >
> > > I'd also like to see a hard no relocs policy for DG1 enforced in the kernel.
> >
> > Ok, just checking, is that the case even if we don't require extra code
> > to support it? We recently dropped the CPU reloc path completely, in
> > favour of a single GPU reloc path, and so no special code is required to
> > support LMEM, it should just work. IGT of course makes heavy use of
> > relocs, so that would need an overhaul.
>
> The GPU reloc path is optimising a path that we simply shouldn't need
> or be using.
>
> IGT tests relocs, ripping out relocs should reduce the amount of
> testing IGT has to do and reduce CI run times. Why carry the technical
> debt deliberately?

We still have to optimize and keep the relocations for the older
generations, where they are used. So they can't really be eliminated
from the codebase, as much of the code is shared.

Agreed on the benefit in the more distant future coming from dropping
the relocations, once pre-Gen12 hardware is no more.

Note that IGT also uses relocations indirectly in non-relocation-specific
tests, so there is quite some work according to our validation team.

Wrt this RFC, as no extra code is needed, it is faster to get the stack
up and running with relocations. It also keeps the changes between
iGFX and dGFX minimal, which should help debugging. So that path was
taken to get the functional RFC out as fast as possible.

Moving away from relocations in both IGT and media driver is being
discussed and worked on. See below.

> I expect the kernel team to be a bit more authoritative inside Intel on
> what uAPI gets exposed and why, it seems like every time there is an
> attempt to limit the tech debt of carrying forward unnecessary uAPIs
> there is some push back for media driver or IGT. Fix stuff and be
> harder in pushing back on carrying unneeded interfaces forward so
> future products are less mired in pointless debt. DG1 uAPI should
> really be a chance to fully review the legacy of integrated graphics +
> pre-48-bit VM interfaces and they should all be turned off for DG1 so
> that future discrete GPUs can move forward cleaner. I really shouldn't
> be the one enforcing this, the i915 team needs to be a bit
> authoritative on what is necessary to support.

The patches were sent out as RFC to collect comments. Based on the
comments, we're expediting the work to eliminate the use of relocations.

Regards, Joonas

Joonas Lahtinen Aug. 7, 2020, 9:46 a.m. UTC | #7

Quoting Dave Airlie (2020-07-13 08:09:30)
> On Fri, 10 Jul 2020 at 22:00, Matthew Auld <matthew.auld@intel.com> wrote:
> >
> > We need to add support for pwrite'ing an LMEM object.
>
> why? DG1 is a discrete GPU, these interfaces were already gross and
> overly hacky for integrated, I'd prefer not to drag them across into
> discrete land.
>
> same goes for pread.
>
> You have no legacy userspace here, userspace needs change to support
> LMEM, it can be fixed to avoid legacy ioctls paths.

(This answer is along the same lines as the one about relocations,
which I sent earlier at the end of this thread.)

PREAD/PWRITE are used indirectly by IGT tests, for example as
a means to validate test end results. So IGT
reworking is needed not to lose testing coverage when the
functionality is disabled.

Same reasoning as with relocations, as to why this is included
in the RFC; it will get a functional stack with the least changes and
is virtually no extra effort to carry. It's recognized that once
pre-Gen12 hardware ceases to exist, there are optimization
opportunities.

Based on the feedback to this Request for Comments series, we are
expediting the work on those IGT reworks.

Regards, Joonas

Dave Airlie Aug. 9, 2020, 9:06 p.m. UTC | #8

On Fri, 7 Aug 2020 at 19:46, Joonas Lahtinen
<joonas.lahtinen@linux.intel.com> wrote:
>
> Quoting Dave Airlie (2020-07-13 08:09:30)
> > On Fri, 10 Jul 2020 at 22:00, Matthew Auld <matthew.auld@intel.com> wrote:
> > >
> > > We need to add support for pwrite'ing an LMEM object.
> >
> > why? DG1 is a discrete GPU, these interfaces were already gross and
> > overly hacky for integrated, I'd prefer not to drag them across into
> > discrete land.
> >
> > same goes for pread.
> >
> > You have no legacy userspace here, userspace needs change to support
> > LMEM, it can be fixed to avoid legacy ioctls paths.
>
> (This answer is along the same lines as the one about relocations,
> which I sent earlier at the end of this thread.)
>
> PREAD/PWRITE are used indirectly by IGT tests, for example as
> a means to validate test end results. So IGT
> reworking is needed not to lose testing coverage when the
> functionality is disabled.
>
> Same reasoning as with relocations, as to why this is included
> in the RFC; it will get a functional stack with the least changes and
> is virtually no extra effort to carry. It's recognized that once
> pre-Gen12 hardware ceases to exist, there are optimization
> opportunities.

Why do you have to wait until pre-GEN12 hw ceases to exist?

There are clear optimization opportunities since Broadwell, there have
been clear pointless optimisations done to the kernel because
userspace was stuck to the legacy paths way past when it made sense.

I'm pretty sure the media driver is Broadwell+ in terms of hw support,
but it's all relocation heavy, like why isn't that the priority of the
kernel team to just go fix it rather than make the kernel long term
harder to maintain. Siloed development is an artifact of *your*
orgchart, and shouldn't leak into the upstream kernel development if
not required.

Would IGT ever have been fixed up? It's test code, it should have been
roadmapped and resourced to fix up instead of writing kernel code for
platforms that don't require it. You can validate IGT on the older
hardware anyways.

If you need test interfaces in the kernel, please put them under
BROKEN or somewhere that only CI finds them, until you can rework CI
tests, but we need to be a bit more stringent on carrying forward
interfaces that aren't needed by modern HW, or else you will never get
to pre-GEN12 disappearing, like in reality it should in a lot of cases
be pre-GEN8.

Dave.
