Message ID | 20221222222127.34560-1-matthew.brost@intel.com (mailing list archive) |
---|---|
Headers | show |
Series | Initial Xe driver submission | expand |
Hi Am 22.12.22 um 23:21 schrieb Matthew Brost: > Hello, > > This is a submission for Xe, a new driver for Intel GPUs that supports both > integrated and discrete platforms starting with Tiger Lake (first platform with > Intel Xe Architecture). The intention of this new driver is to have a fresh base > to work from that is unencumbered by older platforms, whilst also taking the > opportunity to rearchitect our driver to increase sharing across the drm > subsystem, both leveraging and allowing us to contribute more towards other > shared components like TTM and drm/scheduler. The memory model is based on VM > bind which is similar to the i915 implementation. Likewise the execbuf > implementation for Xe is very similar to execbuf3 in the i915 [1]. After Xe has stabilized, will i915 loose the ability to drive this hardware (and possibly other)? I'm specfically thinking of the i915 code that requires TTM. Keeping that dependecy within Xe only might benefit DRM as a whole. > > The code is at a stage where it is already functional and has experimental > support for multiple platforms starting from Tiger Lake, with initial support > implemented in Mesa (for Iris and Anv, our OpenGL and Vulkan drivers), as well > as in NEO (for OpenCL and Level0). A Mesa MR has been posted [2] and NEO > implementation will be released publicly early next year. We also have a suite > of IGTs for XE that will appear on the IGT list shortly. > > It has been built with the assumption of supporting multiple architectures from > the get-go, right now with tests running both on X86 and ARM hosts. And we > intend to continue working on it and improving on it as part of the kernel > community upstream. > > The new Xe driver leverages a lot from i915 and work on i915 continues as we > ready Xe for production throughout 2023. > > As for display, the intent is to share the display code with the i915 driver so > that there is maximum reuse there. Currently this is being done by compiling the > display code twice, but alternatives to that are under consideration and we want > to have more discussion on what the best final solution will look like over the > next few months. Right now, work is ongoing in refactoring the display codebase > to remove as much as possible any unnecessary dependencies on i915 specific data > structures there.. Could both drivers reside in a common parent directory and share something like a DRM Intel helper module with the common code? This would fit well with the common design of DRM helpers. Best regards Thomas > > We currently have 2 submission backends, execlists and GuC. The execlist is > meant mostly for testing and is not fully functional while GuC backend is fully > functional. As with the i915 and GuC submission, in Xe the GuC firmware is > required and should be placed in /lib/firmware/xe. > > The GuC firmware can be found in the below location: > https://git.kernel.org/pub/scm/linux/kernel/git/firmware/linux-firmware.git/tree/i915 > > The easiest way to setup firmware is: > cp -r /lib/firmware/i915 /lib/firmware/xe > > The code has been organized such that we have all patches that touch areas > outside of drm/xe first for review, and then the actual new driver in a separate > commit. The code which is outside of drm/xe is included in this RFC while > drm/xe is not due to the size of the commit. The drm/xe is code is available in > a public repo listed below. > > Xe driver commit: > https://cgit.freedesktop.org/drm/drm-xe/commit/?h=drm-xe-next&id=9cb016ebbb6a275f57b1cb512b95d5a842391ad7 > > Xe kernel repo: > https://cgit.freedesktop.org/drm/drm-xe/ > > There's a lot of work still to happen on Xe but we're very excited about it and > wanted to share it early and welcome feedback and discussion. > > Cheers, > Matthew Brost > > [1] https://patchwork.freedesktop.org/series/105879/ > [2] https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20418 > > Maarten Lankhorst (12): > drm/amd: Convert amdgpu to use suballocation helper. > drm/radeon: Use the drm suballocation manager implementation. > drm/i915: Remove gem and overlay frontbuffer tracking > drm/i915/display: Neuter frontbuffer tracking harder > drm/i915/display: Add more macros to remove all direct calls to uncore > drm/i915/display: Remove all uncore mmio accesses in favor of intel_de > drm/i915: Rename find_section to find_bdb_section > drm/i915/regs: Set DISPLAY_MMIO_BASE to 0 for xe > drm/i915/display: Fix a use-after-free when intel_edp_init_connector > fails > drm/i915/display: Remaining changes to make xe compile > sound/hda: Allow XE as i915 replacement for sound > mei/hdcp: Also enable for XE > > Matthew Brost (5): > drm/sched: Convert drm scheduler to use a work queue rather than > kthread > drm/sched: Add generic scheduler message interface > drm/sched: Start run wq before TDR in drm_sched_start > drm/sched: Submit job before starting TDR > drm/sched: Add helper to set TDR timeout > > Thomas Hellström (3): > drm/suballoc: Introduce a generic suballocation manager > drm: Add a gpu page-table walker helper > drm/ttm: Don't print error message if eviction was interrupted > > drivers/gpu/drm/Kconfig | 5 + > drivers/gpu/drm/Makefile | 4 + > drivers/gpu/drm/amd/amdgpu/Kconfig | 1 + > drivers/gpu/drm/amd/amdgpu/amdgpu.h | 26 +- > drivers/gpu/drm/amd/amdgpu/amdgpu_debugfs.c | 14 +- > drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 12 +- > drivers/gpu/drm/amd/amdgpu/amdgpu_ib.c | 5 +- > drivers/gpu/drm/amd/amdgpu/amdgpu_object.h | 23 +- > drivers/gpu/drm/amd/amdgpu/amdgpu_ring.h | 3 +- > drivers/gpu/drm/amd/amdgpu/amdgpu_sa.c | 320 +----------------- > drivers/gpu/drm/drm_pt_walk.c | 159 +++++++++ > drivers/gpu/drm/drm_suballoc.c | 301 ++++++++++++++++ > drivers/gpu/drm/i915/Makefile | 2 +- > drivers/gpu/drm/i915/display/hsw_ips.c | 7 +- > drivers/gpu/drm/i915/display/i9xx_plane.c | 1 + > drivers/gpu/drm/i915/display/intel_atomic.c | 2 + > .../gpu/drm/i915/display/intel_atomic_plane.c | 25 +- > .../gpu/drm/i915/display/intel_backlight.c | 2 +- > drivers/gpu/drm/i915/display/intel_bios.c | 71 ++-- > drivers/gpu/drm/i915/display/intel_bw.c | 36 +- > drivers/gpu/drm/i915/display/intel_cdclk.c | 68 ++-- > drivers/gpu/drm/i915/display/intel_color.c | 1 + > drivers/gpu/drm/i915/display/intel_crtc.c | 14 +- > drivers/gpu/drm/i915/display/intel_cursor.c | 14 +- > drivers/gpu/drm/i915/display/intel_de.h | 38 +++ > drivers/gpu/drm/i915/display/intel_display.c | 155 +++++++-- > drivers/gpu/drm/i915/display/intel_display.h | 9 +- > .../gpu/drm/i915/display/intel_display_core.h | 5 +- > .../drm/i915/display/intel_display_debugfs.c | 8 + > .../drm/i915/display/intel_display_power.c | 40 ++- > .../drm/i915/display/intel_display_power.h | 6 + > .../i915/display/intel_display_power_map.c | 7 + > .../i915/display/intel_display_power_well.c | 24 +- > .../drm/i915/display/intel_display_reg_defs.h | 4 + > .../drm/i915/display/intel_display_trace.h | 6 + > .../drm/i915/display/intel_display_types.h | 32 +- > drivers/gpu/drm/i915/display/intel_dmc.c | 17 +- > drivers/gpu/drm/i915/display/intel_dp.c | 11 +- > drivers/gpu/drm/i915/display/intel_dp_aux.c | 6 + > drivers/gpu/drm/i915/display/intel_dpio_phy.c | 9 +- > drivers/gpu/drm/i915/display/intel_dpio_phy.h | 15 + > drivers/gpu/drm/i915/display/intel_dpll.c | 8 +- > drivers/gpu/drm/i915/display/intel_dpll_mgr.c | 4 + > drivers/gpu/drm/i915/display/intel_drrs.c | 1 + > drivers/gpu/drm/i915/display/intel_dsb.c | 124 +++++-- > drivers/gpu/drm/i915/display/intel_dsi_vbt.c | 26 +- > drivers/gpu/drm/i915/display/intel_fb.c | 108 ++++-- > drivers/gpu/drm/i915/display/intel_fb_pin.c | 6 - > drivers/gpu/drm/i915/display/intel_fbc.c | 49 ++- > drivers/gpu/drm/i915/display/intel_fbdev.c | 108 +++++- > .../gpu/drm/i915/display/intel_frontbuffer.c | 103 +----- > .../gpu/drm/i915/display/intel_frontbuffer.h | 67 +--- > drivers/gpu/drm/i915/display/intel_gmbus.c | 2 +- > drivers/gpu/drm/i915/display/intel_hdcp.c | 9 +- > drivers/gpu/drm/i915/display/intel_hdmi.c | 1 - > .../gpu/drm/i915/display/intel_lpe_audio.h | 8 + > .../drm/i915/display/intel_modeset_setup.c | 11 +- > drivers/gpu/drm/i915/display/intel_opregion.c | 2 +- > drivers/gpu/drm/i915/display/intel_overlay.c | 14 - > .../gpu/drm/i915/display/intel_pch_display.h | 16 + > .../gpu/drm/i915/display/intel_pch_refclk.h | 8 + > drivers/gpu/drm/i915/display/intel_pipe_crc.c | 1 + > .../drm/i915/display/intel_plane_initial.c | 3 +- > drivers/gpu/drm/i915/display/intel_psr.c | 1 + > drivers/gpu/drm/i915/display/intel_sprite.c | 21 ++ > drivers/gpu/drm/i915/display/intel_vbt_defs.h | 2 +- > drivers/gpu/drm/i915/display/intel_vga.c | 5 + > drivers/gpu/drm/i915/display/skl_scaler.c | 2 + > .../drm/i915/display/skl_universal_plane.c | 52 ++- > drivers/gpu/drm/i915/display/skl_watermark.c | 25 +- > drivers/gpu/drm/i915/gem/i915_gem_clflush.c | 4 - > drivers/gpu/drm/i915/gem/i915_gem_domain.c | 7 - > .../gpu/drm/i915/gem/i915_gem_execbuffer.c | 2 - > drivers/gpu/drm/i915/gem/i915_gem_object.c | 25 -- > drivers/gpu/drm/i915/gem/i915_gem_object.h | 22 -- > drivers/gpu/drm/i915/gem/i915_gem_phys.c | 4 - > drivers/gpu/drm/i915/gt/intel_gt_regs.h | 3 +- > drivers/gpu/drm/i915/i915_driver.c | 1 + > drivers/gpu/drm/i915/i915_gem.c | 8 - > drivers/gpu/drm/i915/i915_gem_gtt.c | 1 - > drivers/gpu/drm/i915/i915_reg_defs.h | 8 + > drivers/gpu/drm/i915/i915_vma.c | 12 - > drivers/gpu/drm/radeon/radeon.h | 55 +-- > drivers/gpu/drm/radeon/radeon_ib.c | 12 +- > drivers/gpu/drm/radeon/radeon_object.h | 25 +- > drivers/gpu/drm/radeon/radeon_sa.c | 314 ++--------------- > drivers/gpu/drm/radeon/radeon_semaphore.c | 6 +- > drivers/gpu/drm/scheduler/sched_main.c | 182 +++++++--- > drivers/gpu/drm/ttm/ttm_bo.c | 3 +- > drivers/misc/mei/hdcp/Kconfig | 2 +- > drivers/misc/mei/hdcp/mei_hdcp.c | 3 +- > include/drm/drm_pt_walk.h | 161 +++++++++ > include/drm/drm_suballoc.h | 112 ++++++ > include/drm/gpu_scheduler.h | 41 ++- > sound/hda/hdac_i915.c | 17 +- > sound/pci/hda/hda_intel.c | 56 +-- > sound/soc/intel/avs/core.c | 13 +- > sound/soc/sof/intel/hda.c | 7 +- > 98 files changed, 2076 insertions(+), 1325 deletions(-) > create mode 100644 drivers/gpu/drm/drm_pt_walk.c > create mode 100644 drivers/gpu/drm/drm_suballoc.c > create mode 100644 include/drm/drm_pt_walk.h > create mode 100644 include/drm/drm_suballoc.h >
On Mon, 02 Jan 2023, Thomas Zimmermann <tzimmermann@suse.de> wrote: > Hi > > Am 22.12.22 um 23:21 schrieb Matthew Brost: >> Hello, >> >> This is a submission for Xe, a new driver for Intel GPUs that supports both >> integrated and discrete platforms starting with Tiger Lake (first platform with >> Intel Xe Architecture). The intention of this new driver is to have a fresh base >> to work from that is unencumbered by older platforms, whilst also taking the >> opportunity to rearchitect our driver to increase sharing across the drm >> subsystem, both leveraging and allowing us to contribute more towards other >> shared components like TTM and drm/scheduler. The memory model is based on VM >> bind which is similar to the i915 implementation. Likewise the execbuf >> implementation for Xe is very similar to execbuf3 in the i915 [1]. > > After Xe has stabilized, will i915 loose the ability to drive this > hardware (and possibly other)? I'm specfically thinking of the i915 > code that requires TTM. Keeping that dependecy within Xe only might > benefit DRM as a whole. There's going to be a number of platforms supported by both drivers, and from purely a i915 standpoint dropping any currently supported platforms or that dependency from i915 would be a regression. >> >> The code is at a stage where it is already functional and has experimental >> support for multiple platforms starting from Tiger Lake, with initial support >> implemented in Mesa (for Iris and Anv, our OpenGL and Vulkan drivers), as well >> as in NEO (for OpenCL and Level0). A Mesa MR has been posted [2] and NEO >> implementation will be released publicly early next year. We also have a suite >> of IGTs for XE that will appear on the IGT list shortly. >> >> It has been built with the assumption of supporting multiple architectures from >> the get-go, right now with tests running both on X86 and ARM hosts. And we >> intend to continue working on it and improving on it as part of the kernel >> community upstream. >> >> The new Xe driver leverages a lot from i915 and work on i915 continues as we >> ready Xe for production throughout 2023. >> >> As for display, the intent is to share the display code with the i915 driver so >> that there is maximum reuse there. Currently this is being done by compiling the >> display code twice, but alternatives to that are under consideration and we want >> to have more discussion on what the best final solution will look like over the >> next few months. Right now, work is ongoing in refactoring the display codebase >> to remove as much as possible any unnecessary dependencies on i915 specific data >> structures there.. > > Could both drivers reside in a common parent directory and share > something like a DRM Intel helper module with the common code? This > would fit well with the common design of DRM helpers. I think it's too early to tell. For one thing, setting that up would be a lot of up front infrastructure work. I'm not sure how to even pull that off when Xe is still out-of-tree and i915 development plunges on upstream as ever. For another, realistically, the overlap between supported platforms is going to end at some point, and eventually new platforms are only going to be supported with Xe. That's going to open up new possibilities for refactoring also the display code. I think it would be premature to lock in to a common directory structure or a common helper module at this point. I'm not saying no to the idea, and we've contemplated it before, but I think there are still too many moving parts to decide to go that way. BR, Jani. > > Best regards > Thomas > >> >> We currently have 2 submission backends, execlists and GuC. The execlist is >> meant mostly for testing and is not fully functional while GuC backend is fully >> functional. As with the i915 and GuC submission, in Xe the GuC firmware is >> required and should be placed in /lib/firmware/xe. >> >> The GuC firmware can be found in the below location: >> https://git.kernel.org/pub/scm/linux/kernel/git/firmware/linux-firmware.git/tree/i915 >> >> The easiest way to setup firmware is: >> cp -r /lib/firmware/i915 /lib/firmware/xe >> >> The code has been organized such that we have all patches that touch areas >> outside of drm/xe first for review, and then the actual new driver in a separate >> commit. The code which is outside of drm/xe is included in this RFC while >> drm/xe is not due to the size of the commit. The drm/xe is code is available in >> a public repo listed below. >> >> Xe driver commit: >> https://cgit.freedesktop.org/drm/drm-xe/commit/?h=drm-xe-next&id=9cb016ebbb6a275f57b1cb512b95d5a842391ad7 >> >> Xe kernel repo: >> https://cgit.freedesktop.org/drm/drm-xe/ >> >> There's a lot of work still to happen on Xe but we're very excited about it and >> wanted to share it early and welcome feedback and discussion. >> >> Cheers, >> Matthew Brost >> >> [1] https://patchwork.freedesktop.org/series/105879/ >> [2] https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20418 >> >> Maarten Lankhorst (12): >> drm/amd: Convert amdgpu to use suballocation helper. >> drm/radeon: Use the drm suballocation manager implementation. >> drm/i915: Remove gem and overlay frontbuffer tracking >> drm/i915/display: Neuter frontbuffer tracking harder >> drm/i915/display: Add more macros to remove all direct calls to uncore >> drm/i915/display: Remove all uncore mmio accesses in favor of intel_de >> drm/i915: Rename find_section to find_bdb_section >> drm/i915/regs: Set DISPLAY_MMIO_BASE to 0 for xe >> drm/i915/display: Fix a use-after-free when intel_edp_init_connector >> fails >> drm/i915/display: Remaining changes to make xe compile >> sound/hda: Allow XE as i915 replacement for sound >> mei/hdcp: Also enable for XE >> >> Matthew Brost (5): >> drm/sched: Convert drm scheduler to use a work queue rather than >> kthread >> drm/sched: Add generic scheduler message interface >> drm/sched: Start run wq before TDR in drm_sched_start >> drm/sched: Submit job before starting TDR >> drm/sched: Add helper to set TDR timeout >> >> Thomas Hellström (3): >> drm/suballoc: Introduce a generic suballocation manager >> drm: Add a gpu page-table walker helper >> drm/ttm: Don't print error message if eviction was interrupted >> >> drivers/gpu/drm/Kconfig | 5 + >> drivers/gpu/drm/Makefile | 4 + >> drivers/gpu/drm/amd/amdgpu/Kconfig | 1 + >> drivers/gpu/drm/amd/amdgpu/amdgpu.h | 26 +- >> drivers/gpu/drm/amd/amdgpu/amdgpu_debugfs.c | 14 +- >> drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 12 +- >> drivers/gpu/drm/amd/amdgpu/amdgpu_ib.c | 5 +- >> drivers/gpu/drm/amd/amdgpu/amdgpu_object.h | 23 +- >> drivers/gpu/drm/amd/amdgpu/amdgpu_ring.h | 3 +- >> drivers/gpu/drm/amd/amdgpu/amdgpu_sa.c | 320 +----------------- >> drivers/gpu/drm/drm_pt_walk.c | 159 +++++++++ >> drivers/gpu/drm/drm_suballoc.c | 301 ++++++++++++++++ >> drivers/gpu/drm/i915/Makefile | 2 +- >> drivers/gpu/drm/i915/display/hsw_ips.c | 7 +- >> drivers/gpu/drm/i915/display/i9xx_plane.c | 1 + >> drivers/gpu/drm/i915/display/intel_atomic.c | 2 + >> .../gpu/drm/i915/display/intel_atomic_plane.c | 25 +- >> .../gpu/drm/i915/display/intel_backlight.c | 2 +- >> drivers/gpu/drm/i915/display/intel_bios.c | 71 ++-- >> drivers/gpu/drm/i915/display/intel_bw.c | 36 +- >> drivers/gpu/drm/i915/display/intel_cdclk.c | 68 ++-- >> drivers/gpu/drm/i915/display/intel_color.c | 1 + >> drivers/gpu/drm/i915/display/intel_crtc.c | 14 +- >> drivers/gpu/drm/i915/display/intel_cursor.c | 14 +- >> drivers/gpu/drm/i915/display/intel_de.h | 38 +++ >> drivers/gpu/drm/i915/display/intel_display.c | 155 +++++++-- >> drivers/gpu/drm/i915/display/intel_display.h | 9 +- >> .../gpu/drm/i915/display/intel_display_core.h | 5 +- >> .../drm/i915/display/intel_display_debugfs.c | 8 + >> .../drm/i915/display/intel_display_power.c | 40 ++- >> .../drm/i915/display/intel_display_power.h | 6 + >> .../i915/display/intel_display_power_map.c | 7 + >> .../i915/display/intel_display_power_well.c | 24 +- >> .../drm/i915/display/intel_display_reg_defs.h | 4 + >> .../drm/i915/display/intel_display_trace.h | 6 + >> .../drm/i915/display/intel_display_types.h | 32 +- >> drivers/gpu/drm/i915/display/intel_dmc.c | 17 +- >> drivers/gpu/drm/i915/display/intel_dp.c | 11 +- >> drivers/gpu/drm/i915/display/intel_dp_aux.c | 6 + >> drivers/gpu/drm/i915/display/intel_dpio_phy.c | 9 +- >> drivers/gpu/drm/i915/display/intel_dpio_phy.h | 15 + >> drivers/gpu/drm/i915/display/intel_dpll.c | 8 +- >> drivers/gpu/drm/i915/display/intel_dpll_mgr.c | 4 + >> drivers/gpu/drm/i915/display/intel_drrs.c | 1 + >> drivers/gpu/drm/i915/display/intel_dsb.c | 124 +++++-- >> drivers/gpu/drm/i915/display/intel_dsi_vbt.c | 26 +- >> drivers/gpu/drm/i915/display/intel_fb.c | 108 ++++-- >> drivers/gpu/drm/i915/display/intel_fb_pin.c | 6 - >> drivers/gpu/drm/i915/display/intel_fbc.c | 49 ++- >> drivers/gpu/drm/i915/display/intel_fbdev.c | 108 +++++- >> .../gpu/drm/i915/display/intel_frontbuffer.c | 103 +----- >> .../gpu/drm/i915/display/intel_frontbuffer.h | 67 +--- >> drivers/gpu/drm/i915/display/intel_gmbus.c | 2 +- >> drivers/gpu/drm/i915/display/intel_hdcp.c | 9 +- >> drivers/gpu/drm/i915/display/intel_hdmi.c | 1 - >> .../gpu/drm/i915/display/intel_lpe_audio.h | 8 + >> .../drm/i915/display/intel_modeset_setup.c | 11 +- >> drivers/gpu/drm/i915/display/intel_opregion.c | 2 +- >> drivers/gpu/drm/i915/display/intel_overlay.c | 14 - >> .../gpu/drm/i915/display/intel_pch_display.h | 16 + >> .../gpu/drm/i915/display/intel_pch_refclk.h | 8 + >> drivers/gpu/drm/i915/display/intel_pipe_crc.c | 1 + >> .../drm/i915/display/intel_plane_initial.c | 3 +- >> drivers/gpu/drm/i915/display/intel_psr.c | 1 + >> drivers/gpu/drm/i915/display/intel_sprite.c | 21 ++ >> drivers/gpu/drm/i915/display/intel_vbt_defs.h | 2 +- >> drivers/gpu/drm/i915/display/intel_vga.c | 5 + >> drivers/gpu/drm/i915/display/skl_scaler.c | 2 + >> .../drm/i915/display/skl_universal_plane.c | 52 ++- >> drivers/gpu/drm/i915/display/skl_watermark.c | 25 +- >> drivers/gpu/drm/i915/gem/i915_gem_clflush.c | 4 - >> drivers/gpu/drm/i915/gem/i915_gem_domain.c | 7 - >> .../gpu/drm/i915/gem/i915_gem_execbuffer.c | 2 - >> drivers/gpu/drm/i915/gem/i915_gem_object.c | 25 -- >> drivers/gpu/drm/i915/gem/i915_gem_object.h | 22 -- >> drivers/gpu/drm/i915/gem/i915_gem_phys.c | 4 - >> drivers/gpu/drm/i915/gt/intel_gt_regs.h | 3 +- >> drivers/gpu/drm/i915/i915_driver.c | 1 + >> drivers/gpu/drm/i915/i915_gem.c | 8 - >> drivers/gpu/drm/i915/i915_gem_gtt.c | 1 - >> drivers/gpu/drm/i915/i915_reg_defs.h | 8 + >> drivers/gpu/drm/i915/i915_vma.c | 12 - >> drivers/gpu/drm/radeon/radeon.h | 55 +-- >> drivers/gpu/drm/radeon/radeon_ib.c | 12 +- >> drivers/gpu/drm/radeon/radeon_object.h | 25 +- >> drivers/gpu/drm/radeon/radeon_sa.c | 314 ++--------------- >> drivers/gpu/drm/radeon/radeon_semaphore.c | 6 +- >> drivers/gpu/drm/scheduler/sched_main.c | 182 +++++++--- >> drivers/gpu/drm/ttm/ttm_bo.c | 3 +- >> drivers/misc/mei/hdcp/Kconfig | 2 +- >> drivers/misc/mei/hdcp/mei_hdcp.c | 3 +- >> include/drm/drm_pt_walk.h | 161 +++++++++ >> include/drm/drm_suballoc.h | 112 ++++++ >> include/drm/gpu_scheduler.h | 41 ++- >> sound/hda/hdac_i915.c | 17 +- >> sound/pci/hda/hda_intel.c | 56 +-- >> sound/soc/intel/avs/core.c | 13 +- >> sound/soc/sof/intel/hda.c | 7 +- >> 98 files changed, 2076 insertions(+), 1325 deletions(-) >> create mode 100644 drivers/gpu/drm/drm_pt_walk.c >> create mode 100644 drivers/gpu/drm/drm_suballoc.c >> create mode 100644 include/drm/drm_pt_walk.h >> create mode 100644 include/drm/drm_suballoc.h >>
On 22/12/2022 22:21, Matthew Brost wrote: > Hello, > > This is a submission for Xe, a new driver for Intel GPUs that supports both > integrated and discrete platforms starting with Tiger Lake (first platform with > Intel Xe Architecture). The intention of this new driver is to have a fresh base > to work from that is unencumbered by older platforms, whilst also taking the > opportunity to rearchitect our driver to increase sharing across the drm > subsystem, both leveraging and allowing us to contribute more towards other > shared components like TTM and drm/scheduler. The memory model is based on VM > bind which is similar to the i915 implementation. Likewise the execbuf > implementation for Xe is very similar to execbuf3 in the i915 [1]. > > The code is at a stage where it is already functional and has experimental > support for multiple platforms starting from Tiger Lake, with initial support > implemented in Mesa (for Iris and Anv, our OpenGL and Vulkan drivers), as well > as in NEO (for OpenCL and Level0). A Mesa MR has been posted [2] and NEO > implementation will be released publicly early next year. We also have a suite > of IGTs for XE that will appear on the IGT list shortly. > > It has been built with the assumption of supporting multiple architectures from > the get-go, right now with tests running both on X86 and ARM hosts. And we > intend to continue working on it and improving on it as part of the kernel > community upstream. > > The new Xe driver leverages a lot from i915 and work on i915 continues as we > ready Xe for production throughout 2023. > > As for display, the intent is to share the display code with the i915 driver so > that there is maximum reuse there. Currently this is being done by compiling the > display code twice, but alternatives to that are under consideration and we want > to have more discussion on what the best final solution will look like over the > next few months. Right now, work is ongoing in refactoring the display codebase > to remove as much as possible any unnecessary dependencies on i915 specific data > structures there.. > > We currently have 2 submission backends, execlists and GuC. The execlist is > meant mostly for testing and is not fully functional while GuC backend is fully > functional. As with the i915 and GuC submission, in Xe the GuC firmware is > required and should be placed in /lib/firmware/xe. What is the plan going forward for the execlists backend? I think it would be preferable to not upstream something semi-functional and so to carry technical debt in the brand new code base, from the very start. If it is for Tigerlake, which is the starting platform for Xe, could it be made GuC only Tigerlake for instance? Regards, Tvrtko
Hi, On Mon, 02 Jan 2023 13:42:46 +0200 Jani Nikula <jani.nikula@linux.intel.com> wrote: > On Mon, 02 Jan 2023, Thomas Zimmermann <tzimmermann@suse.de> wrote: > > Hi > > > > Am 22.12.22 um 23:21 schrieb Matthew Brost: > >> Hello, > >> > >> This is a submission for Xe, a new driver for Intel GPUs that supports both > >> integrated and discrete platforms starting with Tiger Lake (first platform with > >> Intel Xe Architecture). The intention of this new driver is to have a fresh base > >> to work from that is unencumbered by older platforms, whilst also taking the > >> opportunity to rearchitect our driver to increase sharing across the drm > >> subsystem, both leveraging and allowing us to contribute more towards other > >> shared components like TTM and drm/scheduler. The memory model is based on VM > >> bind which is similar to the i915 implementation. Likewise the execbuf > >> implementation for Xe is very similar to execbuf3 in the i915 [1]. > > > > After Xe has stabilized, will i915 loose the ability to drive this > > hardware (and possibly other)? I'm specfically thinking of the i915 > > code that requires TTM. Keeping that dependecy within Xe only might > > benefit DRM as a whole. > > There's going to be a number of platforms supported by both drivers, and > from purely a i915 standpoint dropping any currently supported platforms > or that dependency from i915 would be a regression. > > >> > >> The code is at a stage where it is already functional and has experimental > >> support for multiple platforms starting from Tiger Lake, with initial support > >> implemented in Mesa (for Iris and Anv, our OpenGL and Vulkan drivers), as well > >> as in NEO (for OpenCL and Level0). A Mesa MR has been posted [2] and NEO > >> implementation will be released publicly early next year. We also have a suite > >> of IGTs for XE that will appear on the IGT list shortly. > >> > >> It has been built with the assumption of supporting multiple architectures from > >> the get-go, right now with tests running both on X86 and ARM hosts. And we > >> intend to continue working on it and improving on it as part of the kernel > >> community upstream. > >> > >> The new Xe driver leverages a lot from i915 and work on i915 continues as we > >> ready Xe for production throughout 2023. > >> > >> As for display, the intent is to share the display code with the i915 driver so > >> that there is maximum reuse there. Currently this is being done by compiling the > >> display code twice, but alternatives to that are under consideration and we want > >> to have more discussion on what the best final solution will look like over the > >> next few months. Right now, work is ongoing in refactoring the display codebase > >> to remove as much as possible any unnecessary dependencies on i915 specific data > >> structures there.. > > > > Could both drivers reside in a common parent directory and share > > something like a DRM Intel helper module with the common code? This > > would fit well with the common design of DRM helpers. > > I think it's too early to tell. > > For one thing, setting that up would be a lot of up front infrastructure > work. I'm not sure how to even pull that off when Xe is still > out-of-tree and i915 development plunges on upstream as ever. > > For another, realistically, the overlap between supported platforms is > going to end at some point, and eventually new platforms are only going > to be supported with Xe. That's going to open up new possibilities for > refactoring also the display code. I think it would be premature to lock > in to a common directory structure or a common helper module at this > point. > > I'm not saying no to the idea, and we've contemplated it before, but I > think there are still too many moving parts to decide to go that way. FWIW, I actually have the same dilemma with the driver for new Mali GPUs I'm working on. I initially started making it a sub-driver of the existing panfrost driver (some HW blocks are similar, like the IOMMU and a few other things, and some SW abstracts can be shared here and there, like the GEM allocator logic). But I'm now considering forking the driver (after Alyssa planted the seed :-)), not only because I want to start from a clean sheet on the the uAPI front (wouldn't be an issue in your case, because you're talking about sharing helpers, not the driver frontend), but also because any refactor to panfrost is a potential source of regression for existing users. So, I tend to agree with Jani here, trying to share code before things have settled down is likely to cause pain to both Xe and i915 users+developers. Best Regards, Boris
> > For one thing, setting that up would be a lot of up front infrastructure > > work. I'm not sure how to even pull that off when Xe is still > > out-of-tree and i915 development plunges on upstream as ever. > > > > For another, realistically, the overlap between supported platforms is > > going to end at some point, and eventually new platforms are only going > > to be supported with Xe. That's going to open up new possibilities for > > refactoring also the display code. I think it would be premature to lock > > in to a common directory structure or a common helper module at this > > point. > > > > I'm not saying no to the idea, and we've contemplated it before, but I > > think there are still too many moving parts to decide to go that way. > > FWIW, I actually have the same dilemma with the driver for new Mali GPUs > I'm working on. I initially started making it a sub-driver of the > existing panfrost driver (some HW blocks are similar, like the > IOMMU and a few other things, and some SW abstracts can be shared here > and there, like the GEM allocator logic). But I'm now considering > forking the driver (after Alyssa planted the seed :-)), not only > because I want to start from a clean sheet on the the uAPI front > (wouldn't be an issue in your case, because you're talking about > sharing helpers, not the driver frontend), but also because any refactor > to panfrost is a potential source of regression for existing users. So, > I tend to agree with Jani here, trying to share code before things have > settled down is likely to cause pain to both Xe and i915 > users+developers. ++ I pretend to have never written a kernel driver, so will not comment there. But Boris and I were previously bit trying to share code between our GL and VK drivers, before VK settled down, causing pain for both. I don't want a kernelside repeat of that (for either Mali or Intel). I tend to think that, if you're tempted to share a driver frontend without the backend, that's a sign that there's too much boilerplate for the frontend and maybe there needs to be more helpers somewhere. For Xe, that doesn't apply since the hw overlaps between the drivers, but for Mali, there really is more different than similar and there's an obvious, acute break between "old Mali" and "new Mali". The shared "instantiate a DRM driver boilerplate" is pretty trivial, and the MMU code is as simple as it gets...
On Tue, Jan 03, 2023 at 12:21:08PM +0000, Tvrtko Ursulin wrote: > > On 22/12/2022 22:21, Matthew Brost wrote: > > Hello, > > > > This is a submission for Xe, a new driver for Intel GPUs that supports both > > integrated and discrete platforms starting with Tiger Lake (first platform with > > Intel Xe Architecture). The intention of this new driver is to have a fresh base > > to work from that is unencumbered by older platforms, whilst also taking the > > opportunity to rearchitect our driver to increase sharing across the drm > > subsystem, both leveraging and allowing us to contribute more towards other > > shared components like TTM and drm/scheduler. The memory model is based on VM > > bind which is similar to the i915 implementation. Likewise the execbuf > > implementation for Xe is very similar to execbuf3 in the i915 [1]. > > > > The code is at a stage where it is already functional and has experimental > > support for multiple platforms starting from Tiger Lake, with initial support > > implemented in Mesa (for Iris and Anv, our OpenGL and Vulkan drivers), as well > > as in NEO (for OpenCL and Level0). A Mesa MR has been posted [2] and NEO > > implementation will be released publicly early next year. We also have a suite > > of IGTs for XE that will appear on the IGT list shortly. > > > > It has been built with the assumption of supporting multiple architectures from > > the get-go, right now with tests running both on X86 and ARM hosts. And we > > intend to continue working on it and improving on it as part of the kernel > > community upstream. > > > > The new Xe driver leverages a lot from i915 and work on i915 continues as we > > ready Xe for production throughout 2023. > > > > As for display, the intent is to share the display code with the i915 driver so > > that there is maximum reuse there. Currently this is being done by compiling the > > display code twice, but alternatives to that are under consideration and we want > > to have more discussion on what the best final solution will look like over the > > next few months. Right now, work is ongoing in refactoring the display codebase > > to remove as much as possible any unnecessary dependencies on i915 specific data > > structures there.. > > > > We currently have 2 submission backends, execlists and GuC. The execlist is > > meant mostly for testing and is not fully functional while GuC backend is fully > > functional. As with the i915 and GuC submission, in Xe the GuC firmware is > > required and should be placed in /lib/firmware/xe. > > What is the plan going forward for the execlists backend? I think it would > be preferable to not upstream something semi-functional and so to carry > technical debt in the brand new code base, from the very start. If it is for > Tigerlake, which is the starting platform for Xe, could it be made GuC only > Tigerlake for instance? > A little background here. In the original PoC written by Jason and Dave, the execlist backend was the only one present and it was in semi-working state. As soon as myself and a few others started working on Xe we went full in a on the GuC backend. We left the execlist backend basically in the state it was in. We left it in place for 2 reasons. 1. Having 2 backends from the start ensured we layered our code correctly. The layer was a complete disaster in the i915 so we really wanted to avoid that. 2. The thought was it might be needed for early product bring up one day. As I think about this a bit more, we likely just delete execlist backend before merging this upstream and perhaps just carry 1 large patch internally with this implementation that we can use as needed. Final decession TDB though. Matt > Regards, > > Tvrtko
+Frank, who's also working on the pvr uAPI. Hi, On Thu, 22 Dec 2022 14:21:07 -0800 Matthew Brost <matthew.brost@intel.com> wrote: > The code has been organized such that we have all patches that touch areas > outside of drm/xe first for review, and then the actual new driver in a separate > commit. The code which is outside of drm/xe is included in this RFC while > drm/xe is not due to the size of the commit. The drm/xe is code is available in > a public repo listed below. > > Xe driver commit: > https://cgit.freedesktop.org/drm/drm-xe/commit/?h=drm-xe-next&id=9cb016ebbb6a275f57b1cb512b95d5a842391ad7 > > Xe kernel repo: > https://cgit.freedesktop.org/drm/drm-xe/ Sorry to hijack this thread, again, but I'm currently working on the pancsf uAPI, and I was wondering how DRM maintainers/developers felt about the new direction taken by the Xe driver on some aspects of their uAPI (to decide if I should copy these patterns or go the old way): - plan for ioctl extensions through '__u64 extensions;' fields (the vulkan way, basically) - turning the GETPARAM in DEV_QUERY which can return more than a 64-bit integer at a time - having ioctls taking sub-operations instead of one ioctl per operation (I'm referring to VM_BIND here, which handles map, unmap, restart, ... through a single entry point) Regards, Boris
On Thu, Jan 05, 2023 at 09:27:57PM +0000, Matthew Brost wrote: >On Tue, Jan 03, 2023 at 12:21:08PM +0000, Tvrtko Ursulin wrote: >> >> On 22/12/2022 22:21, Matthew Brost wrote: >> > Hello, >> > >> > This is a submission for Xe, a new driver for Intel GPUs that supports both >> > integrated and discrete platforms starting with Tiger Lake (first platform with >> > Intel Xe Architecture). The intention of this new driver is to have a fresh base >> > to work from that is unencumbered by older platforms, whilst also taking the >> > opportunity to rearchitect our driver to increase sharing across the drm >> > subsystem, both leveraging and allowing us to contribute more towards other >> > shared components like TTM and drm/scheduler. The memory model is based on VM >> > bind which is similar to the i915 implementation. Likewise the execbuf >> > implementation for Xe is very similar to execbuf3 in the i915 [1]. >> > >> > The code is at a stage where it is already functional and has experimental >> > support for multiple platforms starting from Tiger Lake, with initial support >> > implemented in Mesa (for Iris and Anv, our OpenGL and Vulkan drivers), as well >> > as in NEO (for OpenCL and Level0). A Mesa MR has been posted [2] and NEO >> > implementation will be released publicly early next year. We also have a suite >> > of IGTs for XE that will appear on the IGT list shortly. >> > >> > It has been built with the assumption of supporting multiple architectures from >> > the get-go, right now with tests running both on X86 and ARM hosts. And we >> > intend to continue working on it and improving on it as part of the kernel >> > community upstream. >> > >> > The new Xe driver leverages a lot from i915 and work on i915 continues as we >> > ready Xe for production throughout 2023. >> > >> > As for display, the intent is to share the display code with the i915 driver so >> > that there is maximum reuse there. Currently this is being done by compiling the >> > display code twice, but alternatives to that are under consideration and we want >> > to have more discussion on what the best final solution will look like over the >> > next few months. Right now, work is ongoing in refactoring the display codebase >> > to remove as much as possible any unnecessary dependencies on i915 specific data >> > structures there.. >> > >> > We currently have 2 submission backends, execlists and GuC. The execlist is >> > meant mostly for testing and is not fully functional while GuC backend is fully >> > functional. As with the i915 and GuC submission, in Xe the GuC firmware is >> > required and should be placed in /lib/firmware/xe. >> >> What is the plan going forward for the execlists backend? I think it would >> be preferable to not upstream something semi-functional and so to carry >> technical debt in the brand new code base, from the very start. If it is for >> Tigerlake, which is the starting platform for Xe, could it be made GuC only >> Tigerlake for instance? >> > >A little background here. In the original PoC written by Jason and Dave, >the execlist backend was the only one present and it was in semi-working >state. As soon as myself and a few others started working on Xe we went >full in a on the GuC backend. We left the execlist backend basically in >the state it was in. We left it in place for 2 reasons. > >1. Having 2 backends from the start ensured we layered our code >correctly. The layer was a complete disaster in the i915 so we really >wanted to avoid that. >2. The thought was it might be needed for early product bring up one >day. > >As I think about this a bit more, we likely just delete execlist backend >before merging this upstream and perhaps just carry 1 large patch >internally with this implementation that we can use as needed. Final >decession TDB though. but that might regress after some time on "let's keep 2 backends so we layer the code correctly". Leaving the additional backend behind CONFIG_BROKEN or XE_EXPERIMENTAL, or something like that, not enabled by distros, but enabled in CI would be a good idea IMO. Carrying a large patch out of tree would make things harder for new platforms. A perfect backend split would make it possible, but like I said, we are likely not to have it if we delete the second backend. Lucas De Marchi > >Matt > >> Regards, >> >> Tvrtko
On Thu, Jan 12, 2023 at 10:54:25AM +0100, Lucas De Marchi wrote: > On Thu, Jan 05, 2023 at 09:27:57PM +0000, Matthew Brost wrote: > > On Tue, Jan 03, 2023 at 12:21:08PM +0000, Tvrtko Ursulin wrote: > > > > > > On 22/12/2022 22:21, Matthew Brost wrote: > > > > Hello, > > > > > > > > This is a submission for Xe, a new driver for Intel GPUs that supports both > > > > integrated and discrete platforms starting with Tiger Lake (first platform with > > > > Intel Xe Architecture). The intention of this new driver is to have a fresh base > > > > to work from that is unencumbered by older platforms, whilst also taking the > > > > opportunity to rearchitect our driver to increase sharing across the drm > > > > subsystem, both leveraging and allowing us to contribute more towards other > > > > shared components like TTM and drm/scheduler. The memory model is based on VM > > > > bind which is similar to the i915 implementation. Likewise the execbuf > > > > implementation for Xe is very similar to execbuf3 in the i915 [1]. > > > > > > > > The code is at a stage where it is already functional and has experimental > > > > support for multiple platforms starting from Tiger Lake, with initial support > > > > implemented in Mesa (for Iris and Anv, our OpenGL and Vulkan drivers), as well > > > > as in NEO (for OpenCL and Level0). A Mesa MR has been posted [2] and NEO > > > > implementation will be released publicly early next year. We also have a suite > > > > of IGTs for XE that will appear on the IGT list shortly. > > > > > > > > It has been built with the assumption of supporting multiple architectures from > > > > the get-go, right now with tests running both on X86 and ARM hosts. And we > > > > intend to continue working on it and improving on it as part of the kernel > > > > community upstream. > > > > > > > > The new Xe driver leverages a lot from i915 and work on i915 continues as we > > > > ready Xe for production throughout 2023. > > > > > > > > As for display, the intent is to share the display code with the i915 driver so > > > > that there is maximum reuse there. Currently this is being done by compiling the > > > > display code twice, but alternatives to that are under consideration and we want > > > > to have more discussion on what the best final solution will look like over the > > > > next few months. Right now, work is ongoing in refactoring the display codebase > > > > to remove as much as possible any unnecessary dependencies on i915 specific data > > > > structures there.. > > > > > > > > We currently have 2 submission backends, execlists and GuC. The execlist is > > > > meant mostly for testing and is not fully functional while GuC backend is fully > > > > functional. As with the i915 and GuC submission, in Xe the GuC firmware is > > > > required and should be placed in /lib/firmware/xe. > > > > > > What is the plan going forward for the execlists backend? I think it would > > > be preferable to not upstream something semi-functional and so to carry > > > technical debt in the brand new code base, from the very start. If it is for > > > Tigerlake, which is the starting platform for Xe, could it be made GuC only > > > Tigerlake for instance? > > > > > > > A little background here. In the original PoC written by Jason and Dave, > > the execlist backend was the only one present and it was in semi-working > > state. As soon as myself and a few others started working on Xe we went > > full in a on the GuC backend. We left the execlist backend basically in > > the state it was in. We left it in place for 2 reasons. > > > > 1. Having 2 backends from the start ensured we layered our code > > correctly. The layer was a complete disaster in the i915 so we really > > wanted to avoid that. > > 2. The thought was it might be needed for early product bring up one > > day. > > > > As I think about this a bit more, we likely just delete execlist backend > > before merging this upstream and perhaps just carry 1 large patch > > internally with this implementation that we can use as needed. Final > > decession TDB though. > > but that might regress after some time on "let's keep 2 backends so we > layer the code correctly". Leaving the additional backend behind > CONFIG_BROKEN or XE_EXPERIMENTAL, or something like that, not > enabled by distros, but enabled in CI would be a good idea IMO. > > Carrying a large patch out of tree would make things harder for new > platforms. A perfect backend split would make it possible, but like I > said, we are likely not to have it if we delete the second backend. > Good points here Lucas. One thing that we absolutely have wrong is falling back to execlists if GuC firmware is missing. We def should not be doing that as it creates confusion. I kinda like the idea hiding it behind a config option + module parameter to use the backend so you really, really need to try to be able to use it + with this in the code it make us disciplined in our layering. At some point we will likely another supported backend and at that point we may decide to delete this backend. Matt > Lucas De Marchi > > > > > Matt > > > > > Regards, > > > > > > Tvrtko
On Thu, Dec 22, 2022 at 4:29 PM Matthew Brost <matthew.brost@intel.com> wrote: > Hello, > > This is a submission for Xe, a new driver for Intel GPUs that supports both > integrated and discrete platforms starting with Tiger Lake (first platform > with > Intel Xe Architecture). The intention of this new driver is to have a > fresh base > to work from that is unencumbered by older platforms, whilst also taking > the > opportunity to rearchitect our driver to increase sharing across the drm > subsystem, both leveraging and allowing us to contribute more towards other > shared components like TTM and drm/scheduler. The memory model is based on > VM > bind which is similar to the i915 implementation. Likewise the execbuf > implementation for Xe is very similar to execbuf3 in the i915 [1]. > > The code is at a stage where it is already functional and has experimental > support for multiple platforms starting from Tiger Lake, with initial > support > implemented in Mesa (for Iris and Anv, our OpenGL and Vulkan drivers), as > well > as in NEO (for OpenCL and Level0). A Mesa MR has been posted [2] and NEO > implementation will be released publicly early next year. We also have a > suite > of IGTs for XE that will appear on the IGT list shortly. > > It has been built with the assumption of supporting multiple architectures > from > the get-go, right now with tests running both on X86 and ARM hosts. And we > intend to continue working on it and improving on it as part of the kernel > community upstream. > > The new Xe driver leverages a lot from i915 and work on i915 continues as > we > ready Xe for production throughout 2023. > > As for display, the intent is to share the display code with the i915 > driver so > that there is maximum reuse there. Currently this is being done by > compiling the > display code twice, but alternatives to that are under consideration and > we want > to have more discussion on what the best final solution will look like > over the > next few months. Right now, work is ongoing in refactoring the display > codebase > to remove as much as possible any unnecessary dependencies on i915 > specific data > structures there.. > > We currently have 2 submission backends, execlists and GuC. The execlist is > meant mostly for testing and is not fully functional while GuC backend is > fully > functional. As with the i915 and GuC submission, in Xe the GuC firmware is > required and should be placed in /lib/firmware/xe. > > The GuC firmware can be found in the below location: > > https://git.kernel.org/pub/scm/linux/kernel/git/firmware/linux-firmware.git/tree/i915 > > The easiest way to setup firmware is: > cp -r /lib/firmware/i915 /lib/firmware/xe > > The code has been organized such that we have all patches that touch areas > outside of drm/xe first for review, and then the actual new driver in a > separate > commit. The code which is outside of drm/xe is included in this RFC while > drm/xe is not due to the size of the commit. The drm/xe is code is > available in > a public repo listed below. > > Xe driver commit: > > https://cgit.freedesktop.org/drm/drm-xe/commit/?h=drm-xe-next&id=9cb016ebbb6a275f57b1cb512b95d5a842391ad7 Drive-by comment here because I don't see any actual xe patches on the list: You probably want to drop DRM_XE_SYNC_DMA_BUF from the uAPI. Now that we've landed the new dma-buf ioctls for sync_file import/export, there's really no reason to have it as part of submit. Dropping it should also make locking a tiny bit easier. --Jason > Xe kernel repo: > https://cgit.freedesktop.org/drm/drm-xe/ > > There's a lot of work still to happen on Xe but we're very excited about > it and > wanted to share it early and welcome feedback and discussion. > > Cheers, > Matthew Brost > > [1] https://patchwork.freedesktop.org/series/105879/ > [2] https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20418 > > Maarten Lankhorst (12): > drm/amd: Convert amdgpu to use suballocation helper. > drm/radeon: Use the drm suballocation manager implementation. > drm/i915: Remove gem and overlay frontbuffer tracking > drm/i915/display: Neuter frontbuffer tracking harder > drm/i915/display: Add more macros to remove all direct calls to uncore > drm/i915/display: Remove all uncore mmio accesses in favor of intel_de > drm/i915: Rename find_section to find_bdb_section > drm/i915/regs: Set DISPLAY_MMIO_BASE to 0 for xe > drm/i915/display: Fix a use-after-free when intel_edp_init_connector > fails > drm/i915/display: Remaining changes to make xe compile > sound/hda: Allow XE as i915 replacement for sound > mei/hdcp: Also enable for XE > > Matthew Brost (5): > drm/sched: Convert drm scheduler to use a work queue rather than > kthread > drm/sched: Add generic scheduler message interface > drm/sched: Start run wq before TDR in drm_sched_start > drm/sched: Submit job before starting TDR > drm/sched: Add helper to set TDR timeout > > Thomas Hellström (3): > drm/suballoc: Introduce a generic suballocation manager > drm: Add a gpu page-table walker helper > drm/ttm: Don't print error message if eviction was interrupted > > drivers/gpu/drm/Kconfig | 5 + > drivers/gpu/drm/Makefile | 4 + > drivers/gpu/drm/amd/amdgpu/Kconfig | 1 + > drivers/gpu/drm/amd/amdgpu/amdgpu.h | 26 +- > drivers/gpu/drm/amd/amdgpu/amdgpu_debugfs.c | 14 +- > drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 12 +- > drivers/gpu/drm/amd/amdgpu/amdgpu_ib.c | 5 +- > drivers/gpu/drm/amd/amdgpu/amdgpu_object.h | 23 +- > drivers/gpu/drm/amd/amdgpu/amdgpu_ring.h | 3 +- > drivers/gpu/drm/amd/amdgpu/amdgpu_sa.c | 320 +----------------- > drivers/gpu/drm/drm_pt_walk.c | 159 +++++++++ > drivers/gpu/drm/drm_suballoc.c | 301 ++++++++++++++++ > drivers/gpu/drm/i915/Makefile | 2 +- > drivers/gpu/drm/i915/display/hsw_ips.c | 7 +- > drivers/gpu/drm/i915/display/i9xx_plane.c | 1 + > drivers/gpu/drm/i915/display/intel_atomic.c | 2 + > .../gpu/drm/i915/display/intel_atomic_plane.c | 25 +- > .../gpu/drm/i915/display/intel_backlight.c | 2 +- > drivers/gpu/drm/i915/display/intel_bios.c | 71 ++-- > drivers/gpu/drm/i915/display/intel_bw.c | 36 +- > drivers/gpu/drm/i915/display/intel_cdclk.c | 68 ++-- > drivers/gpu/drm/i915/display/intel_color.c | 1 + > drivers/gpu/drm/i915/display/intel_crtc.c | 14 +- > drivers/gpu/drm/i915/display/intel_cursor.c | 14 +- > drivers/gpu/drm/i915/display/intel_de.h | 38 +++ > drivers/gpu/drm/i915/display/intel_display.c | 155 +++++++-- > drivers/gpu/drm/i915/display/intel_display.h | 9 +- > .../gpu/drm/i915/display/intel_display_core.h | 5 +- > .../drm/i915/display/intel_display_debugfs.c | 8 + > .../drm/i915/display/intel_display_power.c | 40 ++- > .../drm/i915/display/intel_display_power.h | 6 + > .../i915/display/intel_display_power_map.c | 7 + > .../i915/display/intel_display_power_well.c | 24 +- > .../drm/i915/display/intel_display_reg_defs.h | 4 + > .../drm/i915/display/intel_display_trace.h | 6 + > .../drm/i915/display/intel_display_types.h | 32 +- > drivers/gpu/drm/i915/display/intel_dmc.c | 17 +- > drivers/gpu/drm/i915/display/intel_dp.c | 11 +- > drivers/gpu/drm/i915/display/intel_dp_aux.c | 6 + > drivers/gpu/drm/i915/display/intel_dpio_phy.c | 9 +- > drivers/gpu/drm/i915/display/intel_dpio_phy.h | 15 + > drivers/gpu/drm/i915/display/intel_dpll.c | 8 +- > drivers/gpu/drm/i915/display/intel_dpll_mgr.c | 4 + > drivers/gpu/drm/i915/display/intel_drrs.c | 1 + > drivers/gpu/drm/i915/display/intel_dsb.c | 124 +++++-- > drivers/gpu/drm/i915/display/intel_dsi_vbt.c | 26 +- > drivers/gpu/drm/i915/display/intel_fb.c | 108 ++++-- > drivers/gpu/drm/i915/display/intel_fb_pin.c | 6 - > drivers/gpu/drm/i915/display/intel_fbc.c | 49 ++- > drivers/gpu/drm/i915/display/intel_fbdev.c | 108 +++++- > .../gpu/drm/i915/display/intel_frontbuffer.c | 103 +----- > .../gpu/drm/i915/display/intel_frontbuffer.h | 67 +--- > drivers/gpu/drm/i915/display/intel_gmbus.c | 2 +- > drivers/gpu/drm/i915/display/intel_hdcp.c | 9 +- > drivers/gpu/drm/i915/display/intel_hdmi.c | 1 - > .../gpu/drm/i915/display/intel_lpe_audio.h | 8 + > .../drm/i915/display/intel_modeset_setup.c | 11 +- > drivers/gpu/drm/i915/display/intel_opregion.c | 2 +- > drivers/gpu/drm/i915/display/intel_overlay.c | 14 - > .../gpu/drm/i915/display/intel_pch_display.h | 16 + > .../gpu/drm/i915/display/intel_pch_refclk.h | 8 + > drivers/gpu/drm/i915/display/intel_pipe_crc.c | 1 + > .../drm/i915/display/intel_plane_initial.c | 3 +- > drivers/gpu/drm/i915/display/intel_psr.c | 1 + > drivers/gpu/drm/i915/display/intel_sprite.c | 21 ++ > drivers/gpu/drm/i915/display/intel_vbt_defs.h | 2 +- > drivers/gpu/drm/i915/display/intel_vga.c | 5 + > drivers/gpu/drm/i915/display/skl_scaler.c | 2 + > .../drm/i915/display/skl_universal_plane.c | 52 ++- > drivers/gpu/drm/i915/display/skl_watermark.c | 25 +- > drivers/gpu/drm/i915/gem/i915_gem_clflush.c | 4 - > drivers/gpu/drm/i915/gem/i915_gem_domain.c | 7 - > .../gpu/drm/i915/gem/i915_gem_execbuffer.c | 2 - > drivers/gpu/drm/i915/gem/i915_gem_object.c | 25 -- > drivers/gpu/drm/i915/gem/i915_gem_object.h | 22 -- > drivers/gpu/drm/i915/gem/i915_gem_phys.c | 4 - > drivers/gpu/drm/i915/gt/intel_gt_regs.h | 3 +- > drivers/gpu/drm/i915/i915_driver.c | 1 + > drivers/gpu/drm/i915/i915_gem.c | 8 - > drivers/gpu/drm/i915/i915_gem_gtt.c | 1 - > drivers/gpu/drm/i915/i915_reg_defs.h | 8 + > drivers/gpu/drm/i915/i915_vma.c | 12 - > drivers/gpu/drm/radeon/radeon.h | 55 +-- > drivers/gpu/drm/radeon/radeon_ib.c | 12 +- > drivers/gpu/drm/radeon/radeon_object.h | 25 +- > drivers/gpu/drm/radeon/radeon_sa.c | 314 ++--------------- > drivers/gpu/drm/radeon/radeon_semaphore.c | 6 +- > drivers/gpu/drm/scheduler/sched_main.c | 182 +++++++--- > drivers/gpu/drm/ttm/ttm_bo.c | 3 +- > drivers/misc/mei/hdcp/Kconfig | 2 +- > drivers/misc/mei/hdcp/mei_hdcp.c | 3 +- > include/drm/drm_pt_walk.h | 161 +++++++++ > include/drm/drm_suballoc.h | 112 ++++++ > include/drm/gpu_scheduler.h | 41 ++- > sound/hda/hdac_i915.c | 17 +- > sound/pci/hda/hda_intel.c | 56 +-- > sound/soc/intel/avs/core.c | 13 +- > sound/soc/sof/intel/hda.c | 7 +- > 98 files changed, 2076 insertions(+), 1325 deletions(-) > create mode 100644 drivers/gpu/drm/drm_pt_walk.c > create mode 100644 drivers/gpu/drm/drm_suballoc.c > create mode 100644 include/drm/drm_pt_walk.h > create mode 100644 include/drm/drm_suballoc.h > > -- > 2.37.3 > >
On Thu, Jan 12, 2023 at 11:17 AM Matthew Brost <matthew.brost@intel.com> wrote: > On Thu, Jan 12, 2023 at 10:54:25AM +0100, Lucas De Marchi wrote: > > On Thu, Jan 05, 2023 at 09:27:57PM +0000, Matthew Brost wrote: > > > On Tue, Jan 03, 2023 at 12:21:08PM +0000, Tvrtko Ursulin wrote: > > > > > > > > On 22/12/2022 22:21, Matthew Brost wrote: > > > > > Hello, > > > > > > > > > > This is a submission for Xe, a new driver for Intel GPUs that > supports both > > > > > integrated and discrete platforms starting with Tiger Lake (first > platform with > > > > > Intel Xe Architecture). The intention of this new driver is to > have a fresh base > > > > > to work from that is unencumbered by older platforms, whilst also > taking the > > > > > opportunity to rearchitect our driver to increase sharing across > the drm > > > > > subsystem, both leveraging and allowing us to contribute more > towards other > > > > > shared components like TTM and drm/scheduler. The memory model is > based on VM > > > > > bind which is similar to the i915 implementation. Likewise the > execbuf > > > > > implementation for Xe is very similar to execbuf3 in the i915 [1]. > > > > > > > > > > The code is at a stage where it is already functional and has > experimental > > > > > support for multiple platforms starting from Tiger Lake, with > initial support > > > > > implemented in Mesa (for Iris and Anv, our OpenGL and Vulkan > drivers), as well > > > > > as in NEO (for OpenCL and Level0). A Mesa MR has been posted [2] > and NEO > > > > > implementation will be released publicly early next year. We also > have a suite > > > > > of IGTs for XE that will appear on the IGT list shortly. > > > > > > > > > > It has been built with the assumption of supporting multiple > architectures from > > > > > the get-go, right now with tests running both on X86 and ARM > hosts. And we > > > > > intend to continue working on it and improving on it as part of > the kernel > > > > > community upstream. > > > > > > > > > > The new Xe driver leverages a lot from i915 and work on i915 > continues as we > > > > > ready Xe for production throughout 2023. > > > > > > > > > > As for display, the intent is to share the display code with the > i915 driver so > > > > > that there is maximum reuse there. Currently this is being done by > compiling the > > > > > display code twice, but alternatives to that are under > consideration and we want > > > > > to have more discussion on what the best final solution will look > like over the > > > > > next few months. Right now, work is ongoing in refactoring the > display codebase > > > > > to remove as much as possible any unnecessary dependencies on i915 > specific data > > > > > structures there.. > > > > > > > > > > We currently have 2 submission backends, execlists and GuC. The > execlist is > > > > > meant mostly for testing and is not fully functional while GuC > backend is fully > > > > > functional. As with the i915 and GuC submission, in Xe the GuC > firmware is > > > > > required and should be placed in /lib/firmware/xe. > > > > > > > > What is the plan going forward for the execlists backend? I think it > would > > > > be preferable to not upstream something semi-functional and so to > carry > > > > technical debt in the brand new code base, from the very start. If > it is for > > > > Tigerlake, which is the starting platform for Xe, could it be made > GuC only > > > > Tigerlake for instance? > > > > > > > > > > A little background here. In the original PoC written by Jason and > Dave, > > > the execlist backend was the only one present and it was in > semi-working > > > state. As soon as myself and a few others started working on Xe we went > > > full in a on the GuC backend. We left the execlist backend basically in > > > the state it was in. We left it in place for 2 reasons. > > > > > > 1. Having 2 backends from the start ensured we layered our code > > > correctly. The layer was a complete disaster in the i915 so we really > > > wanted to avoid that. > > > 2. The thought was it might be needed for early product bring up one > > > day. > > > > > > As I think about this a bit more, we likely just delete execlist > backend > > > before merging this upstream and perhaps just carry 1 large patch > > > internally with this implementation that we can use as needed. Final > > > decession TDB though. > > > > but that might regress after some time on "let's keep 2 backends so we > > layer the code correctly". Leaving the additional backend behind > > CONFIG_BROKEN or XE_EXPERIMENTAL, or something like that, not > > enabled by distros, but enabled in CI would be a good idea IMO. > > > > Carrying a large patch out of tree would make things harder for new > > platforms. A perfect backend split would make it possible, but like I > > said, we are likely not to have it if we delete the second backend. > > > > Good points here Lucas. One thing that we absolutely have wrong is > falling back to execlists if GuC firmware is missing. We def should not > be doing that as it creates confusion. > Yeah, we certainly shouldn't be falling back on it silently. That's a recipe for disaster. If it stays, it should be behind a config option that's clearly labeled as broken or not intended for production use. If someone is a zero-firmware purist and wants to enable it and accept the brokenness, that's their choice. I'm not especially attached to the execlist back-end so I'm not going to insist on anything here RE keeping it. There is more to me starting with execlists than avoiding GuC, though. One of the reasons I did it was to prove that the same core Xe scheduling model [3] doesn't depend on firmware. As long as your hardware has some ability to juggle independent per-context rings, you can get the same separation and it makes everything cleaner. If this is the direction things are headed (and I really think it is; I need to blog about it), being able to do the Xe model on more primitive hardware which lacks competent firmware-based submission is important. I wanted to prototype that to show that it could be done. I also kinda wanted to prove that execlists didn't have to be horrible like in i915. You know, for funzies.... --Jason [3]: https://lists.freedesktop.org/archives/dri-devel/2023-January/386381.html > I kinda like the idea hiding it behind a config option + module > parameter to use the backend so you really, really need to try to be > able to use it + with this in the code it make us disciplined in our > layering. At some point we will likely another supported backend and at > that point we may decide to delete this backend. > > Matt > > > Lucas De Marchi > > > > > > > > Matt > > > > > > > Regards, > > > > > > > > Tvrtko >
Hi all, [I thought I've sent this out earlier this week, but alas got stuck, kinda bad timing now since I'm out next week but oh well] So xe is a quite substantial thing, and I think we need a clear plan how to land this or it will take forever, and managers will panic. Also I'm not a big fan of "Dave/me reviews everything", we defacto had that for amd's dc/dal and it was not fun. The idea here is how to get everything reviewed without having two people end up somewhat arbitrary as deciders. I've compiled a bunch of topics on what I think the important areas are, first code that should be consistent about new-style render drivers that are aimed for vk/compute userspace as the primary feature driver: - figure out consensus solution for fw scheduler and drm/sched frontend among interested driver parties (probably xe, amdgpu, nouveau, new panfrost) - for the interface itself it might be good to have the drm_gpu_scheduler as the single per-hw-engine driver api object (but internally a new structure), while renaming the current drm_gpu_scheduler to drm_gpu_sched_internal. That way I think we can address the main critique of the current xe scheduler plan - keep the drm_gpu_sched_internal : drm_sched_entity 1:1 relationship for fw scheduler - keep the driver api relationship of drm_gpu_scheduler : drm_sched_entity 1:n, the api functions simply iterate over a mutex protect list of internal schedulers. this should also help drivers with locking mistakes around setup/teardown and gpu reset. - drivers select with a flag or something between the current mode (where the drm_gpu_sched_internal is attached to the drm_gpu_scheduler api object) or the new fw scheduler mode (where drm_gpu_sched_internal is attached to the drm_sched_entity) - overall still no fundamental changes (like the current patches) to drm/sched data structures and algorithms. But unlike the current patches we keep the possibility open for eventual refactoring without having to again refactor all the drivers. Even better, we can delay such refactoring until we have a handful of real-word drivers test-driving this all so we know we actually do the right thing. This should allow us to address all the fairness/efficiency/whatever concerns that have been floating around without having to fix them all up upfront, before we actually know what needs to be fixed. - the generic scheduler code should also including the handling of endless compute contexts, with the minimal scaffolding for preempt-ctx fences (probably on the drm_sched_entity) and making sure drm/sched can cope with the lack of job completion fence. This is very minimal amounts of code, but it helps a lot for cross-driver review if this works the same (with the same locking and all that) for everyone. Ideally this gets extracted from amdkfd, but as long as it's going to be used by all drivers supporting endless/compute context going forward it's good enough. - I'm assuming this also means Matt Brost will include a patch to add himself as drm/sched reviewer in MAINTAINERS, or at least something like that - adopt the gem_exec/vma helpers. again we probably want consensus here among the same driver projects. I don't care whether these helpers specify the ioctl structs or not, but they absolutely need to enforce the overall locking scheme for all major structs and list (so vm and vma). - we also should have cross-driver consensus on async vm_bind support. I think everyone added in-syncobj support, the real fun is probably more in/out userspace memory fences (and personally I'm still not sure that's a good idea but ... *eh*). I think cross driver consensus on how this should work (ideally with helper support so people don't get it wrong in all the possible ways) would be best. - this also means some userptr integration and some consensus how userptr should work for vm_bind across drivers. I don't think allowing drivers to reinvent that wheel is a bright idea, there's just a bit too much to get wrong here. - for some of these the consensus might land on more/less shared code than what I sketched out above, the important part really is that we have consensus on these. Kinda similar to how the atomic kms infrastructure move a _lot_ more of the code back into drivers, because they really just needed the flexibility to program the hw correctly. Right now we definitely don't have enough shared code, for sure with i915-gem, but we also need to make sure we're not overcorrecting too badly (a bit of overcorrecting generally doesn't hurt). All the above will make sure that the driver overall is in concepts and design aligned with the overall community direction, but I think it'd still be good if someone outside of the intel gpu group reviews the driver code itself. Last time we had a huge driver submission (amd's DC/DAL) this fell on Dave&me, but this time around I think we have a perfect candidate with Oded: - Oded needs/wants to spend some time on ramping up on how drm render drivers work anyway, and xe is probably the best example of a driver that's both supposed to be full-featured, but also doesn't contain an entire display driver on the side. - Oded is in Habana, which is legally part of Intel. Bean counter budget shuffling to make this happen should be possible. - Habana is still fairly distinct entity within Intel, so that is probably the best approach for some independent review, without making the xe team beholden to some non-Intel people. The above should yield some pretty clear road towards landing xe, without any big review fights with Dave/me like we had with amd's DC/DAL, which took a rather long time to land unfortunately :-( These are just my thoughts, let the bikeshed commence! Ideally we put them all into a TODO like we've done for DC/DAL, once we have some consensus. Cheers, Daniel On Thu, Dec 22, 2022 at 02:21:07PM -0800, Matthew Brost wrote: > Hello, > > This is a submission for Xe, a new driver for Intel GPUs that supports both > integrated and discrete platforms starting with Tiger Lake (first platform with > Intel Xe Architecture). The intention of this new driver is to have a fresh base > to work from that is unencumbered by older platforms, whilst also taking the > opportunity to rearchitect our driver to increase sharing across the drm > subsystem, both leveraging and allowing us to contribute more towards other > shared components like TTM and drm/scheduler. The memory model is based on VM > bind which is similar to the i915 implementation. Likewise the execbuf > implementation for Xe is very similar to execbuf3 in the i915 [1]. > > The code is at a stage where it is already functional and has experimental > support for multiple platforms starting from Tiger Lake, with initial support > implemented in Mesa (for Iris and Anv, our OpenGL and Vulkan drivers), as well > as in NEO (for OpenCL and Level0). A Mesa MR has been posted [2] and NEO > implementation will be released publicly early next year. We also have a suite > of IGTs for XE that will appear on the IGT list shortly. > > It has been built with the assumption of supporting multiple architectures from > the get-go, right now with tests running both on X86 and ARM hosts. And we > intend to continue working on it and improving on it as part of the kernel > community upstream. > > The new Xe driver leverages a lot from i915 and work on i915 continues as we > ready Xe for production throughout 2023. > > As for display, the intent is to share the display code with the i915 driver so > that there is maximum reuse there. Currently this is being done by compiling the > display code twice, but alternatives to that are under consideration and we want > to have more discussion on what the best final solution will look like over the > next few months. Right now, work is ongoing in refactoring the display codebase > to remove as much as possible any unnecessary dependencies on i915 specific data > structures there.. > > We currently have 2 submission backends, execlists and GuC. The execlist is > meant mostly for testing and is not fully functional while GuC backend is fully > functional. As with the i915 and GuC submission, in Xe the GuC firmware is > required and should be placed in /lib/firmware/xe. > > The GuC firmware can be found in the below location: > https://git.kernel.org/pub/scm/linux/kernel/git/firmware/linux-firmware.git/tree/i915 > > The easiest way to setup firmware is: > cp -r /lib/firmware/i915 /lib/firmware/xe > > The code has been organized such that we have all patches that touch areas > outside of drm/xe first for review, and then the actual new driver in a separate > commit. The code which is outside of drm/xe is included in this RFC while > drm/xe is not due to the size of the commit. The drm/xe is code is available in > a public repo listed below. > > Xe driver commit: > https://cgit.freedesktop.org/drm/drm-xe/commit/?h=drm-xe-next&id=9cb016ebbb6a275f57b1cb512b95d5a842391ad7 > > Xe kernel repo: > https://cgit.freedesktop.org/drm/drm-xe/ > > There's a lot of work still to happen on Xe but we're very excited about it and > wanted to share it early and welcome feedback and discussion. > > Cheers, > Matthew Brost > > [1] https://patchwork.freedesktop.org/series/105879/ > [2] https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20418 > > Maarten Lankhorst (12): > drm/amd: Convert amdgpu to use suballocation helper. > drm/radeon: Use the drm suballocation manager implementation. > drm/i915: Remove gem and overlay frontbuffer tracking > drm/i915/display: Neuter frontbuffer tracking harder > drm/i915/display: Add more macros to remove all direct calls to uncore > drm/i915/display: Remove all uncore mmio accesses in favor of intel_de > drm/i915: Rename find_section to find_bdb_section > drm/i915/regs: Set DISPLAY_MMIO_BASE to 0 for xe > drm/i915/display: Fix a use-after-free when intel_edp_init_connector > fails > drm/i915/display: Remaining changes to make xe compile > sound/hda: Allow XE as i915 replacement for sound > mei/hdcp: Also enable for XE > > Matthew Brost (5): > drm/sched: Convert drm scheduler to use a work queue rather than > kthread > drm/sched: Add generic scheduler message interface > drm/sched: Start run wq before TDR in drm_sched_start > drm/sched: Submit job before starting TDR > drm/sched: Add helper to set TDR timeout > > Thomas Hellström (3): > drm/suballoc: Introduce a generic suballocation manager > drm: Add a gpu page-table walker helper > drm/ttm: Don't print error message if eviction was interrupted > > drivers/gpu/drm/Kconfig | 5 + > drivers/gpu/drm/Makefile | 4 + > drivers/gpu/drm/amd/amdgpu/Kconfig | 1 + > drivers/gpu/drm/amd/amdgpu/amdgpu.h | 26 +- > drivers/gpu/drm/amd/amdgpu/amdgpu_debugfs.c | 14 +- > drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 12 +- > drivers/gpu/drm/amd/amdgpu/amdgpu_ib.c | 5 +- > drivers/gpu/drm/amd/amdgpu/amdgpu_object.h | 23 +- > drivers/gpu/drm/amd/amdgpu/amdgpu_ring.h | 3 +- > drivers/gpu/drm/amd/amdgpu/amdgpu_sa.c | 320 +----------------- > drivers/gpu/drm/drm_pt_walk.c | 159 +++++++++ > drivers/gpu/drm/drm_suballoc.c | 301 ++++++++++++++++ > drivers/gpu/drm/i915/Makefile | 2 +- > drivers/gpu/drm/i915/display/hsw_ips.c | 7 +- > drivers/gpu/drm/i915/display/i9xx_plane.c | 1 + > drivers/gpu/drm/i915/display/intel_atomic.c | 2 + > .../gpu/drm/i915/display/intel_atomic_plane.c | 25 +- > .../gpu/drm/i915/display/intel_backlight.c | 2 +- > drivers/gpu/drm/i915/display/intel_bios.c | 71 ++-- > drivers/gpu/drm/i915/display/intel_bw.c | 36 +- > drivers/gpu/drm/i915/display/intel_cdclk.c | 68 ++-- > drivers/gpu/drm/i915/display/intel_color.c | 1 + > drivers/gpu/drm/i915/display/intel_crtc.c | 14 +- > drivers/gpu/drm/i915/display/intel_cursor.c | 14 +- > drivers/gpu/drm/i915/display/intel_de.h | 38 +++ > drivers/gpu/drm/i915/display/intel_display.c | 155 +++++++-- > drivers/gpu/drm/i915/display/intel_display.h | 9 +- > .../gpu/drm/i915/display/intel_display_core.h | 5 +- > .../drm/i915/display/intel_display_debugfs.c | 8 + > .../drm/i915/display/intel_display_power.c | 40 ++- > .../drm/i915/display/intel_display_power.h | 6 + > .../i915/display/intel_display_power_map.c | 7 + > .../i915/display/intel_display_power_well.c | 24 +- > .../drm/i915/display/intel_display_reg_defs.h | 4 + > .../drm/i915/display/intel_display_trace.h | 6 + > .../drm/i915/display/intel_display_types.h | 32 +- > drivers/gpu/drm/i915/display/intel_dmc.c | 17 +- > drivers/gpu/drm/i915/display/intel_dp.c | 11 +- > drivers/gpu/drm/i915/display/intel_dp_aux.c | 6 + > drivers/gpu/drm/i915/display/intel_dpio_phy.c | 9 +- > drivers/gpu/drm/i915/display/intel_dpio_phy.h | 15 + > drivers/gpu/drm/i915/display/intel_dpll.c | 8 +- > drivers/gpu/drm/i915/display/intel_dpll_mgr.c | 4 + > drivers/gpu/drm/i915/display/intel_drrs.c | 1 + > drivers/gpu/drm/i915/display/intel_dsb.c | 124 +++++-- > drivers/gpu/drm/i915/display/intel_dsi_vbt.c | 26 +- > drivers/gpu/drm/i915/display/intel_fb.c | 108 ++++-- > drivers/gpu/drm/i915/display/intel_fb_pin.c | 6 - > drivers/gpu/drm/i915/display/intel_fbc.c | 49 ++- > drivers/gpu/drm/i915/display/intel_fbdev.c | 108 +++++- > .../gpu/drm/i915/display/intel_frontbuffer.c | 103 +----- > .../gpu/drm/i915/display/intel_frontbuffer.h | 67 +--- > drivers/gpu/drm/i915/display/intel_gmbus.c | 2 +- > drivers/gpu/drm/i915/display/intel_hdcp.c | 9 +- > drivers/gpu/drm/i915/display/intel_hdmi.c | 1 - > .../gpu/drm/i915/display/intel_lpe_audio.h | 8 + > .../drm/i915/display/intel_modeset_setup.c | 11 +- > drivers/gpu/drm/i915/display/intel_opregion.c | 2 +- > drivers/gpu/drm/i915/display/intel_overlay.c | 14 - > .../gpu/drm/i915/display/intel_pch_display.h | 16 + > .../gpu/drm/i915/display/intel_pch_refclk.h | 8 + > drivers/gpu/drm/i915/display/intel_pipe_crc.c | 1 + > .../drm/i915/display/intel_plane_initial.c | 3 +- > drivers/gpu/drm/i915/display/intel_psr.c | 1 + > drivers/gpu/drm/i915/display/intel_sprite.c | 21 ++ > drivers/gpu/drm/i915/display/intel_vbt_defs.h | 2 +- > drivers/gpu/drm/i915/display/intel_vga.c | 5 + > drivers/gpu/drm/i915/display/skl_scaler.c | 2 + > .../drm/i915/display/skl_universal_plane.c | 52 ++- > drivers/gpu/drm/i915/display/skl_watermark.c | 25 +- > drivers/gpu/drm/i915/gem/i915_gem_clflush.c | 4 - > drivers/gpu/drm/i915/gem/i915_gem_domain.c | 7 - > .../gpu/drm/i915/gem/i915_gem_execbuffer.c | 2 - > drivers/gpu/drm/i915/gem/i915_gem_object.c | 25 -- > drivers/gpu/drm/i915/gem/i915_gem_object.h | 22 -- > drivers/gpu/drm/i915/gem/i915_gem_phys.c | 4 - > drivers/gpu/drm/i915/gt/intel_gt_regs.h | 3 +- > drivers/gpu/drm/i915/i915_driver.c | 1 + > drivers/gpu/drm/i915/i915_gem.c | 8 - > drivers/gpu/drm/i915/i915_gem_gtt.c | 1 - > drivers/gpu/drm/i915/i915_reg_defs.h | 8 + > drivers/gpu/drm/i915/i915_vma.c | 12 - > drivers/gpu/drm/radeon/radeon.h | 55 +-- > drivers/gpu/drm/radeon/radeon_ib.c | 12 +- > drivers/gpu/drm/radeon/radeon_object.h | 25 +- > drivers/gpu/drm/radeon/radeon_sa.c | 314 ++--------------- > drivers/gpu/drm/radeon/radeon_semaphore.c | 6 +- > drivers/gpu/drm/scheduler/sched_main.c | 182 +++++++--- > drivers/gpu/drm/ttm/ttm_bo.c | 3 +- > drivers/misc/mei/hdcp/Kconfig | 2 +- > drivers/misc/mei/hdcp/mei_hdcp.c | 3 +- > include/drm/drm_pt_walk.h | 161 +++++++++ > include/drm/drm_suballoc.h | 112 ++++++ > include/drm/gpu_scheduler.h | 41 ++- > sound/hda/hdac_i915.c | 17 +- > sound/pci/hda/hda_intel.c | 56 +-- > sound/soc/intel/avs/core.c | 13 +- > sound/soc/sof/intel/hda.c | 7 +- > 98 files changed, 2076 insertions(+), 1325 deletions(-) > create mode 100644 drivers/gpu/drm/drm_pt_walk.c > create mode 100644 drivers/gpu/drm/drm_suballoc.c > create mode 100644 include/drm/drm_pt_walk.h > create mode 100644 include/drm/drm_suballoc.h > > -- > 2.37.3 >
On Fri, Feb 17, 2023 at 10:51 PM Daniel Vetter <daniel@ffwll.ch> wrote: > > Hi all, > > [I thought I've sent this out earlier this week, but alas got stuck, kinda > bad timing now since I'm out next week but oh well] > > So xe is a quite substantial thing, and I think we need a clear plan how to land > this or it will take forever, and managers will panic. Also I'm not a big fan of > "Dave/me reviews everything", we defacto had that for amd's dc/dal and it was > not fun. The idea here is how to get everything reviewed without having two > people end up somewhat arbitrary as deciders. > > I've compiled a bunch of topics on what I think the important areas are, first > code that should be consistent about new-style render drivers that are aimed for > vk/compute userspace as the primary feature driver: > > - figure out consensus solution for fw scheduler and drm/sched frontend among > interested driver parties (probably xe, amdgpu, nouveau, new panfrost) > > - for the interface itself it might be good to have the drm_gpu_scheduler as the > single per-hw-engine driver api object (but internally a new structure), while > renaming the current drm_gpu_scheduler to drm_gpu_sched_internal. That way I > think we can address the main critique of the current xe scheduler plan > - keep the drm_gpu_sched_internal : drm_sched_entity 1:1 relationship for fw > scheduler > - keep the driver api relationship of drm_gpu_scheduler : drm_sched_entity > 1:n, the api functions simply iterate over a mutex protect list of internal > schedulers. this should also help drivers with locking mistakes around > setup/teardown and gpu reset. > - drivers select with a flag or something between the current mode (where the > drm_gpu_sched_internal is attached to the drm_gpu_scheduler api object) or > the new fw scheduler mode (where drm_gpu_sched_internal is attached to the > drm_sched_entity) > - overall still no fundamental changes (like the current patches) to drm/sched > data structures and algorithms. But unlike the current patches we keep the > possibility open for eventual refactoring without having to again refactor > all the drivers. Even better, we can delay such refactoring until we have a > handful of real-word drivers test-driving this all so we know we actually do > the right thing. This should allow us to address all the > fairness/efficiency/whatever concerns that have been floating around without > having to fix them all up upfront, before we actually know what needs to be > fixed. > > - the generic scheduler code should also including the handling of endless > compute contexts, with the minimal scaffolding for preempt-ctx fences > (probably on the drm_sched_entity) and making sure drm/sched can cope with the > lack of job completion fence. This is very minimal amounts of code, but it > helps a lot for cross-driver review if this works the same (with the same > locking and all that) for everyone. Ideally this gets extracted from amdkfd, > but as long as it's going to be used by all drivers supporting > endless/compute context going forward it's good enough. > > - I'm assuming this also means Matt Brost will include a patch to add himself as > drm/sched reviewer in MAINTAINERS, or at least something like that > > - adopt the gem_exec/vma helpers. again we probably want consensus here among > the same driver projects. I don't care whether these helpers specify the ioctl > structs or not, but they absolutely need to enforce the overall locking scheme > for all major structs and list (so vm and vma). > > - we also should have cross-driver consensus on async vm_bind support. I think > everyone added in-syncobj support, the real fun is probably more in/out > userspace memory fences (and personally I'm still not sure that's a good idea > but ... *eh*). I think cross driver consensus on how this should work (ideally > with helper support so people don't get it wrong in all the possible ways) > would be best. > > - this also means some userptr integration and some consensus how userptr should > work for vm_bind across drivers. I don't think allowing drivers to reinvent > that wheel is a bright idea, there's just a bit too much to get wrong here. > > - for some of these the consensus might land on more/less shared code than what > I sketched out above, the important part really is that we have consensus on > these. Kinda similar to how the atomic kms infrastructure move a _lot_ more of > the code back into drivers, because they really just needed the flexibility to > program the hw correctly. Right now we definitely don't have enough shared > code, for sure with i915-gem, but we also need to make sure we're not > overcorrecting too badly (a bit of overcorrecting generally doesn't hurt). > > All the above will make sure that the driver overall is in concepts and design > aligned with the overall community direction, but I think it'd still be good if > someone outside of the intel gpu group reviews the driver code itself. Last time > we had a huge driver submission (amd's DC/DAL) this fell on Dave&me, but this > time around I think we have a perfect candidate with Oded: > > - Oded needs/wants to spend some time on ramping up on how drm render drivers > work anyway, and xe is probably the best example of a driver that's both > supposed to be full-featured, but also doesn't contain an entire display > driver on the side. > > - Oded is in Habana, which is legally part of Intel. Bean counter budget > shuffling to make this happen should be possible. > > - Habana is still fairly distinct entity within Intel, so that is probably the > best approach for some independent review, without making the xe team > beholden to some non-Intel people. Hi Daniel, Thanks for suggesting it, I'll gladly do it. I guess I'll have more feedback on the plan itself after I'll start going over the current Xe driver code. Oded > > The above should yield some pretty clear road towards landing xe, without any > big review fights with Dave/me like we had with amd's DC/DAL, which took a > rather long time to land unfortunately :-( > > These are just my thoughts, let the bikeshed commence! > > Ideally we put them all into a TODO like we've done for DC/DAL, once we have > some consensus. > > Cheers, Daniel > > On Thu, Dec 22, 2022 at 02:21:07PM -0800, Matthew Brost wrote: > > Hello, > > > > This is a submission for Xe, a new driver for Intel GPUs that supports both > > integrated and discrete platforms starting with Tiger Lake (first platform with > > Intel Xe Architecture). The intention of this new driver is to have a fresh base > > to work from that is unencumbered by older platforms, whilst also taking the > > opportunity to rearchitect our driver to increase sharing across the drm > > subsystem, both leveraging and allowing us to contribute more towards other > > shared components like TTM and drm/scheduler. The memory model is based on VM > > bind which is similar to the i915 implementation. Likewise the execbuf > > implementation for Xe is very similar to execbuf3 in the i915 [1]. > > > > The code is at a stage where it is already functional and has experimental > > support for multiple platforms starting from Tiger Lake, with initial support > > implemented in Mesa (for Iris and Anv, our OpenGL and Vulkan drivers), as well > > as in NEO (for OpenCL and Level0). A Mesa MR has been posted [2] and NEO > > implementation will be released publicly early next year. We also have a suite > > of IGTs for XE that will appear on the IGT list shortly. > > > > It has been built with the assumption of supporting multiple architectures from > > the get-go, right now with tests running both on X86 and ARM hosts. And we > > intend to continue working on it and improving on it as part of the kernel > > community upstream. > > > > The new Xe driver leverages a lot from i915 and work on i915 continues as we > > ready Xe for production throughout 2023. > > > > As for display, the intent is to share the display code with the i915 driver so > > that there is maximum reuse there. Currently this is being done by compiling the > > display code twice, but alternatives to that are under consideration and we want > > to have more discussion on what the best final solution will look like over the > > next few months. Right now, work is ongoing in refactoring the display codebase > > to remove as much as possible any unnecessary dependencies on i915 specific data > > structures there.. > > > > We currently have 2 submission backends, execlists and GuC. The execlist is > > meant mostly for testing and is not fully functional while GuC backend is fully > > functional. As with the i915 and GuC submission, in Xe the GuC firmware is > > required and should be placed in /lib/firmware/xe. > > > > The GuC firmware can be found in the below location: > > https://git.kernel.org/pub/scm/linux/kernel/git/firmware/linux-firmware.git/tree/i915 > > > > The easiest way to setup firmware is: > > cp -r /lib/firmware/i915 /lib/firmware/xe > > > > The code has been organized such that we have all patches that touch areas > > outside of drm/xe first for review, and then the actual new driver in a separate > > commit. The code which is outside of drm/xe is included in this RFC while > > drm/xe is not due to the size of the commit. The drm/xe is code is available in > > a public repo listed below. > > > > Xe driver commit: > > https://cgit.freedesktop.org/drm/drm-xe/commit/?h=drm-xe-next&id=9cb016ebbb6a275f57b1cb512b95d5a842391ad7 > > > > Xe kernel repo: > > https://cgit.freedesktop.org/drm/drm-xe/ > > > > There's a lot of work still to happen on Xe but we're very excited about it and > > wanted to share it early and welcome feedback and discussion. > > > > Cheers, > > Matthew Brost > > > > [1] https://patchwork.freedesktop.org/series/105879/ > > [2] https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20418 > > > > Maarten Lankhorst (12): > > drm/amd: Convert amdgpu to use suballocation helper. > > drm/radeon: Use the drm suballocation manager implementation. > > drm/i915: Remove gem and overlay frontbuffer tracking > > drm/i915/display: Neuter frontbuffer tracking harder > > drm/i915/display: Add more macros to remove all direct calls to uncore > > drm/i915/display: Remove all uncore mmio accesses in favor of intel_de > > drm/i915: Rename find_section to find_bdb_section > > drm/i915/regs: Set DISPLAY_MMIO_BASE to 0 for xe > > drm/i915/display: Fix a use-after-free when intel_edp_init_connector > > fails > > drm/i915/display: Remaining changes to make xe compile > > sound/hda: Allow XE as i915 replacement for sound > > mei/hdcp: Also enable for XE > > > > Matthew Brost (5): > > drm/sched: Convert drm scheduler to use a work queue rather than > > kthread > > drm/sched: Add generic scheduler message interface > > drm/sched: Start run wq before TDR in drm_sched_start > > drm/sched: Submit job before starting TDR > > drm/sched: Add helper to set TDR timeout > > > > Thomas Hellström (3): > > drm/suballoc: Introduce a generic suballocation manager > > drm: Add a gpu page-table walker helper > > drm/ttm: Don't print error message if eviction was interrupted > > > > drivers/gpu/drm/Kconfig | 5 + > > drivers/gpu/drm/Makefile | 4 + > > drivers/gpu/drm/amd/amdgpu/Kconfig | 1 + > > drivers/gpu/drm/amd/amdgpu/amdgpu.h | 26 +- > > drivers/gpu/drm/amd/amdgpu/amdgpu_debugfs.c | 14 +- > > drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 12 +- > > drivers/gpu/drm/amd/amdgpu/amdgpu_ib.c | 5 +- > > drivers/gpu/drm/amd/amdgpu/amdgpu_object.h | 23 +- > > drivers/gpu/drm/amd/amdgpu/amdgpu_ring.h | 3 +- > > drivers/gpu/drm/amd/amdgpu/amdgpu_sa.c | 320 +----------------- > > drivers/gpu/drm/drm_pt_walk.c | 159 +++++++++ > > drivers/gpu/drm/drm_suballoc.c | 301 ++++++++++++++++ > > drivers/gpu/drm/i915/Makefile | 2 +- > > drivers/gpu/drm/i915/display/hsw_ips.c | 7 +- > > drivers/gpu/drm/i915/display/i9xx_plane.c | 1 + > > drivers/gpu/drm/i915/display/intel_atomic.c | 2 + > > .../gpu/drm/i915/display/intel_atomic_plane.c | 25 +- > > .../gpu/drm/i915/display/intel_backlight.c | 2 +- > > drivers/gpu/drm/i915/display/intel_bios.c | 71 ++-- > > drivers/gpu/drm/i915/display/intel_bw.c | 36 +- > > drivers/gpu/drm/i915/display/intel_cdclk.c | 68 ++-- > > drivers/gpu/drm/i915/display/intel_color.c | 1 + > > drivers/gpu/drm/i915/display/intel_crtc.c | 14 +- > > drivers/gpu/drm/i915/display/intel_cursor.c | 14 +- > > drivers/gpu/drm/i915/display/intel_de.h | 38 +++ > > drivers/gpu/drm/i915/display/intel_display.c | 155 +++++++-- > > drivers/gpu/drm/i915/display/intel_display.h | 9 +- > > .../gpu/drm/i915/display/intel_display_core.h | 5 +- > > .../drm/i915/display/intel_display_debugfs.c | 8 + > > .../drm/i915/display/intel_display_power.c | 40 ++- > > .../drm/i915/display/intel_display_power.h | 6 + > > .../i915/display/intel_display_power_map.c | 7 + > > .../i915/display/intel_display_power_well.c | 24 +- > > .../drm/i915/display/intel_display_reg_defs.h | 4 + > > .../drm/i915/display/intel_display_trace.h | 6 + > > .../drm/i915/display/intel_display_types.h | 32 +- > > drivers/gpu/drm/i915/display/intel_dmc.c | 17 +- > > drivers/gpu/drm/i915/display/intel_dp.c | 11 +- > > drivers/gpu/drm/i915/display/intel_dp_aux.c | 6 + > > drivers/gpu/drm/i915/display/intel_dpio_phy.c | 9 +- > > drivers/gpu/drm/i915/display/intel_dpio_phy.h | 15 + > > drivers/gpu/drm/i915/display/intel_dpll.c | 8 +- > > drivers/gpu/drm/i915/display/intel_dpll_mgr.c | 4 + > > drivers/gpu/drm/i915/display/intel_drrs.c | 1 + > > drivers/gpu/drm/i915/display/intel_dsb.c | 124 +++++-- > > drivers/gpu/drm/i915/display/intel_dsi_vbt.c | 26 +- > > drivers/gpu/drm/i915/display/intel_fb.c | 108 ++++-- > > drivers/gpu/drm/i915/display/intel_fb_pin.c | 6 - > > drivers/gpu/drm/i915/display/intel_fbc.c | 49 ++- > > drivers/gpu/drm/i915/display/intel_fbdev.c | 108 +++++- > > .../gpu/drm/i915/display/intel_frontbuffer.c | 103 +----- > > .../gpu/drm/i915/display/intel_frontbuffer.h | 67 +--- > > drivers/gpu/drm/i915/display/intel_gmbus.c | 2 +- > > drivers/gpu/drm/i915/display/intel_hdcp.c | 9 +- > > drivers/gpu/drm/i915/display/intel_hdmi.c | 1 - > > .../gpu/drm/i915/display/intel_lpe_audio.h | 8 + > > .../drm/i915/display/intel_modeset_setup.c | 11 +- > > drivers/gpu/drm/i915/display/intel_opregion.c | 2 +- > > drivers/gpu/drm/i915/display/intel_overlay.c | 14 - > > .../gpu/drm/i915/display/intel_pch_display.h | 16 + > > .../gpu/drm/i915/display/intel_pch_refclk.h | 8 + > > drivers/gpu/drm/i915/display/intel_pipe_crc.c | 1 + > > .../drm/i915/display/intel_plane_initial.c | 3 +- > > drivers/gpu/drm/i915/display/intel_psr.c | 1 + > > drivers/gpu/drm/i915/display/intel_sprite.c | 21 ++ > > drivers/gpu/drm/i915/display/intel_vbt_defs.h | 2 +- > > drivers/gpu/drm/i915/display/intel_vga.c | 5 + > > drivers/gpu/drm/i915/display/skl_scaler.c | 2 + > > .../drm/i915/display/skl_universal_plane.c | 52 ++- > > drivers/gpu/drm/i915/display/skl_watermark.c | 25 +- > > drivers/gpu/drm/i915/gem/i915_gem_clflush.c | 4 - > > drivers/gpu/drm/i915/gem/i915_gem_domain.c | 7 - > > .../gpu/drm/i915/gem/i915_gem_execbuffer.c | 2 - > > drivers/gpu/drm/i915/gem/i915_gem_object.c | 25 -- > > drivers/gpu/drm/i915/gem/i915_gem_object.h | 22 -- > > drivers/gpu/drm/i915/gem/i915_gem_phys.c | 4 - > > drivers/gpu/drm/i915/gt/intel_gt_regs.h | 3 +- > > drivers/gpu/drm/i915/i915_driver.c | 1 + > > drivers/gpu/drm/i915/i915_gem.c | 8 - > > drivers/gpu/drm/i915/i915_gem_gtt.c | 1 - > > drivers/gpu/drm/i915/i915_reg_defs.h | 8 + > > drivers/gpu/drm/i915/i915_vma.c | 12 - > > drivers/gpu/drm/radeon/radeon.h | 55 +-- > > drivers/gpu/drm/radeon/radeon_ib.c | 12 +- > > drivers/gpu/drm/radeon/radeon_object.h | 25 +- > > drivers/gpu/drm/radeon/radeon_sa.c | 314 ++--------------- > > drivers/gpu/drm/radeon/radeon_semaphore.c | 6 +- > > drivers/gpu/drm/scheduler/sched_main.c | 182 +++++++--- > > drivers/gpu/drm/ttm/ttm_bo.c | 3 +- > > drivers/misc/mei/hdcp/Kconfig | 2 +- > > drivers/misc/mei/hdcp/mei_hdcp.c | 3 +- > > include/drm/drm_pt_walk.h | 161 +++++++++ > > include/drm/drm_suballoc.h | 112 ++++++ > > include/drm/gpu_scheduler.h | 41 ++- > > sound/hda/hdac_i915.c | 17 +- > > sound/pci/hda/hda_intel.c | 56 +-- > > sound/soc/intel/avs/core.c | 13 +- > > sound/soc/sof/intel/hda.c | 7 +- > > 98 files changed, 2076 insertions(+), 1325 deletions(-) > > create mode 100644 drivers/gpu/drm/drm_pt_walk.c > > create mode 100644 drivers/gpu/drm/drm_suballoc.c > > create mode 100644 include/drm/drm_pt_walk.h > > create mode 100644 include/drm/drm_suballoc.h > > > > -- > > 2.37.3 > > > > -- > Daniel Vetter > Software Engineer, Intel Corporation > http://blog.ffwll.ch
On Fri, Feb 17, 2023 at 09:51:37PM +0100, Daniel Vetter wrote: > Hi all, > > [I thought I've sent this out earlier this week, but alas got stuck, kinda > bad timing now since I'm out next week but oh well] > > So xe is a quite substantial thing, and I think we need a clear plan how to land > this or it will take forever, and managers will panic. Also I'm not a big fan of > "Dave/me reviews everything", we defacto had that for amd's dc/dal and it was > not fun. The idea here is how to get everything reviewed without having two > people end up somewhat arbitrary as deciders. Thank you so much for taking time to write it down. We need to get alignment on the critical topics to see how we can move this forward. > > I've compiled a bunch of topics on what I think the important areas are, first > code that should be consistent about new-style render drivers that are aimed for > vk/compute userspace as the primary feature driver: > > - figure out consensus solution for fw scheduler and drm/sched frontend among > interested driver parties (probably xe, amdgpu, nouveau, new panfrost) Yeap. We do need to figure this out. But just to ensure that we are in the same page here. What I had in mind was that Matt would upstream the 5 or 6 drm_sched related patches that we have underneath Xe patches on drm-misc with addressing the community feedback, then we would merge Xe with the current schedule solution (or modifications based on the modifications of these mentioned patches) and then we would continue to work with the other drivers to improve the drm sched frontend while we are already in tree. Possible? or do you want to see fundamental changes before we can even consider to get in? Like the ones below? > > - for the interface itself it might be good to have the drm_gpu_scheduler as the > single per-hw-engine driver api object (but internally a new structure), while > renaming the current drm_gpu_scheduler to drm_gpu_sched_internal. That way I > think we can address the main critique of the current xe scheduler plan > - keep the drm_gpu_sched_internal : drm_sched_entity 1:1 relationship for fw > scheduler > - keep the driver api relationship of drm_gpu_scheduler : drm_sched_entity > 1:n, the api functions simply iterate over a mutex protect list of internal > schedulers. this should also help drivers with locking mistakes around > setup/teardown and gpu reset. > - drivers select with a flag or something between the current mode (where the > drm_gpu_sched_internal is attached to the drm_gpu_scheduler api object) or > the new fw scheduler mode (where drm_gpu_sched_internal is attached to the > drm_sched_entity) > - overall still no fundamental changes (like the current patches) to drm/sched > data structures and algorithms. But unlike the current patches we keep the > possibility open for eventual refactoring without having to again refactor > all the drivers. Even better, we can delay such refactoring until we have a > handful of real-word drivers test-driving this all so we know we actually do > the right thing. This should allow us to address all the > fairness/efficiency/whatever concerns that have been floating around without > having to fix them all up upfront, before we actually know what needs to be > fixed. do you believe this has to be decided and moved towards one of this before we get merged? > > - the generic scheduler code should also including the handling of endless > compute contexts, with the minimal scaffolding for preempt-ctx fences > (probably on the drm_sched_entity) and making sure drm/sched can cope with the > lack of job completion fence. This is very minimal amounts of code, but it > helps a lot for cross-driver review if this works the same (with the same > locking and all that) for everyone. Ideally this gets extracted from amdkfd, > but as long as it's going to be used by all drivers supporting > endless/compute context going forward it's good enough. On this one I'm a bit clueless to be honest. I thought the biggest problem with the long running context or even endless were due to the hangcheck premption or migrations that would end in some pagefaults. But yeap, it looks that there are opens to get these kind of workloads properly supported. But with this in mind do you see any real blocker on Xe? or any must-have thing? > > - I'm assuming this also means Matt Brost will include a patch to add himself as > drm/sched reviewer in MAINTAINERS, or at least something like that +1 on this idea! This enforces our engagement and commitment with the drm_sched imho. > > - adopt the gem_exec/vma helpers. again we probably want consensus here among > the same driver projects. I don't care whether these helpers specify the ioctl > structs or not, but they absolutely need to enforce the overall locking scheme > for all major structs and list (so vm and vma). On this front I thought we would need to align on a common drm_vm_bind based on the common parts of xe vm_bind and nouveau one. And also some engagement that I thought it would be easier after we are integrated and part of the drm-next. Do we need to do this earlier? Could you please open it a bit more on what exactly you want to see before we can be considered to get merged or after? > > - we also should have cross-driver consensus on async vm_bind support. I think > everyone added in-syncobj support, the real fun is probably more in/out > userspace memory fences (and personally I'm still not sure that's a good idea > but ... *eh*). I think cross driver consensus on how this should work (ideally > with helper support so people don't get it wrong in all the possible ways) > would be best. Should the consensus API come first? should this block the nouveau implementation and move us all towards the drm_vm_bind? or can we sync in-tree? > > - this also means some userptr integration and some consensus how userptr should > work for vm_bind across drivers. I don't think allowing drivers to reinvent > that wheel is a bright idea, there's just a bit too much to get wrong here. ack. but kind of same question. is it a blocker to align before? or easier to align in tree? > > - for some of these the consensus might land on more/less shared code than what > I sketched out above, the important part really is that we have consensus on > these. Kinda similar to how the atomic kms infrastructure move a _lot_ more of > the code back into drivers, because they really just needed the flexibility to > program the hw correctly. Right now we definitely don't have enough shared > code, for sure with i915-gem, but we also need to make sure we're not > overcorrecting too badly (a bit of overcorrecting generally doesn't hurt). +1 on this. We need to work more in the drm layers like display has done successfully! > > All the above will make sure that the driver overall is in concepts and design > aligned with the overall community direction, but I think it'd still be good if > someone outside of the intel gpu group reviews the driver code itself. Last time > we had a huge driver submission (amd's DC/DAL) this fell on Dave&me, but this > time around I think we have a perfect candidate with Oded: > > - Oded needs/wants to spend some time on ramping up on how drm render drivers > work anyway, and xe is probably the best example of a driver that's both > supposed to be full-featured, but also doesn't contain an entire display > driver on the side. > > - Oded is in Habana, which is legally part of Intel. Bean counter budget > shuffling to make this happen should be possible. > > - Habana is still fairly distinct entity within Intel, so that is probably the > best approach for some independent review, without making the xe team > beholden to some non-Intel people. +1 on this entire idea here as well. > > The above should yield some pretty clear road towards landing xe, without any > big review fights with Dave/me like we had with amd's DC/DAL, which took a > rather long time to land unfortunately :-( As I wrote already, I really agree with you that we should work more with the drm and more with the other drivers. But for the logistics of the work and the rebase pains and to avoid a situation where we have a totally divergent driver, I believe the fastest way is to solve any blockers and big issues before, then merge, then work towards more collaboration on the next step. Specially when with Xe we are not planning to remove the force_probe flag for a while, what puts us in a "staging" situation. We could even make use of the CONFIG_STAGING if needed. Thoughts? And most than that, any already know big blockers? > > These are just my thoughts, let the bikeshed commence! :) > > Ideally we put them all into a TODO like we've done for DC/DAL, once we have > some consensus. I like the TODO list idea. And also we need to use more the RFC doc section as well, like i915-vmbind used. On the TODO part, where do you recommend to add in the doc? Again, thank you so much, Rodrigo. > > Cheers, Daniel > > On Thu, Dec 22, 2022 at 02:21:07PM -0800, Matthew Brost wrote: > > Hello, > > > > This is a submission for Xe, a new driver for Intel GPUs that supports both > > integrated and discrete platforms starting with Tiger Lake (first platform with > > Intel Xe Architecture). The intention of this new driver is to have a fresh base > > to work from that is unencumbered by older platforms, whilst also taking the > > opportunity to rearchitect our driver to increase sharing across the drm > > subsystem, both leveraging and allowing us to contribute more towards other > > shared components like TTM and drm/scheduler. The memory model is based on VM > > bind which is similar to the i915 implementation. Likewise the execbuf > > implementation for Xe is very similar to execbuf3 in the i915 [1]. > > > > The code is at a stage where it is already functional and has experimental > > support for multiple platforms starting from Tiger Lake, with initial support > > implemented in Mesa (for Iris and Anv, our OpenGL and Vulkan drivers), as well > > as in NEO (for OpenCL and Level0). A Mesa MR has been posted [2] and NEO > > implementation will be released publicly early next year. We also have a suite > > of IGTs for XE that will appear on the IGT list shortly. > > > > It has been built with the assumption of supporting multiple architectures from > > the get-go, right now with tests running both on X86 and ARM hosts. And we > > intend to continue working on it and improving on it as part of the kernel > > community upstream. > > > > The new Xe driver leverages a lot from i915 and work on i915 continues as we > > ready Xe for production throughout 2023. > > > > As for display, the intent is to share the display code with the i915 driver so > > that there is maximum reuse there. Currently this is being done by compiling the > > display code twice, but alternatives to that are under consideration and we want > > to have more discussion on what the best final solution will look like over the > > next few months. Right now, work is ongoing in refactoring the display codebase > > to remove as much as possible any unnecessary dependencies on i915 specific data > > structures there.. > > > > We currently have 2 submission backends, execlists and GuC. The execlist is > > meant mostly for testing and is not fully functional while GuC backend is fully > > functional. As with the i915 and GuC submission, in Xe the GuC firmware is > > required and should be placed in /lib/firmware/xe. > > > > The GuC firmware can be found in the below location: > > https://git.kernel.org/pub/scm/linux/kernel/git/firmware/linux-firmware.git/tree/i915 > > > > The easiest way to setup firmware is: > > cp -r /lib/firmware/i915 /lib/firmware/xe > > > > The code has been organized such that we have all patches that touch areas > > outside of drm/xe first for review, and then the actual new driver in a separate > > commit. The code which is outside of drm/xe is included in this RFC while > > drm/xe is not due to the size of the commit. The drm/xe is code is available in > > a public repo listed below. > > > > Xe driver commit: > > https://cgit.freedesktop.org/drm/drm-xe/commit/?h=drm-xe-next&id=9cb016ebbb6a275f57b1cb512b95d5a842391ad7 > > > > Xe kernel repo: > > https://cgit.freedesktop.org/drm/drm-xe/ > > > > There's a lot of work still to happen on Xe but we're very excited about it and > > wanted to share it early and welcome feedback and discussion. > > > > Cheers, > > Matthew Brost > > > > [1] https://patchwork.freedesktop.org/series/105879/ > > [2] https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20418 > > > > Maarten Lankhorst (12): > > drm/amd: Convert amdgpu to use suballocation helper. > > drm/radeon: Use the drm suballocation manager implementation. > > drm/i915: Remove gem and overlay frontbuffer tracking > > drm/i915/display: Neuter frontbuffer tracking harder > > drm/i915/display: Add more macros to remove all direct calls to uncore > > drm/i915/display: Remove all uncore mmio accesses in favor of intel_de > > drm/i915: Rename find_section to find_bdb_section > > drm/i915/regs: Set DISPLAY_MMIO_BASE to 0 for xe > > drm/i915/display: Fix a use-after-free when intel_edp_init_connector > > fails > > drm/i915/display: Remaining changes to make xe compile > > sound/hda: Allow XE as i915 replacement for sound > > mei/hdcp: Also enable for XE > > > > Matthew Brost (5): > > drm/sched: Convert drm scheduler to use a work queue rather than > > kthread > > drm/sched: Add generic scheduler message interface > > drm/sched: Start run wq before TDR in drm_sched_start > > drm/sched: Submit job before starting TDR > > drm/sched: Add helper to set TDR timeout > > > > Thomas Hellström (3): > > drm/suballoc: Introduce a generic suballocation manager > > drm: Add a gpu page-table walker helper > > drm/ttm: Don't print error message if eviction was interrupted > > > > drivers/gpu/drm/Kconfig | 5 + > > drivers/gpu/drm/Makefile | 4 + > > drivers/gpu/drm/amd/amdgpu/Kconfig | 1 + > > drivers/gpu/drm/amd/amdgpu/amdgpu.h | 26 +- > > drivers/gpu/drm/amd/amdgpu/amdgpu_debugfs.c | 14 +- > > drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 12 +- > > drivers/gpu/drm/amd/amdgpu/amdgpu_ib.c | 5 +- > > drivers/gpu/drm/amd/amdgpu/amdgpu_object.h | 23 +- > > drivers/gpu/drm/amd/amdgpu/amdgpu_ring.h | 3 +- > > drivers/gpu/drm/amd/amdgpu/amdgpu_sa.c | 320 +----------------- > > drivers/gpu/drm/drm_pt_walk.c | 159 +++++++++ > > drivers/gpu/drm/drm_suballoc.c | 301 ++++++++++++++++ > > drivers/gpu/drm/i915/Makefile | 2 +- > > drivers/gpu/drm/i915/display/hsw_ips.c | 7 +- > > drivers/gpu/drm/i915/display/i9xx_plane.c | 1 + > > drivers/gpu/drm/i915/display/intel_atomic.c | 2 + > > .../gpu/drm/i915/display/intel_atomic_plane.c | 25 +- > > .../gpu/drm/i915/display/intel_backlight.c | 2 +- > > drivers/gpu/drm/i915/display/intel_bios.c | 71 ++-- > > drivers/gpu/drm/i915/display/intel_bw.c | 36 +- > > drivers/gpu/drm/i915/display/intel_cdclk.c | 68 ++-- > > drivers/gpu/drm/i915/display/intel_color.c | 1 + > > drivers/gpu/drm/i915/display/intel_crtc.c | 14 +- > > drivers/gpu/drm/i915/display/intel_cursor.c | 14 +- > > drivers/gpu/drm/i915/display/intel_de.h | 38 +++ > > drivers/gpu/drm/i915/display/intel_display.c | 155 +++++++-- > > drivers/gpu/drm/i915/display/intel_display.h | 9 +- > > .../gpu/drm/i915/display/intel_display_core.h | 5 +- > > .../drm/i915/display/intel_display_debugfs.c | 8 + > > .../drm/i915/display/intel_display_power.c | 40 ++- > > .../drm/i915/display/intel_display_power.h | 6 + > > .../i915/display/intel_display_power_map.c | 7 + > > .../i915/display/intel_display_power_well.c | 24 +- > > .../drm/i915/display/intel_display_reg_defs.h | 4 + > > .../drm/i915/display/intel_display_trace.h | 6 + > > .../drm/i915/display/intel_display_types.h | 32 +- > > drivers/gpu/drm/i915/display/intel_dmc.c | 17 +- > > drivers/gpu/drm/i915/display/intel_dp.c | 11 +- > > drivers/gpu/drm/i915/display/intel_dp_aux.c | 6 + > > drivers/gpu/drm/i915/display/intel_dpio_phy.c | 9 +- > > drivers/gpu/drm/i915/display/intel_dpio_phy.h | 15 + > > drivers/gpu/drm/i915/display/intel_dpll.c | 8 +- > > drivers/gpu/drm/i915/display/intel_dpll_mgr.c | 4 + > > drivers/gpu/drm/i915/display/intel_drrs.c | 1 + > > drivers/gpu/drm/i915/display/intel_dsb.c | 124 +++++-- > > drivers/gpu/drm/i915/display/intel_dsi_vbt.c | 26 +- > > drivers/gpu/drm/i915/display/intel_fb.c | 108 ++++-- > > drivers/gpu/drm/i915/display/intel_fb_pin.c | 6 - > > drivers/gpu/drm/i915/display/intel_fbc.c | 49 ++- > > drivers/gpu/drm/i915/display/intel_fbdev.c | 108 +++++- > > .../gpu/drm/i915/display/intel_frontbuffer.c | 103 +----- > > .../gpu/drm/i915/display/intel_frontbuffer.h | 67 +--- > > drivers/gpu/drm/i915/display/intel_gmbus.c | 2 +- > > drivers/gpu/drm/i915/display/intel_hdcp.c | 9 +- > > drivers/gpu/drm/i915/display/intel_hdmi.c | 1 - > > .../gpu/drm/i915/display/intel_lpe_audio.h | 8 + > > .../drm/i915/display/intel_modeset_setup.c | 11 +- > > drivers/gpu/drm/i915/display/intel_opregion.c | 2 +- > > drivers/gpu/drm/i915/display/intel_overlay.c | 14 - > > .../gpu/drm/i915/display/intel_pch_display.h | 16 + > > .../gpu/drm/i915/display/intel_pch_refclk.h | 8 + > > drivers/gpu/drm/i915/display/intel_pipe_crc.c | 1 + > > .../drm/i915/display/intel_plane_initial.c | 3 +- > > drivers/gpu/drm/i915/display/intel_psr.c | 1 + > > drivers/gpu/drm/i915/display/intel_sprite.c | 21 ++ > > drivers/gpu/drm/i915/display/intel_vbt_defs.h | 2 +- > > drivers/gpu/drm/i915/display/intel_vga.c | 5 + > > drivers/gpu/drm/i915/display/skl_scaler.c | 2 + > > .../drm/i915/display/skl_universal_plane.c | 52 ++- > > drivers/gpu/drm/i915/display/skl_watermark.c | 25 +- > > drivers/gpu/drm/i915/gem/i915_gem_clflush.c | 4 - > > drivers/gpu/drm/i915/gem/i915_gem_domain.c | 7 - > > .../gpu/drm/i915/gem/i915_gem_execbuffer.c | 2 - > > drivers/gpu/drm/i915/gem/i915_gem_object.c | 25 -- > > drivers/gpu/drm/i915/gem/i915_gem_object.h | 22 -- > > drivers/gpu/drm/i915/gem/i915_gem_phys.c | 4 - > > drivers/gpu/drm/i915/gt/intel_gt_regs.h | 3 +- > > drivers/gpu/drm/i915/i915_driver.c | 1 + > > drivers/gpu/drm/i915/i915_gem.c | 8 - > > drivers/gpu/drm/i915/i915_gem_gtt.c | 1 - > > drivers/gpu/drm/i915/i915_reg_defs.h | 8 + > > drivers/gpu/drm/i915/i915_vma.c | 12 - > > drivers/gpu/drm/radeon/radeon.h | 55 +-- > > drivers/gpu/drm/radeon/radeon_ib.c | 12 +- > > drivers/gpu/drm/radeon/radeon_object.h | 25 +- > > drivers/gpu/drm/radeon/radeon_sa.c | 314 ++--------------- > > drivers/gpu/drm/radeon/radeon_semaphore.c | 6 +- > > drivers/gpu/drm/scheduler/sched_main.c | 182 +++++++--- > > drivers/gpu/drm/ttm/ttm_bo.c | 3 +- > > drivers/misc/mei/hdcp/Kconfig | 2 +- > > drivers/misc/mei/hdcp/mei_hdcp.c | 3 +- > > include/drm/drm_pt_walk.h | 161 +++++++++ > > include/drm/drm_suballoc.h | 112 ++++++ > > include/drm/gpu_scheduler.h | 41 ++- > > sound/hda/hdac_i915.c | 17 +- > > sound/pci/hda/hda_intel.c | 56 +-- > > sound/soc/intel/avs/core.c | 13 +- > > sound/soc/sof/intel/hda.c | 7 +- > > 98 files changed, 2076 insertions(+), 1325 deletions(-) > > create mode 100644 drivers/gpu/drm/drm_pt_walk.c > > create mode 100644 drivers/gpu/drm/drm_suballoc.c > > create mode 100644 include/drm/drm_pt_walk.h > > create mode 100644 include/drm/drm_suballoc.h > > > > -- > > 2.37.3 > > > > -- > Daniel Vetter > Software Engineer, Intel Corporation > http://blog.ffwll.ch
On Thu, 2 Mar 2023 at 00:00, Rodrigo Vivi <rodrigo.vivi@intel.com> wrote: > On Fri, Feb 17, 2023 at 09:51:37PM +0100, Daniel Vetter wrote: > > Hi all, > > > > [I thought I've sent this out earlier this week, but alas got stuck, kinda > > bad timing now since I'm out next week but oh well] > > > > So xe is a quite substantial thing, and I think we need a clear plan how to land > > this or it will take forever, and managers will panic. Also I'm not a big fan of > > "Dave/me reviews everything", we defacto had that for amd's dc/dal and it was > > not fun. The idea here is how to get everything reviewed without having two > > people end up somewhat arbitrary as deciders. > > Thank you so much for taking time to write it down. We need to get alignment > on the critical topics to see how we can move this forward. Sorry for the delay on my side, last week was carneval and this week big team meeting. > > I've compiled a bunch of topics on what I think the important areas are, first > > code that should be consistent about new-style render drivers that are aimed for > > vk/compute userspace as the primary feature driver: > > > > - figure out consensus solution for fw scheduler and drm/sched frontend among > > interested driver parties (probably xe, amdgpu, nouveau, new panfrost) > > Yeap. We do need to figure this out. But just to ensure that we are in the same > page here. What I had in mind was that Matt would upstream the 5 or 6 drm_sched > related patches that we have underneath Xe patches on drm-misc with addressing > the community feedback, then we would merge Xe with the current schedule solution > (or modifications based on the modifications of these mentioned patches) and > then we would continue to work with the other drivers to improve the drm sched > frontend while we are already in tree. Possible? or do you want to see > fundamental changes before we can even consider to get in? Like the ones below? The trouble with that is that then you'll have a lot more driver changes and big renames in drivers after they landed. Which might be too painful, and why I suggested the below minimal-most driver-api wrapping to decouple that. My worry is that if you don't do that, then the driver merging will be bogged down in endless discussions about what the refactoring should look like exactly (the discussions here and elsewhere kinda gave a preview), and it'll make driver merging actually slower. Hence my suggestion to just decouple things enough so that people agree to merge now and refactor later as the reasonable thing. Now if there would be consensus that this here is already perfect and nothing more needed for fw scheduling, then I think just going ahead with that would be perfectly fine, but I'm kinda not seeing that. Given that there is so much discussion here I don't want to step in with an arbitrary maintainer verdict, that's largely the approach we've done with amd's dal and I don't think it was the best way forward really. > > - for the interface itself it might be good to have the drm_gpu_scheduler as the > > single per-hw-engine driver api object (but internally a new structure), while > > renaming the current drm_gpu_scheduler to drm_gpu_sched_internal. That way I > > think we can address the main critique of the current xe scheduler plan > > - keep the drm_gpu_sched_internal : drm_sched_entity 1:1 relationship for fw > > scheduler > > - keep the driver api relationship of drm_gpu_scheduler : drm_sched_entity > > 1:n, the api functions simply iterate over a mutex protect list of internal > > schedulers. this should also help drivers with locking mistakes around > > setup/teardown and gpu reset. > > - drivers select with a flag or something between the current mode (where the > > drm_gpu_sched_internal is attached to the drm_gpu_scheduler api object) or > > the new fw scheduler mode (where drm_gpu_sched_internal is attached to the > > drm_sched_entity) > > - overall still no fundamental changes (like the current patches) to drm/sched > > data structures and algorithms. But unlike the current patches we keep the > > possibility open for eventual refactoring without having to again refactor > > all the drivers. Even better, we can delay such refactoring until we have a > > handful of real-word drivers test-driving this all so we know we actually do > > the right thing. This should allow us to address all the > > fairness/efficiency/whatever concerns that have been floating around without > > having to fix them all up upfront, before we actually know what needs to be > > fixed. > > do you believe this has to be decided and moved towards one of this before we > get merged? I think either clear consensus by stakeholders that refactoring afterwards is the right thing, or something like the above to decouple fw scheduling drivers from drm/sched is I think needed. And my gut feeling is that the 2nd option is a much faster and less risky path to xe, but if you want to do the big dri-devel arguing championship, you can do that too :-) > > - the generic scheduler code should also including the handling of endless > > compute contexts, with the minimal scaffolding for preempt-ctx fences > > (probably on the drm_sched_entity) and making sure drm/sched can cope with the > > lack of job completion fence. This is very minimal amounts of code, but it > > helps a lot for cross-driver review if this works the same (with the same > > locking and all that) for everyone. Ideally this gets extracted from amdkfd, > > but as long as it's going to be used by all drivers supporting > > endless/compute context going forward it's good enough. > > On this one I'm a bit clueless to be honest. I thought the biggest problem with > the long running context or even endless were due to the hangcheck premption or > migrations that would end in some pagefaults. > But yeap, it looks that there are opens to get these kind of workloads properly > supported. But with this in mind do you see any real blocker on Xe? or any must-have > thing? Yeah hangcheck is the architectural issue, this here is just my suggestion for how to solve it technically in code in a consistent way across drivers. From a subsystem maintainer pov the important stuff is really that drivers share key concepts as much as possible, and I think this is one such key concept. E.g. for amd's dal we didn't ask them to rewrite the entire thing to our taste, but to properly integrate key drm display concepts and structs directly into their driver so that when you want to look for all crtc related code across all drivers you just have to chase drm_crtc, and not figure out what a crtc is in each driver (e.g. in i915 display speak crtc = pipe). Same here, just minimal data structure and scaffolding for long-running context and preempt ctx fences would be really good I think. > > - I'm assuming this also means Matt Brost will include a patch to add himself as > > drm/sched reviewer in MAINTAINERS, or at least something like that > > +1 on this idea! > This enforces our engagement and commitment with the drm_sched imho. > > > > > - adopt the gem_exec/vma helpers. again we probably want consensus here among > > the same driver projects. I don't care whether these helpers specify the ioctl > > structs or not, but they absolutely need to enforce the overall locking scheme > > for all major structs and list (so vm and vma). > > On this front I thought we would need to align on a common drm_vm_bind based on > the common parts of xe vm_bind and nouveau one. And also some engagement that > I thought it would be easier after we are integrated and part of the drm-next. > Do we need to do this earlier? Could you please open it a bit more on what > exactly you want to see before we can be considered to get merged or after? Again, I don't really want to do the maintainer verdict here and just dictate, I think much better if these discussions are done directly among the involved people. I do generally think that the refactoring should happen upfront for xe, simply due to past track record. Which yes sucks a bit and is a special requirement, but I think a bit a stricter barrier but a really clear one is much better for everyone than some very, very handwaving "enough to make Dave&me happy" super vague thing that will guarantee heated arguments like we've had plenty with amd's dal. > > - we also should have cross-driver consensus on async vm_bind support. I think > > everyone added in-syncobj support, the real fun is probably more in/out > > userspace memory fences (and personally I'm still not sure that's a good idea > > but ... *eh*). I think cross driver consensus on how this should work (ideally > > with helper support so people don't get it wrong in all the possible ways) > > would be best. > > Should the consensus API come first? should this block the nouveau implementation > and move us all towards the drm_vm_bind? or can we sync in-tree? Same as above. Since async isn't a requirement (it's optional for both vk and I guess also for compute since current compute still works on i915-gem, which doesn't have this?) this shouldn't block merging the xe driver. So I think it's not too horrendous to make this a blocker. Of course if the vulkan folks disagree then maybe do some other merge order (and record all that with appropriate amounts of acks). > > - this also means some userptr integration and some consensus how userptr should > > work for vm_bind across drivers. I don't think allowing drivers to reinvent > > that wheel is a bright idea, there's just a bit too much to get wrong here. > > ack. but kind of same question. is it a blocker to align before? or easier to > align in tree? Still same as above, I think best is to make them all as clear per-merge goals so that we avoid endless "is this now good enough" discussions with all the frustration and arbitrary delays this would bring. And yes again I realize this sucks a bit and is a bit special. Kinda the same idea like we've tried doing with the refactoring/feature-landing documents in Documenation/gpu/rfc.rst. Actually maybe the entire xe merge plan should perhaps become Documentation/gpu/rfc/xe.rst? > > - for some of these the consensus might land on more/less shared code than what > > I sketched out above, the important part really is that we have consensus on > > these. Kinda similar to how the atomic kms infrastructure move a _lot_ more of > > the code back into drivers, because they really just needed the flexibility to > > program the hw correctly. Right now we definitely don't have enough shared > > code, for sure with i915-gem, but we also need to make sure we're not > > overcorrecting too badly (a bit of overcorrecting generally doesn't hurt). > > +1 on this. We need to work more in the drm layers like display has done successfully! > > > > > All the above will make sure that the driver overall is in concepts and design > > aligned with the overall community direction, but I think it'd still be good if > > someone outside of the intel gpu group reviews the driver code itself. Last time > > we had a huge driver submission (amd's DC/DAL) this fell on Dave&me, but this > > time around I think we have a perfect candidate with Oded: > > > > - Oded needs/wants to spend some time on ramping up on how drm render drivers > > work anyway, and xe is probably the best example of a driver that's both > > supposed to be full-featured, but also doesn't contain an entire display > > driver on the side. > > > > - Oded is in Habana, which is legally part of Intel. Bean counter budget > > shuffling to make this happen should be possible. > > > > - Habana is still fairly distinct entity within Intel, so that is probably the > > best approach for some independent review, without making the xe team > > beholden to some non-Intel people. > > +1 on this entire idea here as well. > > > > > The above should yield some pretty clear road towards landing xe, without any > > big review fights with Dave/me like we had with amd's DC/DAL, which took a > > rather long time to land unfortunately :-( > > As I wrote already, I really agree with you that we should work more with the > drm and more with the other drivers. But for the logistics of the work and > the rebase pains and to avoid a situation where we have a totally divergent > driver, I believe the fastest way is to solve any blockers and big issues > before, then merge, then work towards more collaboration on the next step. > > Specially when with Xe we are not planning to remove the force_probe > flag for a while, what puts us in a "staging" situation. > We could even make use of the CONFIG_STAGING if needed. I general, I agree with this. But also, I've acked a bunch of these plans for intel-gem (like the i915 guc scheduler), and then we had to backtrack those because everyone realize that "refactor in-tree" was actually impossible. It's definitely a bit "burned too many times on this" reaction, but I'd like to make sure we don't end up in that situation again with the next big pile of intel-gem code. > Thoughts? > And most than that, any already know big blockers? > > > > > These are just my thoughts, let the bikeshed commence! > > :) > > > > > Ideally we put them all into a TODO like we've done for DC/DAL, once we have > > some consensus. > > I like the TODO list idea. > And also we need to use more the RFC doc section as well, like > i915-vmbind used. > > On the TODO part, where do you recommend to add in the doc? See above, I think we have the right place with the rfc section already. Cheers, Daniel > > Again, thank you so much, > Rodrigo. > > > > > Cheers, Daniel > > > > On Thu, Dec 22, 2022 at 02:21:07PM -0800, Matthew Brost wrote: > > > Hello, > > > > > > This is a submission for Xe, a new driver for Intel GPUs that supports both > > > integrated and discrete platforms starting with Tiger Lake (first platform with > > > Intel Xe Architecture). The intention of this new driver is to have a fresh base > > > to work from that is unencumbered by older platforms, whilst also taking the > > > opportunity to rearchitect our driver to increase sharing across the drm > > > subsystem, both leveraging and allowing us to contribute more towards other > > > shared components like TTM and drm/scheduler. The memory model is based on VM > > > bind which is similar to the i915 implementation. Likewise the execbuf > > > implementation for Xe is very similar to execbuf3 in the i915 [1]. > > > > > > The code is at a stage where it is already functional and has experimental > > > support for multiple platforms starting from Tiger Lake, with initial support > > > implemented in Mesa (for Iris and Anv, our OpenGL and Vulkan drivers), as well > > > as in NEO (for OpenCL and Level0). A Mesa MR has been posted [2] and NEO > > > implementation will be released publicly early next year. We also have a suite > > > of IGTs for XE that will appear on the IGT list shortly. > > > > > > It has been built with the assumption of supporting multiple architectures from > > > the get-go, right now with tests running both on X86 and ARM hosts. And we > > > intend to continue working on it and improving on it as part of the kernel > > > community upstream. > > > > > > The new Xe driver leverages a lot from i915 and work on i915 continues as we > > > ready Xe for production throughout 2023. > > > > > > As for display, the intent is to share the display code with the i915 driver so > > > that there is maximum reuse there. Currently this is being done by compiling the > > > display code twice, but alternatives to that are under consideration and we want > > > to have more discussion on what the best final solution will look like over the > > > next few months. Right now, work is ongoing in refactoring the display codebase > > > to remove as much as possible any unnecessary dependencies on i915 specific data > > > structures there.. > > > > > > We currently have 2 submission backends, execlists and GuC. The execlist is > > > meant mostly for testing and is not fully functional while GuC backend is fully > > > functional. As with the i915 and GuC submission, in Xe the GuC firmware is > > > required and should be placed in /lib/firmware/xe. > > > > > > The GuC firmware can be found in the below location: > > > https://git.kernel.org/pub/scm/linux/kernel/git/firmware/linux-firmware.git/tree/i915 > > > > > > The easiest way to setup firmware is: > > > cp -r /lib/firmware/i915 /lib/firmware/xe > > > > > > The code has been organized such that we have all patches that touch areas > > > outside of drm/xe first for review, and then the actual new driver in a separate > > > commit. The code which is outside of drm/xe is included in this RFC while > > > drm/xe is not due to the size of the commit. The drm/xe is code is available in > > > a public repo listed below. > > > > > > Xe driver commit: > > > https://cgit.freedesktop.org/drm/drm-xe/commit/?h=drm-xe-next&id=9cb016ebbb6a275f57b1cb512b95d5a842391ad7 > > > > > > Xe kernel repo: > > > https://cgit.freedesktop.org/drm/drm-xe/ > > > > > > There's a lot of work still to happen on Xe but we're very excited about it and > > > wanted to share it early and welcome feedback and discussion. > > > > > > Cheers, > > > Matthew Brost > > > > > > [1] https://patchwork.freedesktop.org/series/105879/ > > > [2] https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20418 > > > > > > Maarten Lankhorst (12): > > > drm/amd: Convert amdgpu to use suballocation helper. > > > drm/radeon: Use the drm suballocation manager implementation. > > > drm/i915: Remove gem and overlay frontbuffer tracking > > > drm/i915/display: Neuter frontbuffer tracking harder > > > drm/i915/display: Add more macros to remove all direct calls to uncore > > > drm/i915/display: Remove all uncore mmio accesses in favor of intel_de > > > drm/i915: Rename find_section to find_bdb_section > > > drm/i915/regs: Set DISPLAY_MMIO_BASE to 0 for xe > > > drm/i915/display: Fix a use-after-free when intel_edp_init_connector > > > fails > > > drm/i915/display: Remaining changes to make xe compile > > > sound/hda: Allow XE as i915 replacement for sound > > > mei/hdcp: Also enable for XE > > > > > > Matthew Brost (5): > > > drm/sched: Convert drm scheduler to use a work queue rather than > > > kthread > > > drm/sched: Add generic scheduler message interface > > > drm/sched: Start run wq before TDR in drm_sched_start > > > drm/sched: Submit job before starting TDR > > > drm/sched: Add helper to set TDR timeout > > > > > > Thomas Hellström (3): > > > drm/suballoc: Introduce a generic suballocation manager > > > drm: Add a gpu page-table walker helper > > > drm/ttm: Don't print error message if eviction was interrupted > > > > > > drivers/gpu/drm/Kconfig | 5 + > > > drivers/gpu/drm/Makefile | 4 + > > > drivers/gpu/drm/amd/amdgpu/Kconfig | 1 + > > > drivers/gpu/drm/amd/amdgpu/amdgpu.h | 26 +- > > > drivers/gpu/drm/amd/amdgpu/amdgpu_debugfs.c | 14 +- > > > drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 12 +- > > > drivers/gpu/drm/amd/amdgpu/amdgpu_ib.c | 5 +- > > > drivers/gpu/drm/amd/amdgpu/amdgpu_object.h | 23 +- > > > drivers/gpu/drm/amd/amdgpu/amdgpu_ring.h | 3 +- > > > drivers/gpu/drm/amd/amdgpu/amdgpu_sa.c | 320 +----------------- > > > drivers/gpu/drm/drm_pt_walk.c | 159 +++++++++ > > > drivers/gpu/drm/drm_suballoc.c | 301 ++++++++++++++++ > > > drivers/gpu/drm/i915/Makefile | 2 +- > > > drivers/gpu/drm/i915/display/hsw_ips.c | 7 +- > > > drivers/gpu/drm/i915/display/i9xx_plane.c | 1 + > > > drivers/gpu/drm/i915/display/intel_atomic.c | 2 + > > > .../gpu/drm/i915/display/intel_atomic_plane.c | 25 +- > > > .../gpu/drm/i915/display/intel_backlight.c | 2 +- > > > drivers/gpu/drm/i915/display/intel_bios.c | 71 ++-- > > > drivers/gpu/drm/i915/display/intel_bw.c | 36 +- > > > drivers/gpu/drm/i915/display/intel_cdclk.c | 68 ++-- > > > drivers/gpu/drm/i915/display/intel_color.c | 1 + > > > drivers/gpu/drm/i915/display/intel_crtc.c | 14 +- > > > drivers/gpu/drm/i915/display/intel_cursor.c | 14 +- > > > drivers/gpu/drm/i915/display/intel_de.h | 38 +++ > > > drivers/gpu/drm/i915/display/intel_display.c | 155 +++++++-- > > > drivers/gpu/drm/i915/display/intel_display.h | 9 +- > > > .../gpu/drm/i915/display/intel_display_core.h | 5 +- > > > .../drm/i915/display/intel_display_debugfs.c | 8 + > > > .../drm/i915/display/intel_display_power.c | 40 ++- > > > .../drm/i915/display/intel_display_power.h | 6 + > > > .../i915/display/intel_display_power_map.c | 7 + > > > .../i915/display/intel_display_power_well.c | 24 +- > > > .../drm/i915/display/intel_display_reg_defs.h | 4 + > > > .../drm/i915/display/intel_display_trace.h | 6 + > > > .../drm/i915/display/intel_display_types.h | 32 +- > > > drivers/gpu/drm/i915/display/intel_dmc.c | 17 +- > > > drivers/gpu/drm/i915/display/intel_dp.c | 11 +- > > > drivers/gpu/drm/i915/display/intel_dp_aux.c | 6 + > > > drivers/gpu/drm/i915/display/intel_dpio_phy.c | 9 +- > > > drivers/gpu/drm/i915/display/intel_dpio_phy.h | 15 + > > > drivers/gpu/drm/i915/display/intel_dpll.c | 8 +- > > > drivers/gpu/drm/i915/display/intel_dpll_mgr.c | 4 + > > > drivers/gpu/drm/i915/display/intel_drrs.c | 1 + > > > drivers/gpu/drm/i915/display/intel_dsb.c | 124 +++++-- > > > drivers/gpu/drm/i915/display/intel_dsi_vbt.c | 26 +- > > > drivers/gpu/drm/i915/display/intel_fb.c | 108 ++++-- > > > drivers/gpu/drm/i915/display/intel_fb_pin.c | 6 - > > > drivers/gpu/drm/i915/display/intel_fbc.c | 49 ++- > > > drivers/gpu/drm/i915/display/intel_fbdev.c | 108 +++++- > > > .../gpu/drm/i915/display/intel_frontbuffer.c | 103 +----- > > > .../gpu/drm/i915/display/intel_frontbuffer.h | 67 +--- > > > drivers/gpu/drm/i915/display/intel_gmbus.c | 2 +- > > > drivers/gpu/drm/i915/display/intel_hdcp.c | 9 +- > > > drivers/gpu/drm/i915/display/intel_hdmi.c | 1 - > > > .../gpu/drm/i915/display/intel_lpe_audio.h | 8 + > > > .../drm/i915/display/intel_modeset_setup.c | 11 +- > > > drivers/gpu/drm/i915/display/intel_opregion.c | 2 +- > > > drivers/gpu/drm/i915/display/intel_overlay.c | 14 - > > > .../gpu/drm/i915/display/intel_pch_display.h | 16 + > > > .../gpu/drm/i915/display/intel_pch_refclk.h | 8 + > > > drivers/gpu/drm/i915/display/intel_pipe_crc.c | 1 + > > > .../drm/i915/display/intel_plane_initial.c | 3 +- > > > drivers/gpu/drm/i915/display/intel_psr.c | 1 + > > > drivers/gpu/drm/i915/display/intel_sprite.c | 21 ++ > > > drivers/gpu/drm/i915/display/intel_vbt_defs.h | 2 +- > > > drivers/gpu/drm/i915/display/intel_vga.c | 5 + > > > drivers/gpu/drm/i915/display/skl_scaler.c | 2 + > > > .../drm/i915/display/skl_universal_plane.c | 52 ++- > > > drivers/gpu/drm/i915/display/skl_watermark.c | 25 +- > > > drivers/gpu/drm/i915/gem/i915_gem_clflush.c | 4 - > > > drivers/gpu/drm/i915/gem/i915_gem_domain.c | 7 - > > > .../gpu/drm/i915/gem/i915_gem_execbuffer.c | 2 - > > > drivers/gpu/drm/i915/gem/i915_gem_object.c | 25 -- > > > drivers/gpu/drm/i915/gem/i915_gem_object.h | 22 -- > > > drivers/gpu/drm/i915/gem/i915_gem_phys.c | 4 - > > > drivers/gpu/drm/i915/gt/intel_gt_regs.h | 3 +- > > > drivers/gpu/drm/i915/i915_driver.c | 1 + > > > drivers/gpu/drm/i915/i915_gem.c | 8 - > > > drivers/gpu/drm/i915/i915_gem_gtt.c | 1 - > > > drivers/gpu/drm/i915/i915_reg_defs.h | 8 + > > > drivers/gpu/drm/i915/i915_vma.c | 12 - > > > drivers/gpu/drm/radeon/radeon.h | 55 +-- > > > drivers/gpu/drm/radeon/radeon_ib.c | 12 +- > > > drivers/gpu/drm/radeon/radeon_object.h | 25 +- > > > drivers/gpu/drm/radeon/radeon_sa.c | 314 ++--------------- > > > drivers/gpu/drm/radeon/radeon_semaphore.c | 6 +- > > > drivers/gpu/drm/scheduler/sched_main.c | 182 +++++++--- > > > drivers/gpu/drm/ttm/ttm_bo.c | 3 +- > > > drivers/misc/mei/hdcp/Kconfig | 2 +- > > > drivers/misc/mei/hdcp/mei_hdcp.c | 3 +- > > > include/drm/drm_pt_walk.h | 161 +++++++++ > > > include/drm/drm_suballoc.h | 112 ++++++ > > > include/drm/gpu_scheduler.h | 41 ++- > > > sound/hda/hdac_i915.c | 17 +- > > > sound/pci/hda/hda_intel.c | 56 +-- > > > sound/soc/intel/avs/core.c | 13 +- > > > sound/soc/sof/intel/hda.c | 7 +- > > > 98 files changed, 2076 insertions(+), 1325 deletions(-) > > > create mode 100644 drivers/gpu/drm/drm_pt_walk.c > > > create mode 100644 drivers/gpu/drm/drm_suballoc.c > > > create mode 100644 include/drm/drm_pt_walk.h > > > create mode 100644 include/drm/drm_suballoc.h > > > > > > -- > > > 2.37.3 > > > > > > > -- > > Daniel Vetter > > Software Engineer, Intel Corporation > > http://blog.ffwll.ch