[v3,16/37] drm/i915/lmem: support CPU relocations

Message ID: 20190809222643.23142-17-matthew.auld@intel.com (mailing list archive)
State: New, archived
Headers:
Return-Path: <dri-devel-bounces@lists.freedesktop.org>
From: Matthew Auld <matthew.auld@intel.com>
To: intel-gfx@lists.freedesktop.org
Subject: [PATCH v3 16/37] drm/i915/lmem: support CPU relocations
Date: Fri, 9 Aug 2019 23:26:22 +0100
Message-Id: <20190809222643.23142-17-matthew.auld@intel.com>
In-Reply-To: <20190809222643.23142-1-matthew.auld@intel.com>
References: <20190809222643.23142-1-matthew.auld@intel.com>
MIME-Version: 1.0
Precedence: list
Cc: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>,
 dri-devel@lists.freedesktop.org, Rodrigo Vivi <rodrigo.vivi@intel.com>
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: base64
Errors-To: dri-devel-bounces@lists.freedesktop.org
Sender: "dri-devel" <dri-devel-bounces@lists.freedesktop.org>
Series: Introduce memory region concept (including device local memory)
[v3,00/37] Introduce memory region concept (including device local memory)
[v3,01/37] drm/i915: buddy allocator
[v3,02/37] drm/i915: introduce intel_memory_region
[v3,03/37] drm/i915/region: support basic eviction
[v3,04/37] drm/i915/region: support continuous allocations
[v3,05/37] drm/i915/region: support volatile objects
[v3,06/37] drm/i915: Add memory region information to device_info
[v3,07/37] drm/i915: support creating LMEM objects
[v3,08/37] drm/i915: setup io-mapping for LMEM
[v3,09/37] drm/i915/lmem: support kernel mapping
[v3,10/37] drm/i915/blt: don't assume pinned intel_context
[v3,11/37] drm/i915/blt: bump size restriction
[v3,12/37] drm/i915/blt: support copying objects
[v3,13/37] drm/i915/selftests: move gpu-write-dw into utils
[v3,14/37] drm/i915/selftests: add write-dword test for LMEM
[v3,15/37] drm/i915/selftest: extend coverage to include LMEM huge-pages
[v3,16/37] drm/i915/lmem: support CPU relocations
[v3,17/37] drm/i915/lmem: support pread
[v3,18/37] drm/i915/lmem: support pwrite
[v3,19/37] drm/i915: enumerate and init each supported region
[v3,20/37] drm/i915: treat shmem as a region
[v3,21/37] drm/i915: treat stolen as a region
[v3,22/37] drm/i915: define HAS_MAPPABLE_APERTURE
[v3,23/37] drm/i915: do not map aperture if it is not available.
[v3,24/37] drm/i915: set num_fence_regs to 0 if there is no aperture
[v3,25/37] drm/i915/selftests: check for missing aperture
[v3,26/37] drm/i915: error capture with no ggtt slot
[v3,27/37] drm/i915: Don't try to place HWS in non-existing mappable region
[v3,28/37] drm/i915: check for missing aperture in insert_mappable_node
[v3,29/37] drm/i915: Allow i915 to manage the vma offset nodes instead of drm core
[v3,30/37] drm/i915: Introduce DRM_I915_GEM_MMAP_OFFSET
[v3,31/37] drm/i915/lmem: add helper to get CPU accessible offset
[v3,32/37] drm/i915: Add cpu and lmem fault handlers
[v3,33/37] drm/i915: cpu-map based dumb buffers
[v3,34/37] drm/i915: support basic object migration
[v3,35/37] drm/i915: Introduce GEM_OBJECT_SETPARAM with I915_PARAM_MEMORY_REGION
[v3,36/37] drm/i915/query: Expose memory regions through the query uAPI
[v3,37/37] HAX drm/i915: add the fake lmem region

Commit Message

Matthew Auld Aug. 9, 2019, 10:26 p.m. UTC

Add LMEM support for the CPU reloc path. Relocations can go through
either a GPU or a CPU reloc path, and there are debugging options to
force a particular path. The GPU reloc path is preferred when the object
is not currently idle; otherwise we use the CPU reloc path. Since we
can't kmap an LMEM object, and the mappable aperture might not be
available, add support for mapping it through LMEMBAR.

Signed-off-by: Matthew Auld <matthew.auld@intel.com>
Cc: Joonas Lahtinen <joonas.lahtinen@linux.intel.com>
Cc: Abdiel Janulgue <abdiel.janulgue@linux.intel.com>
Cc: Rodrigo Vivi <rodrigo.vivi@intel.com>
---
 .../gpu/drm/i915/gem/i915_gem_execbuffer.c    | 55 +++++++++++++++++--
 1 file changed, 51 insertions(+), 4 deletions(-)
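
The path selection described in the commit message can be modelled in a few lines of userspace C. This is only a sketch under stated assumptions: `reloc_obj_model`, `reloc_force`, `reloc_path` and `pick_reloc_path()` are hypothetical names that do not exist in i915, the debug options are reduced to a three-value enum, and the kmap/GGTT choice is collapsed into a single `can_kmap` flag.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical model types; none of these names exist in the driver. */
enum reloc_path {
	RELOC_GPU,            /* relocation written by the GPU */
	RELOC_CPU_KMAP,       /* CPU write through a kmap of a struct page */
	RELOC_CPU_GGTT_IOMAP, /* CPU write through the mappable aperture */
	RELOC_CPU_LMEM_IOMAP, /* CPU write through LMEMBAR */
};

enum reloc_force { FORCE_NONE, FORCE_GPU, FORCE_CPU };

struct reloc_obj_model {
	bool busy;     /* object is not currently idle */
	bool is_lmem;  /* object lives in device local memory */
	bool can_kmap; /* object is backed by kmap-able pages */
};

enum reloc_path pick_reloc_path(enum reloc_force force,
				const struct reloc_obj_model *obj)
{
	/* Assumption taken from the commit message: LMEM cannot be kmapped. */
	assert(!(obj->is_lmem && obj->can_kmap));

	/* The GPU path is preferred while the object is busy, or when forced. */
	if (force == FORCE_GPU || (force == FORCE_NONE && obj->busy))
		return RELOC_GPU;

	/* CPU path: LMEM goes through LMEMBAR, not kmap or the aperture. */
	if (obj->is_lmem)
		return RELOC_CPU_LMEM_IOMAP;

	return obj->can_kmap ? RELOC_CPU_KMAP : RELOC_CPU_GGTT_IOMAP;
}
```

In this model an idle LMEM object always ends up on the LMEMBAR mapping, which is the case the patch adds; the real driver makes the kmap versus aperture decision with more inputs than shown here.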

Comments

Chris Wilson Aug. 10, 2019, 10:50 a.m. UTC | #1

Quoting Matthew Auld (2019-08-09 23:26:22)
> @@ -1017,10 +1020,14 @@ static void reloc_cache_reset(struct reloc_cache *cache)
>         } else {
>                 struct i915_ggtt *ggtt = cache_to_ggtt(cache);
>  
> -               intel_gt_flush_ggtt_writes(ggtt->vm.gt);
> +               if (!cache->is_lmem)
> +                       intel_gt_flush_ggtt_writes(ggtt->vm.gt);

I love an optimist. At the least you might need the wmb(). But we have
yet to see how many mistakes they've carried over into the new
implementation ;)
-Chris
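
One possible reading of the comment above is that the LMEM branch should still order the CPU writes before the mapping is torn down, even though the GGTT flush is skipped. The userspace mock below only records the order of steps under that reading. It is an assumption, not the change that was eventually applied: `reset_trace`, `trace_step()` and `model_cache_reset()` are hypothetical, and the C11 fence is merely a placeholder, since it is not equivalent to the kernel's `wmb()` for write-combined I/O mappings.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical step markers used to record what the mock did. */
enum reset_step { STEP_GGTT_FLUSH, STEP_WRITE_BARRIER, STEP_UNMAP };

struct reset_trace {
	enum reset_step steps[4];
	int count;
};

void trace_step(struct reset_trace *t, enum reset_step s)
{
	assert(t->count < 4);
	t->steps[t->count++] = s;
}

/* Mock of the non-kmap branch of a reloc cache reset. */
void model_cache_reset(struct reset_trace *t, bool is_lmem)
{
	if (!is_lmem) {
		/* Stands in for intel_gt_flush_ggtt_writes(). */
		trace_step(t, STEP_GGTT_FLUSH);
	} else {
		/* Placeholder for wmb(); NOT equivalent for WC I/O memory. */
		atomic_thread_fence(memory_order_release);
		trace_step(t, STEP_WRITE_BARRIER);
	}

	/* Stands in for io_mapping_unmap_atomic(). */
	trace_step(t, STEP_UNMAP);
}
```

The point of the mock is only the ordering: in both branches some form of write ordering happens before the unmap step.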

diff --git a/drivers/gpu/drm/i915/gem/i915_gem_execbuffer.c b/drivers/gpu/drm/i915/gem/i915_gem_execbuffer.c
index 2fa08357944e..d70b3e6dc12d 100644
--- a/drivers/gpu/drm/i915/gem/i915_gem_execbuffer.c
+++ b/drivers/gpu/drm/i915/gem/i915_gem_execbuffer.c
@@ -15,6 +15,7 @@ 
 #include "display/intel_frontbuffer.h"
 
 #include "gem/i915_gem_ioctls.h"
+#include "gem/i915_gem_lmem.h"
 #include "gt/intel_context.h"
 #include "gt/intel_engine_pool.h"
 #include "gt/intel_gt.h"
@@ -251,6 +252,7 @@  struct i915_execbuffer {
 		bool has_llc : 1;
 		bool has_fence : 1;
 		bool needs_unfenced : 1;
+		bool is_lmem : 1;
 
 		struct i915_request *rq;
 		u32 *rq_cmd;
@@ -959,6 +961,7 @@  static void reloc_cache_init(struct reloc_cache *cache,
 	cache->use_64bit_reloc = HAS_64BIT_RELOC(i915);
 	cache->has_fence = cache->gen < 4;
 	cache->needs_unfenced = INTEL_INFO(i915)->unfenced_needs_alignment;
+	cache->is_lmem = false;
 	cache->node.allocated = false;
 	cache->rq = NULL;
 	cache->rq_size = 0;
@@ -1017,10 +1020,14 @@  static void reloc_cache_reset(struct reloc_cache *cache)
 	} else {
 		struct i915_ggtt *ggtt = cache_to_ggtt(cache);
 
-		intel_gt_flush_ggtt_writes(ggtt->vm.gt);
+		if (!cache->is_lmem)
+			intel_gt_flush_ggtt_writes(ggtt->vm.gt);
 		io_mapping_unmap_atomic((void __iomem *)vaddr);
 
-		if (cache->node.allocated) {
+		if (cache->is_lmem) {
+			i915_gem_object_unpin_pages((struct drm_i915_gem_object *)cache->node.mm);
+			cache->is_lmem = false;
+		} else if (cache->node.allocated) {
 			ggtt->vm.clear_range(&ggtt->vm,
 					     cache->node.start,
 					     cache->node.size);
@@ -1066,6 +1073,42 @@  static void *reloc_kmap(struct drm_i915_gem_object *obj,
 	return vaddr;
 }
 
+static void *reloc_lmem(struct drm_i915_gem_object *obj,
+			struct reloc_cache *cache,
+			unsigned long page)
+{
+	void *vaddr;
+	int err;
+
+	GEM_BUG_ON(use_cpu_reloc(cache, obj));
+
+	if (cache->vaddr) {
+		io_mapping_unmap_atomic((void __force __iomem *) unmask_page(cache->vaddr));
+	} else {
+		err = i915_gem_object_pin_pages(obj);
+		if (err)
+			return ERR_PTR(err);
+
+		i915_gem_object_lock(obj);
+		err = i915_gem_object_set_to_wc_domain(obj, true);
+		i915_gem_object_unlock(obj);
+		if (err) {
+			i915_gem_object_unpin_pages(obj);
+			return ERR_PTR(err);
+		}
+
+		cache->node.mm = (void *)obj;
+		cache->is_lmem = true;
+	}
+
+	vaddr = i915_gem_object_lmem_io_map_page_atomic(obj, page);
+
+	cache->vaddr = (unsigned long)vaddr;
+	cache->page = page;
+
+	return vaddr;
+}
+
 static void *reloc_iomap(struct drm_i915_gem_object *obj,
 			 struct reloc_cache *cache,
 			 unsigned long page)
@@ -1142,8 +1185,12 @@  static void *reloc_vaddr(struct drm_i915_gem_object *obj,
 		vaddr = unmask_page(cache->vaddr);
 	} else {
 		vaddr = NULL;
-		if ((cache->vaddr & KMAP) == 0)
-			vaddr = reloc_iomap(obj, cache, page);
+		if ((cache->vaddr & KMAP) == 0) {
+			if (i915_gem_object_is_lmem(obj))
+				vaddr = reloc_lmem(obj, cache, page);
+			else
+				vaddr = reloc_iomap(obj, cache, page);
+		}
 		if (!vaddr)
 			vaddr = reloc_kmap(obj, cache, page);
 	}
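
The pin and mapping bookkeeping that reloc_lmem() and reloc_cache_reset() share in the diff above can be checked with a small userspace model. This is a sketch under assumptions: `lmem_obj_model`, `lmem_cache_model`, `model_map_page()` and `model_reset()` are hypothetical, counters replace the real pin and io-mapping calls, error paths and the write-domain change are left out, and the cache is assumed to serve one object between resets.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical counters replacing the real pin and io-mapping state. */
struct lmem_obj_model {
	int pin_count; /* outstanding page pins */
	int live_maps; /* outstanding atomic mappings */
};

struct lmem_cache_model {
	struct lmem_obj_model *obj; /* plays the role of cache->node.mm */
	bool has_map;               /* plays the role of cache->vaddr != 0 */
	bool is_lmem;
	unsigned long page;
};

/* Mirrors reloc_lmem(): pin once, then swap the mapped page on each call. */
void model_map_page(struct lmem_cache_model *c, struct lmem_obj_model *o,
		    unsigned long page)
{
	if (c->has_map) {
		/* A page is already mapped: drop it, keep the pin. */
		assert(c->obj == o);
		c->obj->live_maps--;
	} else {
		/* First use: pin the pages and remember the object. */
		o->pin_count++;
		c->obj = o;
		c->is_lmem = true;
	}

	o->live_maps++;
	c->has_map = true;
	c->page = page;
}

/* Mirrors the LMEM branch of reloc_cache_reset(): unmap, then unpin. */
void model_reset(struct lmem_cache_model *c)
{
	if (!c->has_map)
		return;

	c->obj->live_maps--;
	c->has_map = false;

	if (c->is_lmem) {
		c->obj->pin_count--;
		c->is_lmem = false;
		c->obj = NULL;
	}
}
```

Mapping several pages in a row keeps a single pin and a single live mapping, and a reset brings both counters back to zero, which is the balance the patch relies on.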
