[RFC,4/8] drm/i915: Refactor PAT/object cache handling

From: Tvrtko Ursulin <tvrtko.ursulin@intel.com>

From: Tvrtko Ursulin <tvrtko.ursulin@intel.com>

Commit 9275277d5324 ("drm/i915: use pat_index instead of cache_level") has
introduced PAT indices to i915 internal APIs, partially replacing the
usage of driver internal cache_level, but has also added a few sub-
optimal design decisions which this patch tries to improve upon.

Principal change here is to invert the per platform cache level to PAT
index table which was added by the referenced commit, and by doing so
enable i915 to understand the cache mode between PAT indices, changing
them from opaque to transparent.

Once we have the inverted table we are able to remove the hidden false
"return true" from i915_gem_object_has_cache_level and make the involved
code path clearer.

To achieve this we replace the enum i915_cache_level with i915_cache_t,
composed of a more detailed representation of each cache mode (base mode
plus flags).

In this way we are able to express the differences between different
write-back mode coherency settings on Meteorlake, which in turn enables us
to map the i915 "cached" mode to the correct Meteorlake PAT index.

We can also replace the platform dependent cache mode to string code in
debugfs and elsewhere by the single implementation based on i915_cache_t.

v2:
 * Fix PAT-to-cache-mode table for PVC. (Fei)
 * Cache display caching mode too. (Fei)
 * Improve and document criteria in i915_gem_object_can_bypass_llc() (Matt)

v3:
 * Checkpath issues.
 * Cache mode flags check fixed.

v4:
 * Fix intel_device_info->cache_modes array size. (Matt)
 * Boolean cache mode and flags query. (Matt)
 * Reduce number of cache macros with some macro magic.
 * One more checkpatch fix.
 * Tweak tables to show legacy and Gen12 WB is fully coherent.

Signed-off-by: Tvrtko Ursulin <tvrtko.ursulin@intel.com>
References: 9275277d5324 ("drm/i915: use pat_index instead of cache_level")
Cc: Chris Wilson <chris.p.wilson@linux.intel.com>
Cc: Fei Yang <fei.yang@intel.com>
Cc: Andi Shyti <andi.shyti@linux.intel.com>
Cc: Matt Roper <matthew.d.roper@intel.com>
---
 drivers/gpu/drm/i915/gem/i915_gem_domain.c    |  60 +++++----
 drivers/gpu/drm/i915/gem/i915_gem_domain.h    |   5 +-
 .../gpu/drm/i915/gem/i915_gem_execbuffer.c    |   3 +-
 drivers/gpu/drm/i915/gem/i915_gem_internal.c  |   2 +-
 drivers/gpu/drm/i915/gem/i915_gem_mman.c      |   4 +-
 drivers/gpu/drm/i915/gem/i915_gem_object.c    | 117 ++++++++++--------
 drivers/gpu/drm/i915/gem/i915_gem_object.h    |  11 +-
 .../gpu/drm/i915/gem/i915_gem_object_types.h  | 116 +----------------
 drivers/gpu/drm/i915/gem/i915_gem_shmem.c     |   8 +-
 drivers/gpu/drm/i915/gem/i915_gem_stolen.c    |   2 +-
 drivers/gpu/drm/i915/gem/i915_gem_ttm_move.c  |  20 +--
 drivers/gpu/drm/i915/gem/i915_gem_userptr.c   |   2 +-
 .../drm/i915/gem/selftests/huge_gem_object.c  |   2 +-
 .../gpu/drm/i915/gem/selftests/huge_pages.c   |   3 +-
 drivers/gpu/drm/i915/gt/gen8_ppgtt.c          |  10 +-
 drivers/gpu/drm/i915/gt/intel_engine_cs.c     |   2 +-
 drivers/gpu/drm/i915/gt/intel_ggtt.c          |  25 ++--
 drivers/gpu/drm/i915/gt/intel_ggtt_gmch.c     |   4 +-
 drivers/gpu/drm/i915/gt/intel_gtt.c           |   2 +-
 drivers/gpu/drm/i915/gt/intel_gtt.h           |   3 +-
 drivers/gpu/drm/i915/gt/intel_ppgtt.c         |   6 +-
 .../gpu/drm/i915/gt/intel_ring_submission.c   |   4 +-
 drivers/gpu/drm/i915/gt/intel_timeline.c      |   2 +-
 drivers/gpu/drm/i915/gt/selftest_hangcheck.c  |   2 +-
 .../gpu/drm/i915/gt/selftest_workarounds.c    |   2 +-
 drivers/gpu/drm/i915/i915_cache.c             |  89 +++++++++++--
 drivers/gpu/drm/i915/i915_cache.h             |  70 ++++++++++-
 drivers/gpu/drm/i915/i915_debugfs.c           |  53 ++------
 drivers/gpu/drm/i915/i915_driver.c            |   4 +-
 drivers/gpu/drm/i915/i915_gem.c               |  13 --
 drivers/gpu/drm/i915/i915_pci.c               |  84 +++++++------
 drivers/gpu/drm/i915/i915_perf.c              |   2 +-
 drivers/gpu/drm/i915/intel_device_info.h      |   6 +-
 .../gpu/drm/i915/selftests/i915_gem_evict.c   |   4 +-
 drivers/gpu/drm/i915/selftests/igt_spinner.c  |   2 +-
 .../gpu/drm/i915/selftests/mock_gem_device.c  |  14 +--
 36 files changed, 391 insertions(+), 367 deletions(-)

Message ID	20230727145504.1919316-5-tvrtko.ursulin@linux.intel.com (mailing list archive)
State	New, archived
Headers	show Return-Path: <intel-gfx-bounces@lists.freedesktop.org> X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from gabe.freedesktop.org (gabe.freedesktop.org [131.252.210.177]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.lore.kernel.org (Postfix) with ESMTPS id 65DCEC001DC for <intel-gfx@archiver.kernel.org>; Thu, 27 Jul 2023 14:55:42 +0000 (UTC) Received: from gabe.freedesktop.org (localhost [127.0.0.1]) by gabe.freedesktop.org (Postfix) with ESMTP id 36DDC10E5B4; Thu, 27 Jul 2023 14:55:31 +0000 (UTC) Received: from mgamail.intel.com (unknown [192.55.52.88]) by gabe.freedesktop.org (Postfix) with ESMTPS id 19FA810E5A6; Thu, 27 Jul 2023 14:55:24 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1690469724; x=1722005724; h=from:to:cc:subject:date:message-id:in-reply-to: references:mime-version:content-transfer-encoding; bh=m+mC4xmMFddeehuXIv5a4rjAzZF9/o7Q4U9kXemInrw=; b=GDgBSG5UOEuv4G30Ryn7+zDlz66MQ310o4v8sO5zHO7T5Z6pHFAIdUOz jLh8A5z2gNFLRLKNrxlKdDdgrYlhF7rtdo/fLtx3x/rwuMaUVsRnFSguY lpDjhwpEEOAxYsNBmaOE/3rWIx5pzab4P+AsTV8xu9KWeLi93Hu4de9vr ikcAZM145kTeLIGta12jlyUwjLVkH3gcRlTA/kPLgraSUssyjPKckWaBQ WdQ9/UBKxTQjNhccfDEijvP96tUE8hUYcLSt/bmrVHKJ0pMYWDTwNHBLT ZIHumPKyW92tHXyz/Urg92AN83f/WmVpvP1okfiXQjPz6Og9fzX2Rkt+I g==; X-IronPort-AV: E=McAfee;i="6600,9927,10784"; a="399268405" X-IronPort-AV: E=Sophos;i="6.01,235,1684825200"; d="scan'208";a="399268405" Received: from fmsmga001.fm.intel.com ([10.253.24.23]) by fmsmga101.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 27 Jul 2023 07:55:23 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="6.01,202,1684825200"; d="scan'208";a="870433734" Received: from jlenehan-mobl1.ger.corp.intel.com (HELO localhost.localdomain) ([10.213.228.208]) by fmsmga001-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 27 Jul 2023 07:55:23 -0700 From: Tvrtko Ursulin <tvrtko.ursulin@linux.intel.com> To: Intel-gfx@lists.freedesktop.org, dri-devel@lists.freedesktop.org Date: Thu, 27 Jul 2023 15:55:00 +0100 Message-Id: <20230727145504.1919316-5-tvrtko.ursulin@linux.intel.com> X-Mailer: git-send-email 2.39.2 In-Reply-To: <20230727145504.1919316-1-tvrtko.ursulin@linux.intel.com> References: <20230727145504.1919316-1-tvrtko.ursulin@linux.intel.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Subject: [Intel-gfx] [RFC 4/8] drm/i915: Refactor PAT/object cache handling X-BeenThere: intel-gfx@lists.freedesktop.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Intel graphics driver community testing & development <intel-gfx.lists.freedesktop.org> List-Unsubscribe: <https://lists.freedesktop.org/mailman/options/intel-gfx>, <mailto:intel-gfx-request@lists.freedesktop.org?subject=unsubscribe> List-Archive: <https://lists.freedesktop.org/archives/intel-gfx> List-Post: <mailto:intel-gfx@lists.freedesktop.org> List-Help: <mailto:intel-gfx-request@lists.freedesktop.org?subject=help> List-Subscribe: <https://lists.freedesktop.org/mailman/listinfo/intel-gfx>, <mailto:intel-gfx-request@lists.freedesktop.org?subject=subscribe> Cc: Matt Roper <matthew.d.roper@intel.com>, Chris Wilson <chris.p.wilson@linux.intel.com> Errors-To: intel-gfx-bounces@lists.freedesktop.org Sender: "Intel-gfx" <intel-gfx-bounces@lists.freedesktop.org>
Series	Another take on PAT/object cache mode refactoring \| expand [RFC,0/8] Another take on PAT/object cache mode refactoring [RFC,1/8] drm/i915: Skip clflush after GPU writes on Meteorlake [RFC,2/8] drm/i915: Split PTE encode between Gen12 and Meteorlake [RFC,3/8] drm/i915: Cache PAT index used by the driver [RFC,4/8] drm/i915: Refactor PAT/object cache handling [RFC,5/8] drm/i915: Improve the vm_fault_gtt user PAT index restriction [RFC,6/8] drm/i915: Lift the user PAT restriction from gpu_write_needs_clflush [RFC,7/8] drm/i915: Lift the user PAT restriction from use_cpu_reloc [RFC,8/8] drm/i915: Refine the caching check in i915_gem_object_can_bypass_llc

[RFC,4/8] drm/i915: Refactor PAT/object cache handling

Commit Message

Comments

Patch