From patchwork Fri Feb 17 13:44:06 2023 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Patchwork-Submitter: Danilo Krummrich X-Patchwork-Id: 13144720 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id B8DE5C05027 for ; Fri, 17 Feb 2023 13:44:37 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id EC7FE6B0072; Fri, 17 Feb 2023 08:44:36 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id E51046B0073; Fri, 17 Feb 2023 08:44:36 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id CCAB36B0074; Fri, 17 Feb 2023 08:44:36 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0011.hostedemail.com [216.40.44.11]) by kanga.kvack.org (Postfix) with ESMTP id B6F496B0072 for ; Fri, 17 Feb 2023 08:44:36 -0500 (EST) Received: from smtpin12.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay10.hostedemail.com (Postfix) with ESMTP id 6CEEDC05E3 for ; Fri, 17 Feb 2023 13:44:36 +0000 (UTC) X-FDA: 80476903752.12.112887B Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) by imf02.hostedemail.com (Postfix) with ESMTP id B7B5480016 for ; Fri, 17 Feb 2023 13:44:32 +0000 (UTC) Authentication-Results: imf02.hostedemail.com; dkim=pass header.d=redhat.com header.s=mimecast20190719 header.b=TDmQgmux; dmarc=pass (policy=none) header.from=redhat.com; spf=pass (imf02.hostedemail.com: domain of dakr@redhat.com designates 170.10.133.124 as permitted sender) smtp.mailfrom=dakr@redhat.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1676641472; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding:in-reply-to: references:dkim-signature; bh=LwLShPp83DT/SjdrQ8e0j5FoWk8xRUBiFNvXjirXtcY=; b=7gahE8B5HYoXeLiJhY2yeGCEGde8c0W5BzlEJDH7/LFp5M6PEGHnZ5wj5SOjtkoJDZ0bMV jIxZqRuoIypF7gUn7mzaTiMvuVfKILPQXOk43NeVA6iBw03LiZJIBvKlVyvcL6Ms5Sb0c7 lN+VZ/1K5zVmHo3wYRtS6iagzlF5FEI= ARC-Authentication-Results: i=1; imf02.hostedemail.com; dkim=pass header.d=redhat.com header.s=mimecast20190719 header.b=TDmQgmux; dmarc=pass (policy=none) header.from=redhat.com; spf=pass (imf02.hostedemail.com: domain of dakr@redhat.com designates 170.10.133.124 as permitted sender) smtp.mailfrom=dakr@redhat.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1676641472; a=rsa-sha256; cv=none; b=Rj8kM3sxidIBBRalNo+QNm7QCB/SyZWgzyAabo6of/xYRqqtx+yl9x0tkLR/W7S8+RSIeo 1KSmCNovhs6l/yqQKCC1G0UfQA43mY/xS5BWqwNjcS8/zWFd4GCkqyr0nFHk2C8PfzeRer ptMAZtRLgiGk6rod5X3fDgOgyFvSLOs= DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1676641472; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding; bh=LwLShPp83DT/SjdrQ8e0j5FoWk8xRUBiFNvXjirXtcY=; b=TDmQgmux8g7X8Z0KLEEbt0CvNXgeQTaCSCyFf6HKk/Ll4E6sRYaKgSxdxDkV8VcfkrPz7G 4tnemQ27TPgzPGfJ1hV+U3atMT5H/cppUxqnntP7JqcyVSfFKlmrAeg78MYXv+E/FFlj62 0TswPZRp4fze9BDTGQdBf/op/5zCMiw= Received: from mail-ed1-f71.google.com (mail-ed1-f71.google.com [209.85.208.71]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.3, cipher=TLS_AES_128_GCM_SHA256) id us-mta-206-QL9MiAZ6OQG895RM8gPj-g-1; Fri, 17 Feb 2023 08:44:30 -0500 X-MC-Unique: QL9MiAZ6OQG895RM8gPj-g-1 Received: by mail-ed1-f71.google.com with SMTP id cb12-20020a0564020b6c00b0046790cd9082so2146112edb.21 for ; Fri, 17 Feb 2023 05:44:30 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=content-transfer-encoding:mime-version:message-id:date:subject:cc :to:from:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=LwLShPp83DT/SjdrQ8e0j5FoWk8xRUBiFNvXjirXtcY=; b=tgVdrnRaDrq9qu9+mNHBQP3H+d0oSRFZatD1PW7tc4nFIXlXk+x824tMiMMSTiwWFF agotkaqVwTCANE0AIfFbPddYnsCHrmHqnyhvvZ6LJw2NLF8uI41RU/f0Cu8FxTOk14+U Op+W67fgsVYaOjS1z6q4pRisod9Mhj5SDevqOzsKvuS8iN2+mNaiDAP8tZUXxERISheH PhwVkEsloo9jbir8PFMgN8WcxhBkw3veHKZR9E5ygU4atepsXgC1y0gtknz2j1q/d5SP z9iy3SEkouUuUnW1rMMHk7SLzmT0PKvx+VgB4PLCvlP07wremaMTfcrvygoOtPYS7+K6 XPsQ== X-Gm-Message-State: AO0yUKUebYhZ3f+eI85LfZoX6upE5xVPX6Mn8TC4TZjMWarqZTZExuCJ i6NKcbcA19SsRD//s6wzeKX7MEfIJljUblD2vvEXDiMk0RZEn65XT3d5YJ1AueFoRHfg12L9s7N nANToCAaLyNg= X-Received: by 2002:a17:906:d104:b0:8b1:279c:82c7 with SMTP id b4-20020a170906d10400b008b1279c82c7mr855066ejz.50.1676641469593; Fri, 17 Feb 2023 05:44:29 -0800 (PST) X-Google-Smtp-Source: AK7set9bwPc8c8ahszSaP/wQTMd4R9rpXWxJ4wDJVBZN063mM6nFIe9FLzQVWxLMjS8msuQDnJMLeg== X-Received: by 2002:a17:906:d104:b0:8b1:279c:82c7 with SMTP id b4-20020a170906d10400b008b1279c82c7mr855038ejz.50.1676641469294; Fri, 17 Feb 2023 05:44:29 -0800 (PST) Received: from cassiopeiae.. ([2a02:810d:4b3f:de78:642:1aff:fe31:a19f]) by smtp.gmail.com with ESMTPSA id s24-20020a508d18000000b004acb2c8aeaesm2326370eds.85.2023.02.17.05.44.27 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 17 Feb 2023 05:44:28 -0800 (PST) From: Danilo Krummrich To: airlied@gmail.com, daniel@ffwll.ch, tzimmermann@suse.de, mripard@kernel.org, corbet@lwn.net, christian.koenig@amd.com, bskeggs@redhat.com, Liam.Howlett@oracle.com, matthew.brost@intel.com, boris.brezillon@collabora.com, alexdeucher@gmail.com, ogabbay@kernel.org, bagasdotme@gmail.com, willy@infradead.org, jason@jlekstrand.net Cc: dri-devel@lists.freedesktop.org, nouveau@lists.freedesktop.org, linux-doc@vger.kernel.org, linux-mm@kvack.org, linux-kernel@vger.kernel.org, Danilo Krummrich Subject: [PATCH drm-next v2 00/16] [RFC] DRM GPUVA Manager & Nouveau VM_BIND UAPI Date: Fri, 17 Feb 2023 14:44:06 +0100 Message-Id: <20230217134422.14116-1-dakr@redhat.com> X-Mailer: git-send-email 2.39.1 MIME-Version: 1.0 X-Mimecast-Spam-Score: 0 X-Mimecast-Originator: redhat.com X-Rspamd-Queue-Id: B7B5480016 X-Rspamd-Server: rspam09 X-Rspam-User: X-Stat-Signature: aimk9n948kyxt47p6s73h3aptawznb79 X-HE-Tag: 1676641472-669723 X-HE-Meta: U2FsdGVkX18dHNP57ZIFw669HHrdi9+ZvHWPmC/gBBiMW474K3lzdCsnjk28KuTPSkdGekoAeuIMOAM+VzSeh5Us0YMHWI20mRwcBRW9xblr999xP3ncA6NIOGJp1QuMefbv5BmLc4olvazQr1Y1KD8SE6Fk/zz/BCYuUBxy1WVP1BL1/lB8jbMbXeVhXrD5Z/ooN0ltQqK708y1fFpMnFaTlsxt9VbtUQ2K9wA0WcYuSBV4ssDeCitm5LmnH02mhjdmrwdfX0ecsjpxLKIVDSSJPRKitxkLh1Z0i7czTDojzQHimTWnGI/ZwCTkom9AaO9IHi9AdQNTPAxFtJi1VhkxwLVXjT85gwRH3ffyKdQqEEOwwTGCfIOBOcJ6oseZFrZEp3ODS7lIYdU68ubnsHN4o199ABC04NK9yFaiDOOS267rZ55mESDQkiaz53o9YdOAp3uPA9u02MMMTOU59Og+h6jAnVVAwbIL2XyKtQXCe6ts3L4DDfBdUdFZL6SVEacK0lMnjE5JHdZYW7iufldH2mbvTprN8Gnk8SGxLC/dNRBa/qFBQ11Y7w3+5f3XtfIaunGPI/V/NTTDuYaa3FDZIs4uHioG0bRSTDMwUZPZDSXCyhPr4G6ruh0IWdUJpfOv4wVt5YpI/gpFEabDwUmPXBw30VfcEc02wpVqsLWztHs0tB6zgS3Z16d0vdzhyLEM6a5+sfv/jHIEtwOmZZYF12MppNEMjfnPIiN2mWZV1ixsALox0qZSHunShSReLKo/izzrZE4w0qiNuZyGkSG3R8PvckO/8y6yQQYRFF3HMU/xvUej5p/4nQl0tY+n4RPhgNo5x41fiz/KyD/SVqb7MI90nk0afG55IJh3aA0JXlTQS8hr23QfKQNobBu4v/2l0WPg1MYgYTNhGfbkEaDTQ+pJaqg4gz8RIxXh5UHqfWaTCHfBLnwjDq6BtA1cA/+8dcM5meMLxOMswLB +CvUFxn0 GRwR0cJDI2mjjQ4zOxHvCEELur4Legntrg/p08i3XBElJZfxFGHWNtHY7wJC++h2Iwflej6soWppqcJLKsngqwDh4FhoNCxxMxOeae1mRF9GzcCrD9H05IHUP4LKOwjeAWo1UAC7gw+LVEi+f9m4KxXSmMEPCu21/1JLKDCWUX/VvmyK+XWFeprQmf7csPN64YRkqNoahuyef7FlGuHLYwD2hjJXltjNVkaCsB5Alx1KYMMf/9yc6rKq8khPRNI0waO23vR5AM9M3q4FRMxsewJYGw5D9SFIUhzLNrGQaOHhFRamI3pnJCVTDgpaK6f1Y9aj1PlZNCUTcQk1y+BN8t1aZmUk9EbZprwD5xqvqzcU6SuplFgXxtvp2u0tHHqYgBFUnOnKKgJ+sxTyWsizJOS4GprBkG65nLg+F0xeIO6mAsFQYPbz30gTFTf/C9SMojO76lkOifa0i9LINid1j2rPUTzqODpMBRVPj X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: This patch series provides a new UAPI for the Nouveau driver in order to support Vulkan features, such as sparse bindings and sparse residency. Furthermore, with the DRM GPUVA manager it provides a new DRM core feature to keep track of GPU virtual address (VA) mappings in a more generic way. The DRM GPUVA manager is indented to help drivers implement userspace-manageable GPU VA spaces in reference to the Vulkan API. In order to achieve this goal it serves the following purposes in this context. 1) Provide infrastructure to track GPU VA allocations and mappings, making use of the maple_tree. 2) Generically connect GPU VA mappings to their backing buffers, in particular DRM GEM objects. 3) Provide a common implementation to perform more complex mapping operations on the GPU VA space. In particular splitting and merging of GPU VA mappings, e.g. for intersecting mapping requests or partial unmap requests. The new VM_BIND Nouveau UAPI build on top of the DRM GPUVA manager, itself providing the following new interfaces. 1) Initialize a GPU VA space via the new DRM_IOCTL_NOUVEAU_VM_INIT ioctl for UMDs to specify the portion of VA space managed by the kernel and userspace, respectively. 2) Allocate and free a VA space region as well as bind and unbind memory to the GPUs VA space via the new DRM_IOCTL_NOUVEAU_VM_BIND ioctl. 3) Execute push buffers with the new DRM_IOCTL_NOUVEAU_EXEC ioctl. Both, DRM_IOCTL_NOUVEAU_VM_BIND and DRM_IOCTL_NOUVEAU_EXEC, make use of the DRM scheduler to queue jobs and support asynchronous processing with DRM syncobjs as synchronization mechanism. By default DRM_IOCTL_NOUVEAU_VM_BIND does synchronous processing, DRM_IOCTL_NOUVEAU_EXEC supports asynchronous processing only. The new VM_BIND UAPI for Nouveau makes also use of drm_exec (execution context for GEM buffers) by Christian König. Since the patch implementing drm_exec was not yet merged into drm-next it is part of this series, as well as a small fix for this patch, which was found while testing this series. This patch series is also available at [1]. There is a Mesa NVK merge request by Dave Airlie [2] implementing the corresponding userspace parts for this series. The Vulkan CTS test suite passes the sparse binding and sparse residency test cases for the new UAPI together with Dave's Mesa work. There are also some test cases in the igt-gpu-tools project [3] for the new UAPI and hence the DRM GPU VA manager. However, most of them are testing the DRM GPU VA manager's logic through Nouveau's new UAPI and should be considered just as helper for implementation. However, I absolutely intend to change those test cases to proper kunit test cases for the DRM GPUVA manager, once and if we agree on it's usefulness and design. [1] https://gitlab.freedesktop.org/nouvelles/kernel/-/tree/new-uapi-drm-next / https://gitlab.freedesktop.org/nouvelles/kernel/-/merge_requests/1 [2] https://gitlab.freedesktop.org/nouveau/mesa/-/merge_requests/150/ [3] https://gitlab.freedesktop.org/dakr/igt-gpu-tools/-/tree/wip_nouveau_vm_bind Changes in V2: ============== Nouveau: - Reworked the Nouveau VM_BIND UAPI to avoid memory allocations in fence signalling critical sections. Updates to the VA space are split up in three separate stages, where only the 2. stage executes in a fence signalling critical section: 1. update the VA space, allocate new structures and page tables 2. (un-)map the requested memory bindings 3. free structures and page tables - Separated generic job scheduler code from specific job implementations. - Separated the EXEC and VM_BIND implementation of the UAPI. - Reworked the locking parts of the nvkm/vmm RAW interface, such that (un-)map operations can be executed in fence signalling critical sections. GPUVA Manager: - made drm_gpuva_regions optional for users of the GPUVA manager - allow NULL GEMs for drm_gpuva entries - swichted from drm_mm to maple_tree for track drm_gpuva / drm_gpuva_region entries - provide callbacks for users to allocate custom drm_gpuva_op structures to allow inheritance - added user bits to drm_gpuva_flags - added a prefetch operation type in order to support generating prefetch operations in the same way other operations generated - hand the responsibility for mutual exclusion for a GEM's drm_gpuva list to the user; simplified corresponding (un-)link functions Maple Tree: - I added two maple tree patches to the series, one to support custom tree walk macros and one to hand the locking responsibility to the user of the GPUVA manager without pre-defined lockdep checks. TODO ==== Maple Tree: - Maple tree uses the 'unsinged long' type for node entries. While this works for 64bit, it's incompatible with the DRM GPUVA Manager on 32bit, since the DRM GPUVA Manager uses the u64 type and so do drivers using it. While it's questionable whether a 32bit kernel and a > 32bit GPU address space make any sense, it creates tons of compiler warnings when compiling for 32bit. Maybe it makes sense to expand the maple tree API to let users decide which size to pick - other ideas / proposals are welcome. Christian König (1): drm: execution context for GEM buffers Danilo Krummrich (15): drm/exec: fix memory leak in drm_exec_prepare_obj() maple_tree: split up MA_STATE() macro maple_tree: add flag MT_FLAGS_LOCK_NONE drm: manager to keep track of GPUs VA mappings drm: debugfs: provide infrastructure to dump a DRM GPU VA space drm/nouveau: new VM_BIND uapi interfaces drm/nouveau: get vmm via nouveau_cli_vmm() drm/nouveau: bo: initialize GEM GPU VA interface drm/nouveau: move usercopy helpers to nouveau_drv.h drm/nouveau: fence: fail to emit when fence context is killed drm/nouveau: chan: provide nouveau_channel_kill() drm/nouveau: nvkm/vmm: implement raw ops to manage uvmm drm/nouveau: implement uvmm for user mode bindings drm/nouveau: implement new VM_BIND UAPI drm/nouveau: debugfs: implement DRM GPU VA debugfs Documentation/gpu/driver-uapi.rst | 11 + Documentation/gpu/drm-mm.rst | 43 + drivers/gpu/drm/Kconfig | 6 + drivers/gpu/drm/Makefile | 3 + drivers/gpu/drm/amd/amdgpu/Kconfig | 1 + drivers/gpu/drm/drm_debugfs.c | 56 + drivers/gpu/drm/drm_exec.c | 294 +++ drivers/gpu/drm/drm_gem.c | 3 + drivers/gpu/drm/drm_gpuva_mgr.c | 1704 +++++++++++++++++ drivers/gpu/drm/nouveau/Kbuild | 3 + drivers/gpu/drm/nouveau/Kconfig | 2 + drivers/gpu/drm/nouveau/include/nvif/if000c.h | 26 +- drivers/gpu/drm/nouveau/include/nvif/vmm.h | 19 +- .../gpu/drm/nouveau/include/nvkm/subdev/mmu.h | 20 +- drivers/gpu/drm/nouveau/nouveau_abi16.c | 23 + drivers/gpu/drm/nouveau/nouveau_abi16.h | 1 + drivers/gpu/drm/nouveau/nouveau_bo.c | 152 +- drivers/gpu/drm/nouveau/nouveau_bo.h | 2 +- drivers/gpu/drm/nouveau/nouveau_chan.c | 16 +- drivers/gpu/drm/nouveau/nouveau_chan.h | 1 + drivers/gpu/drm/nouveau/nouveau_debugfs.c | 24 + drivers/gpu/drm/nouveau/nouveau_drm.c | 26 +- drivers/gpu/drm/nouveau/nouveau_drv.h | 92 +- drivers/gpu/drm/nouveau/nouveau_exec.c | 322 ++++ drivers/gpu/drm/nouveau/nouveau_exec.h | 39 + drivers/gpu/drm/nouveau/nouveau_fence.c | 7 + drivers/gpu/drm/nouveau/nouveau_fence.h | 2 +- drivers/gpu/drm/nouveau/nouveau_gem.c | 57 +- drivers/gpu/drm/nouveau/nouveau_mem.h | 5 + drivers/gpu/drm/nouveau/nouveau_prime.c | 2 +- drivers/gpu/drm/nouveau/nouveau_sched.c | 467 +++++ drivers/gpu/drm/nouveau/nouveau_sched.h | 96 + drivers/gpu/drm/nouveau/nouveau_svm.c | 2 +- drivers/gpu/drm/nouveau/nouveau_uvmm.c | 1536 +++++++++++++++ drivers/gpu/drm/nouveau/nouveau_uvmm.h | 138 ++ drivers/gpu/drm/nouveau/nouveau_vmm.c | 4 +- drivers/gpu/drm/nouveau/nvif/vmm.c | 100 +- .../gpu/drm/nouveau/nvkm/subdev/mmu/uvmm.c | 213 ++- drivers/gpu/drm/nouveau/nvkm/subdev/mmu/vmm.c | 197 +- drivers/gpu/drm/nouveau/nvkm/subdev/mmu/vmm.h | 25 + .../drm/nouveau/nvkm/subdev/mmu/vmmgf100.c | 16 +- .../drm/nouveau/nvkm/subdev/mmu/vmmgp100.c | 16 +- .../gpu/drm/nouveau/nvkm/subdev/mmu/vmmnv50.c | 27 +- include/drm/drm_debugfs.h | 25 + include/drm/drm_drv.h | 6 + include/drm/drm_exec.h | 144 ++ include/drm/drm_gem.h | 75 + include/drm/drm_gpuva_mgr.h | 714 +++++++ include/linux/maple_tree.h | 27 +- include/uapi/drm/nouveau_drm.h | 220 +++ lib/maple_tree.c | 7 +- 51 files changed, 6808 insertions(+), 209 deletions(-) create mode 100644 drivers/gpu/drm/drm_exec.c create mode 100644 drivers/gpu/drm/drm_gpuva_mgr.c create mode 100644 drivers/gpu/drm/nouveau/nouveau_exec.c create mode 100644 drivers/gpu/drm/nouveau/nouveau_exec.h create mode 100644 drivers/gpu/drm/nouveau/nouveau_sched.c create mode 100644 drivers/gpu/drm/nouveau/nouveau_sched.h create mode 100644 drivers/gpu/drm/nouveau/nouveau_uvmm.c create mode 100644 drivers/gpu/drm/nouveau/nouveau_uvmm.h create mode 100644 include/drm/drm_exec.h create mode 100644 include/drm/drm_gpuva_mgr.h base-commit: 48075a66fca613477ac1969b576a93ef5db0164f