[v4,09/16] KVM: Introduce KVM_CAP_NOWAIT_ON_FAULT without implementation

Message ID	20230602161921.208564-10-amoorthy@google.com (mailing list archive)
State	New, archived
Headers	show Return-Path: <kvm-owner@vger.kernel.org> Date: Fri, 2 Jun 2023 16:19:14 +0000 In-Reply-To: <20230602161921.208564-1-amoorthy@google.com> Mime-Version: 1.0 References: <20230602161921.208564-1-amoorthy@google.com> Message-ID: <20230602161921.208564-10-amoorthy@google.com> Subject: [PATCH v4 09/16] KVM: Introduce KVM_CAP_NOWAIT_ON_FAULT without implementation From: Anish Moorthy <amoorthy@google.com> To: seanjc@google.com, oliver.upton@linux.dev, kvm@vger.kernel.org, kvmarm@lists.linux.dev Cc: pbonzini@redhat.com, maz@kernel.org, robert.hoo.linux@gmail.com, jthoughton@google.com, amoorthy@google.com, bgardon@google.com, dmatlack@google.com, ricarkol@google.com, axelrasmussen@google.com, peterx@redhat.com, nadav.amit@gmail.com, isaku.yamahata@gmail.com Content-Type: text/plain; charset="UTF-8" Precedence: bulk
Series	Improve scalability of KVM + userfaultfd live migration via annotated memory faults. \| expand [v4,00/16] Improve scalability of KVM + userfaultfd live migration via annotated memory faults. [v4,01/16] KVM: Allow hva_pfn_fast() to resolve read-only faults. [v4,02/16] KVM: x86: Set vCPU exit reason to KVM_EXIT_UNKNOWN at the start of KVM_RUN [v4,03/16] KVM: Add KVM_CAP_MEMORY_FAULT_INFO [v4,04/16] KVM: Add docstrings to __kvm_write_guest_page() and __kvm_read_guest_page() [v4,05/16] KVM: Annotate -EFAULTs from kvm_vcpu_write_guest_page() [v4,06/16] KVM: Annotate -EFAULTs from kvm_vcpu_read_guest_page() [v4,07/16] KVM: Simplify error handling in __gfn_to_pfn_memslot() [v4,08/16] KVM: x86: Annotate -EFAULTs from kvm_handle_error_pfn() [v4,09/16] KVM: Introduce KVM_CAP_NOWAIT_ON_FAULT without implementation [v4,10/16] KVM: x86: Implement KVM_CAP_NOWAIT_ON_FAULT [v4,11/16] KVM: arm64: Implement KVM_CAP_NOWAIT_ON_FAULT [v4,12/16] KVM: selftests: Report per-vcpu demand paging rate from demand paging test [v4,13/16] KVM: selftests: Allow many vCPUs and reader threads per UFFD in demand paging test [v4,14/16] KVM: selftests: Use EPOLL in userfaultfd_util reader threads and signal errors via TEST_… [v4,15/16] KVM: selftests: Add memslot_flags parameter to memstress_create_vm() [v4,16/16] KVM: selftests: Handle memory fault exits in demand_paging_test

diff --git a/Documentation/virt/kvm/api.rst b/Documentation/virt/kvm/api.rst index 5b24059143b3..9daadbe2c7ed 100644 --- a/Documentation/virt/kvm/api.rst +++ b/Documentation/virt/kvm/api.rst @@ -1312,6 +1312,7 @@ yet and must be cleared on entry. /* for kvm_userspace_memory_region::flags */ #define KVM_MEM_LOG_DIRTY_PAGES (1UL << 0) #define KVM_MEM_READONLY (1UL << 1) + #define KVM_MEM_NOWAIT_ON_FAULT (1UL << 2) This ioctl allows the user to create, modify or delete a guest physical memory slot. Bits 0-15 of "slot" specify the slot id and this value @@ -1342,12 +1343,15 @@ It is recommended that the lower 21 bits of guest_phys_addr and userspace_addr be identical. This allows large pages in the guest to be backed by large pages in the host. -The flags field supports two flags: KVM_MEM_LOG_DIRTY_PAGES and -KVM_MEM_READONLY. The former can be set to instruct KVM to keep track of +The flags field supports three flags + +1. KVM_MEM_LOG_DIRTY_PAGES: can be set to instruct KVM to keep track of writes to memory within the slot. See KVM_GET_DIRTY_LOG ioctl to know how to -use it. The latter can be set, if KVM_CAP_READONLY_MEM capability allows it, +use it. +2. KVM_MEM_READONLY: can be set, if KVM_CAP_READONLY_MEM capability allows it, to make a new slot read-only. In this case, writes to this memory will be posted to userspace as KVM_EXIT_MMIO exits. +3. KVM_MEM_NOWAIT_ON_FAULT: see KVM_CAP_NOWAIT_ON_FAULT for details. When the KVM_CAP_SYNC_MMU capability is available, changes in the backing of the memory region are automatically reflected into the guest. For example, an @@ -7776,6 +7780,28 @@ userspace may receive "bare" EFAULTs (i.e. exit reason != KVM_EXIT_MEMORY_FAULT) from KVM_RUN for failures which may be resolvable. These should be considered bugs and reported to the maintainers so that annotations can be added. +7.35 KVM_CAP_NOWAIT_ON_FAULT +---------------------------- + +:Architectures: None +:Returns: -EINVAL. + +The presence of this capability indicates that userspace may pass the +KVM_MEM_NOWAIT_ON_FAULT flag to KVM_SET_USER_MEMORY_REGION to cause KVM_RUN +to fail (-EFAULT) in response to page faults for which resolution would require +the faulting thread to sleep. + +The range of guest physical memory causing the fault is advertised to userspace +through KVM_CAP_MEMORY_FAULT_INFO. + +Userspace should determine how best to make the mapping present, then take +appropriate action. For instance establishing the mapping could involve a +MADV_POPULATE_READ|WRITE, in the context of userfaultfd a UFFDIO_COPY|CONTINUE +could be appropriate, etc. After establishing the mapping, userspace can return +to KVM to retry the memory access. + +Attempts to enable this capability directly will fail. + 8. Other capabilities. ====================== diff --git a/include/linux/kvm_host.h b/include/linux/kvm_host.h index 69a221f71914..abbc5dd72292 100644 --- a/include/linux/kvm_host.h +++ b/include/linux/kvm_host.h @@ -2297,4 +2297,10 @@ static inline void kvm_account_pgtable_pages(void *virt, int nr) */ inline void kvm_populate_efault_info(struct kvm_vcpu *vcpu, uint64_t gpa, uint64_t len, uint64_t flags); + +static inline bool kvm_slot_nowait_on_fault( + const struct kvm_memory_slot *slot) +{ + return slot->flags & KVM_MEM_NOWAIT_ON_FAULT; +} #endif diff --git a/include/uapi/linux/kvm.h b/include/uapi/linux/kvm.h index 143abb334f56..595c3d7d36aa 100644 --- a/include/uapi/linux/kvm.h +++ b/include/uapi/linux/kvm.h @@ -102,6 +102,7 @@ struct kvm_userspace_memory_region { */ #define KVM_MEM_LOG_DIRTY_PAGES (1UL << 0) #define KVM_MEM_READONLY (1UL << 1) +#define KVM_MEM_NOWAIT_ON_FAULT (1UL << 2) /* for KVM_IRQ_LINE */ struct kvm_irq_level { @@ -1198,6 +1199,7 @@ struct kvm_ppc_resize_hpt { #define KVM_CAP_PMU_EVENT_MASKED_EVENTS 226 #define KVM_CAP_COUNTER_OFFSET 227 #define KVM_CAP_MEMORY_FAULT_INFO 228 +#define KVM_CAP_NOWAIT_ON_FAULT 229 #ifdef KVM_CAP_IRQ_ROUTING diff --git a/tools/include/uapi/linux/kvm.h b/tools/include/uapi/linux/kvm.h index 5476fe169921..f64845cd599f 100644 --- a/tools/include/uapi/linux/kvm.h +++ b/tools/include/uapi/linux/kvm.h @@ -102,6 +102,7 @@ struct kvm_userspace_memory_region { */ #define KVM_MEM_LOG_DIRTY_PAGES (1UL << 0) #define KVM_MEM_READONLY (1UL << 1) +#define KVM_MEM_NOWAIT_ON_FAULT (1UL << 2) /* for KVM_IRQ_LINE */ struct kvm_irq_level { diff --git a/virt/kvm/kvm_main.c b/virt/kvm/kvm_main.c index 05d6e7e3994d..2c276d4d0821 100644 --- a/virt/kvm/kvm_main.c +++ b/virt/kvm/kvm_main.c @@ -1527,6 +1527,9 @@ static int check_memory_region_flags(const struct kvm_userspace_memory_region *m valid_flags |= KVM_MEM_READONLY; #endif + if (kvm_vm_ioctl_check_extension(NULL, KVM_CAP_NOWAIT_ON_FAULT)) + valid_flags |= KVM_MEM_NOWAIT_ON_FAULT; + if (mem->flags & ~valid_flags) return -EINVAL;

[v4,09/16] KVM: Introduce KVM_CAP_NOWAIT_ON_FAULT without implementation

Commit Message

Comments

Patch