[v12,45/77] KVM: introspection: add KVMI_VM_PAUSE_VCPU

Message ID	20211006173113.26445-46-alazar@bitdefender.com (mailing list archive)
State	New, archived
Headers	show Return-Path: <kvm-owner@kernel.org> From: =?utf-8?q?Adalbert_Laz=C4=83r?= <alazar@bitdefender.com> To: kvm@vger.kernel.org Cc: virtualization@lists.linux-foundation.org, Paolo Bonzini <pbonzini@redhat.com>, Sean Christopherson <seanjc@google.com>, Vitaly Kuznetsov <vkuznets@redhat.com>, Wanpeng Li <wanpengli@tencent.com>, Jim Mattson <jmattson@google.com>, Joerg Roedel <joro@8bytes.org>, Mathieu Tarral <mathieu.tarral@protonmail.com>, Tamas K Lengyel <tamas@tklengyel.com>, =?utf-8?q?Adalbert_Laz=C4=83r?= <alazar@bitdefender.com> Subject: [PATCH v12 45/77] KVM: introspection: add KVMI_VM_PAUSE_VCPU Date: Wed, 6 Oct 2021 20:30:41 +0300 Message-Id: <20211006173113.26445-46-alazar@bitdefender.com> In-Reply-To: <20211006173113.26445-1-alazar@bitdefender.com> References: <20211006173113.26445-1-alazar@bitdefender.com> MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Precedence: bulk
Series	VM introspection \| expand [v12,00/77] VM introspection [v12,01/77] KVM: UAPI: add error codes used by the VM introspection code [v12,02/77] KVM: add kvm_vcpu_kick_and_wait() [v12,03/77] KVM: x86: add kvm_arch_vcpu_get_regs() and kvm_arch_vcpu_get_sregs() [v12,04/77] KVM: x86: add kvm_arch_vcpu_set_regs() [v12,05/77] KVM: x86: avoid injecting #PF when emulate the VMCALL instruction [v12,06/77] KVM: x86: add kvm_x86_ops.bp_intercepted() [v12,07/77] KVM: x86: add kvm_x86_ops.control_cr3_intercept() [v12,08/77] KVM: x86: add kvm_x86_ops.cr3_write_intercepted() [v12,09/77] KVM: x86: add kvm_x86_ops.desc_ctrl_supported() [v12,10/77] KVM: svm: add support for descriptor-table VM-exits [v12,11/77] KVM: x86: add kvm_x86_ops.control_desc_intercept() [v12,12/77] KVM: x86: add kvm_x86_ops.desc_intercepted() [v12,13/77] KVM: x86: add kvm_x86_ops.msr_write_intercepted() [v12,14/77] KVM: x86: svm: use the vmx convention to control the MSR interception [v12,15/77] KVM: x86: add kvm_x86_ops.control_msr_intercept() [v12,16/77] KVM: x86: save the error code during EPT/NPF exits handling [v12,17/77] KVM: x86: add kvm_x86_ops.fault_gla() [v12,18/77] KVM: x86: add kvm_x86_ops.control_singlestep() [v12,19/77] KVM: x86: export kvm_arch_vcpu_set_guest_debug() [v12,20/77] KVM: x86: extend kvm_mmu_gva_to_gpa_system() with the 'access' parameter [v12,21/77] KVM: x86: export kvm_inject_pending_exception() [v12,22/77] KVM: x86: export kvm_vcpu_ioctl_x86_get_xsave() [v12,23/77] KVM: x86: export kvm_vcpu_ioctl_x86_set_xsave() [v12,24/77] KVM: x86: page track: provide all callbacks with the guest virtual address [v12,25/77] KVM: x86: page track: add track_create_slot() callback [v12,26/77] KVM: x86: page_track: add support for preread, prewrite and preexec [v12,27/77] KVM: x86: wire in the preread/prewrite/preexec page trackers [v12,28/77] KVM: x86: disable gpa_available optimization for fetch and page-walk SPT violations [v12,29/77] KVM: introduce VM introspection [v12,30/77] KVM: introspection: add hook/unhook ioctls [v12,31/77] KVM: introspection: add permission access ioctls [v12,32/77] KVM: introspection: add the read/dispatch message function [v12,33/77] KVM: introspection: add KVMI_GET_VERSION [v12,34/77] KVM: introspection: add KVMI_VM_CHECK_COMMAND and KVMI_VM_CHECK_EVENT [v12,35/77] KVM: introspection: add KVMI_VM_GET_INFO [v12,36/77] KVM: introspection: add KVM_INTROSPECTION_PREUNHOOK [v12,37/77] KVM: introspection: add KVMI_VM_EVENT_UNHOOK [v12,38/77] KVM: introspection: add KVMI_VM_CONTROL_EVENTS [v12,39/77] KVM: introspection: add KVMI_VM_READ_PHYSICAL/KVMI_VM_WRITE_PHYSICAL [v12,40/77] KVM: introspection: add vCPU related data [v12,41/77] KVM: introspection: add a jobs list to every introspected vCPU [v12,42/77] KVM: introspection: handle vCPU introspection requests [v12,43/77] KVM: introspection: handle vCPU commands [v12,44/77] KVM: introspection: add KVMI_VCPU_GET_INFO [v12,45/77] KVM: introspection: add KVMI_VM_PAUSE_VCPU [v12,46/77] KVM: introspection: add support for vCPU events [v12,47/77] KVM: introspection: add KVMI_VCPU_EVENT_PAUSE [v12,48/77] KVM: introspection: add the crash action handling on the event reply [v12,49/77] KVM: introspection: add KVMI_VCPU_CONTROL_EVENTS [v12,50/77] KVM: introspection: add KVMI_VCPU_GET_REGISTERS [v12,51/77] KVM: introspection: add KVMI_VCPU_SET_REGISTERS [v12,52/77] KVM: introspection: add KVMI_VCPU_GET_CPUID [v12,53/77] KVM: introspection: add KVMI_VCPU_EVENT_HYPERCALL [v12,54/77] KVM: introspection: add KVMI_VCPU_EVENT_BREAKPOINT [v12,55/77] KVM: introspection: add cleanup support for vCPUs [v12,56/77] KVM: introspection: restore the state of #BP interception on unhook [v12,57/77] KVM: introspection: add KVMI_VM_CONTROL_CLEANUP [v12,58/77] KVM: introspection: add KVMI_VCPU_CONTROL_CR and KVMI_VCPU_EVENT_CR [v12,59/77] KVM: introspection: restore the state of CR3 interception on unhook [v12,60/77] KVM: introspection: add KVMI_VCPU_INJECT_EXCEPTION + KVMI_VCPU_EVENT_TRAP [v12,61/77] KVM: introspection: add KVMI_VCPU_EVENT_XSETBV [v12,62/77] KVM: introspection: add KVMI_VCPU_GET_XCR [v12,63/77] KVM: introspection: add KVMI_VCPU_GET_XSAVE [v12,64/77] KVM: introspection: add KVMI_VCPU_SET_XSAVE [v12,65/77] KVM: introspection: add KVMI_VCPU_GET_MTRR_TYPE [v12,66/77] KVM: introspection: add KVMI_VCPU_EVENT_DESCRIPTOR [v12,67/77] KVM: introspection: restore the state of descriptor-table register interception on unho… [v12,68/77] KVM: introspection: add KVMI_VCPU_CONTROL_MSR and KVMI_VCPU_EVENT_MSR [v12,69/77] KVM: introspection: restore the state of MSR interception on unhook [v12,70/77] KVM: introspection: add KVMI_VM_SET_PAGE_ACCESS [v12,71/77] KVM: introspection: add KVMI_VCPU_EVENT_PF [v12,72/77] KVM: introspection: extend KVMI_GET_VERSION with struct kvmi_features [v12,73/77] KVM: introspection: add KVMI_VCPU_CONTROL_SINGLESTEP [v12,74/77] KVM: introspection: add KVMI_VCPU_EVENT_SINGLESTEP [v12,75/77] KVM: introspection: add KVMI_VCPU_TRANSLATE_GVA [v12,76/77] KVM: introspection: emulate a guest page table walk on SPT violations due to A/D bit up… [v12,77/77] KVM: x86: call the page tracking code on emulation failure

Message ID

20211006173113.26445-46-alazar@bitdefender.com (mailing list archive)

State

New, archived

Headers

From: =?utf-8?q?Adalbert_Laz=C4=83r?= <alazar@bitdefender.com>
To: kvm@vger.kernel.org
Cc: virtualization@lists.linux-foundation.org,
 Paolo Bonzini <pbonzini@redhat.com>, Sean Christopherson <seanjc@google.com>,
 Vitaly Kuznetsov <vkuznets@redhat.com>, Wanpeng Li <wanpengli@tencent.com>,
 Jim Mattson <jmattson@google.com>, Joerg Roedel <joro@8bytes.org>,
 Mathieu Tarral <mathieu.tarral@protonmail.com>,
 Tamas K Lengyel <tamas@tklengyel.com>,
 =?utf-8?q?Adalbert_Laz=C4=83r?= <alazar@bitdefender.com>
Subject: [PATCH v12 45/77] KVM: introspection: add KVMI_VM_PAUSE_VCPU
Date: Wed,  6 Oct 2021 20:30:41 +0300
Message-Id: <20211006173113.26445-46-alazar@bitdefender.com>
In-Reply-To: <20211006173113.26445-1-alazar@bitdefender.com>
References: <20211006173113.26445-1-alazar@bitdefender.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Precedence: bulk

Series

VM introspection | expand

Commit Message

Adalbert Lazăr Oct. 6, 2021, 5:30 p.m. UTC

This command increments a pause requests counter for a vCPU and kicks
it out of guest.

The introspection tool can pause a VM by sending this command for all
vCPUs. If it sets 'wait=1', it can consider that the VM is paused when
it receives the reply for the last KVMI_VM_PAUSE_VCPU command.

Signed-off-by: Adalbert Lazăr <alazar@bitdefender.com>
---
 Documentation/virt/kvm/kvmi.rst               | 39 +++++++++++++++
 include/linux/kvmi_host.h                     |  2 +
 include/uapi/linux/kvmi.h                     |  8 ++++
 .../testing/selftests/kvm/x86_64/kvmi_test.c  | 30 ++++++++++++
 virt/kvm/introspection/kvmi.c                 | 47 +++++++++++++++++--
 virt/kvm/introspection/kvmi_int.h             |  1 +
 virt/kvm/introspection/kvmi_msg.c             | 24 ++++++++++
 7 files changed, 147 insertions(+), 4 deletions(-)

diff --git a/Documentation/virt/kvm/kvmi.rst b/Documentation/virt/kvm/kvmi.rst
index 2f41fce79d95..9f6905456923 100644
--- a/Documentation/virt/kvm/kvmi.rst
+++ b/Documentation/virt/kvm/kvmi.rst
@@ -470,6 +470,45 @@  Returns the TSC frequency (in HZ) for the specified vCPU if available
 * -KVM_EINVAL - the selected vCPU is invalid
 * -KVM_EAGAIN - the selected vCPU can't be introspected yet
 
+9. KVMI_VM_PAUSE_VCPU
+---------------------
+
+:Architectures: all
+:Versions: >= 1
+:Parameters:
+
+::
+
+	struct kvmi_vm_pause_vcpu {
+		__u16 vcpu;
+		__u8 wait;
+		__u8 padding1;
+		__u32 padding2;
+	};
+
+:Returns:
+
+::
+
+	struct kvmi_error_code;
+
+Kicks the vCPU out of guest.
+
+If `wait` is 1, the command will wait for vCPU to acknowledge the IPI.
+
+The vCPU will handle the pending commands/events and send the
+*KVMI_VCPU_EVENT_PAUSE* event (one for every successful *KVMI_VM_PAUSE_VCPU*
+command) before returning to guest.
+
+:Errors:
+
+* -KVM_EINVAL - the padding is not zero
+* -KVM_EINVAL - the selected vCPU is invalid
+* -KVM_EAGAIN - the selected vCPU can't be introspected yet
+* -KVM_EBUSY  - the selected vCPU has too many queued
+                *KVMI_VCPU_EVENT_PAUSE* events
+* -KVM_EPERM  - the *KVMI_VCPU_EVENT_PAUSE* event is disallowed
+
 Events
 ======
 
diff --git a/include/linux/kvmi_host.h b/include/linux/kvmi_host.h
index 736edb400c05..59e645d9ea34 100644
--- a/include/linux/kvmi_host.h
+++ b/include/linux/kvmi_host.h
@@ -18,6 +18,8 @@  struct kvm_vcpu_introspection {
 
 	struct list_head job_list;
 	spinlock_t job_lock;
+
+	atomic_t pause_requests;
 };
 
 struct kvm_introspection {
diff --git a/include/uapi/linux/kvmi.h b/include/uapi/linux/kvmi.h
index da766427231e..bb90d03f059b 100644
--- a/include/uapi/linux/kvmi.h
+++ b/include/uapi/linux/kvmi.h
@@ -26,6 +26,7 @@  enum {
 	KVMI_VM_CONTROL_EVENTS = KVMI_VM_MESSAGE_ID(5),
 	KVMI_VM_READ_PHYSICAL  = KVMI_VM_MESSAGE_ID(6),
 	KVMI_VM_WRITE_PHYSICAL = KVMI_VM_MESSAGE_ID(7),
+	KVMI_VM_PAUSE_VCPU     = KVMI_VM_MESSAGE_ID(8),
 
 	KVMI_NEXT_VM_MESSAGE
 };
@@ -115,4 +116,11 @@  struct kvmi_vcpu_hdr {
 	__u32 padding2;
 };
 
+struct kvmi_vm_pause_vcpu {
+	__u16 vcpu;
+	__u8 wait;
+	__u8 padding1;
+	__u32 padding2;
+};
+
 #endif /* _UAPI__LINUX_KVMI_H */
diff --git a/tools/testing/selftests/kvm/x86_64/kvmi_test.c b/tools/testing/selftests/kvm/x86_64/kvmi_test.c
index 337f295d69ff..f8d355aff5fa 100644
--- a/tools/testing/selftests/kvm/x86_64/kvmi_test.c
+++ b/tools/testing/selftests/kvm/x86_64/kvmi_test.c
@@ -685,6 +685,35 @@  static void test_cmd_vcpu_get_info(struct kvm_vm *vm)
 			&rpl, sizeof(rpl), -KVM_EINVAL);
 }
 
+static void cmd_vcpu_pause(__u8 wait, int expected_err)
+{
+	struct {
+		struct kvmi_msg_hdr hdr;
+		struct kvmi_vm_pause_vcpu cmd;
+	} req = {};
+	__u16 vcpu_idx = 0;
+
+	req.cmd.wait = wait;
+	req.cmd.vcpu = vcpu_idx;
+
+	test_vm_command(KVMI_VM_PAUSE_VCPU, &req.hdr, sizeof(req), NULL, 0, expected_err);
+}
+
+static void pause_vcpu(void)
+{
+	cmd_vcpu_pause(1, 0);
+}
+
+static void test_pause(struct kvm_vm *vm)
+{
+	__u8 wait = 1, wait_inval = 2;
+
+	pause_vcpu();
+
+	cmd_vcpu_pause(wait, 0);
+	cmd_vcpu_pause(wait_inval, -KVM_EINVAL);
+}
+
 static void test_introspection(struct kvm_vm *vm)
 {
 	srandom(time(0));
@@ -700,6 +729,7 @@  static void test_introspection(struct kvm_vm *vm)
 	test_cmd_vm_control_events(vm);
 	test_memory_access(vm);
 	test_cmd_vcpu_get_info(vm);
+	test_pause(vm);
 
 	unhook_introspection(vm);
 }
diff --git a/virt/kvm/introspection/kvmi.c b/virt/kvm/introspection/kvmi.c
index 93b1bec23e48..faf443d6ce82 100644
--- a/virt/kvm/introspection/kvmi.c
+++ b/virt/kvm/introspection/kvmi.c
@@ -17,6 +17,8 @@ 
 
 #define KVMI_MSG_SIZE_ALLOC (sizeof(struct kvmi_msg_hdr) + KVMI_MAX_MSG_SIZE)
 
+#define MAX_PAUSE_REQUESTS 1001
+
 static DECLARE_BITMAP(Kvmi_always_allowed_commands, KVMI_NUM_COMMANDS);
 static DECLARE_BITMAP(Kvmi_known_events, KVMI_NUM_EVENTS);
 static DECLARE_BITMAP(Kvmi_known_vm_events, KVMI_NUM_EVENTS);
@@ -124,10 +126,14 @@  void kvmi_uninit(void)
 	kvmi_cache_destroy();
 }
 
-static void kvmi_make_request(struct kvm_vcpu *vcpu)
+static void kvmi_make_request(struct kvm_vcpu *vcpu, bool wait)
 {
 	kvm_make_request(KVM_REQ_INTROSPECTION, vcpu);
-	kvm_vcpu_kick(vcpu);
+
+	if (wait)
+		kvm_vcpu_kick_and_wait(vcpu);
+	else
+		kvm_vcpu_kick(vcpu);
 }
 
 static int __kvmi_add_job(struct kvm_vcpu *vcpu,
@@ -162,7 +168,7 @@  int kvmi_add_job(struct kvm_vcpu *vcpu,
 	err = __kvmi_add_job(vcpu, fct, ctx, free_fct);
 
 	if (!err)
-		kvmi_make_request(vcpu);
+		kvmi_make_request(vcpu, false);
 
 	return err;
 }
@@ -359,6 +365,9 @@  static int __kvmi_hook(struct kvm *kvm,
 
 static void kvmi_job_release_vcpu(struct kvm_vcpu *vcpu, void *ctx)
 {
+	struct kvm_vcpu_introspection *vcpui = VCPUI(vcpu);
+
+	atomic_set(&vcpui->pause_requests, 0);
 }
 
 static void kvmi_release_vcpus(struct kvm *kvm)
@@ -731,15 +740,45 @@  void kvmi_run_jobs(struct kvm_vcpu *vcpu)
 	}
 }
 
+static void kvmi_vcpu_pause_event(struct kvm_vcpu *vcpu)
+{
+	struct kvm_vcpu_introspection *vcpui = VCPUI(vcpu);
+
+	atomic_dec(&vcpui->pause_requests);
+	/* to be implemented */
+}
+
 void kvmi_handle_requests(struct kvm_vcpu *vcpu)
 {
+	struct kvm_vcpu_introspection *vcpui = VCPUI(vcpu);
 	struct kvm_introspection *kvmi;
 
 	kvmi = kvmi_get(vcpu->kvm);
 	if (!kvmi)
 		return;
 
-	kvmi_run_jobs(vcpu);
+	for (;;) {
+		kvmi_run_jobs(vcpu);
+
+		if (atomic_read(&vcpui->pause_requests))
+			kvmi_vcpu_pause_event(vcpu);
+		else
+			break;
+	}
 
 	kvmi_put(vcpu->kvm);
 }
+
+int kvmi_cmd_vcpu_pause(struct kvm_vcpu *vcpu, bool wait)
+{
+	struct kvm_vcpu_introspection *vcpui = VCPUI(vcpu);
+
+	if (atomic_read(&vcpui->pause_requests) > MAX_PAUSE_REQUESTS)
+		return -KVM_EBUSY;
+
+	atomic_inc(&vcpui->pause_requests);
+
+	kvmi_make_request(vcpu, wait);
+
+	return 0;
+}
diff --git a/virt/kvm/introspection/kvmi_int.h b/virt/kvm/introspection/kvmi_int.h
index 126e72201518..f1caa67dbdc3 100644
--- a/virt/kvm/introspection/kvmi_int.h
+++ b/virt/kvm/introspection/kvmi_int.h
@@ -55,6 +55,7 @@  int kvmi_cmd_read_physical(struct kvm *kvm, u64 gpa, size_t size,
 			   const struct kvmi_msg_hdr *ctx);
 int kvmi_cmd_write_physical(struct kvm *kvm, u64 gpa, size_t size,
 			    const void *buf);
+int kvmi_cmd_vcpu_pause(struct kvm_vcpu *vcpu, bool wait);
 
 /* arch */
 void kvmi_arch_init_vcpu_events_mask(unsigned long *supported);
diff --git a/virt/kvm/introspection/kvmi_msg.c b/virt/kvm/introspection/kvmi_msg.c
index 4cb19f069de2..588ceb36795d 100644
--- a/virt/kvm/introspection/kvmi_msg.c
+++ b/virt/kvm/introspection/kvmi_msg.c
@@ -245,6 +245,29 @@  static int handle_vm_write_physical(struct kvm_introspection *kvmi,
 	return kvmi_msg_vm_reply(kvmi, msg, ec, NULL, 0);
 }
 
+static int handle_vm_pause_vcpu(struct kvm_introspection *kvmi,
+				const struct kvmi_msg_hdr *msg,
+				const void *_req)
+{
+	const struct kvmi_vm_pause_vcpu *req = _req;
+	struct kvm_vcpu *vcpu;
+	int ec;
+
+	if (req->wait > 1 || req->padding1 || req->padding2) {
+		ec = -KVM_EINVAL;
+		goto reply;
+	}
+
+	vcpu = kvmi_get_vcpu(kvmi, req->vcpu);
+	if (!vcpu)
+		ec = -KVM_EINVAL;
+	else
+		ec = kvmi_cmd_vcpu_pause(vcpu, req->wait == 1);
+
+reply:
+	return kvmi_msg_vm_reply(kvmi, msg, ec, NULL, 0);
+}
+
 /*
  * These commands are executed by the receiving thread.
  */
@@ -254,6 +277,7 @@  static const kvmi_vm_msg_fct msg_vm[] = {
 	[KVMI_VM_CHECK_EVENT]    = handle_vm_check_event,
 	[KVMI_VM_CONTROL_EVENTS] = handle_vm_control_events,
 	[KVMI_VM_GET_INFO]       = handle_vm_get_info,
+	[KVMI_VM_PAUSE_VCPU]     = handle_vm_pause_vcpu,
 	[KVMI_VM_READ_PHYSICAL]  = handle_vm_read_physical,
 	[KVMI_VM_WRITE_PHYSICAL] = handle_vm_write_physical,
 };

[v12,45/77] KVM: introspection: add KVMI_VM_PAUSE_VCPU

Commit Message

Patch