From patchwork Fri Sep 29 15:01:41 2017 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: George Dunlap X-Patchwork-Id: 9978255 Return-Path: Received: from mail.wl.linuxfoundation.org (pdx-wl-mail.web.codeaurora.org [172.30.200.125]) by pdx-korg-patchwork.web.codeaurora.org (Postfix) with ESMTP id 4BBC860311 for ; Fri, 29 Sep 2017 15:04:13 +0000 (UTC) Received: from mail.wl.linuxfoundation.org (localhost [127.0.0.1]) by mail.wl.linuxfoundation.org (Postfix) with ESMTP id 3E1B52766D for ; Fri, 29 Sep 2017 15:04:13 +0000 (UTC) Received: by mail.wl.linuxfoundation.org (Postfix, from userid 486) id 32E4228448; Fri, 29 Sep 2017 15:04:13 +0000 (UTC) X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on pdx-wl-mail.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-4.2 required=2.0 tests=BAYES_00, RCVD_IN_DNSWL_MED autolearn=ham version=3.3.1 Received: from lists.xenproject.org (lists.xenproject.org [192.237.175.120]) (using TLSv1.2 with cipher AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by mail.wl.linuxfoundation.org (Postfix) with ESMTPS id 0E2A42766D for ; Fri, 29 Sep 2017 15:04:12 +0000 (UTC) Received: from localhost ([127.0.0.1] helo=lists.xenproject.org) by lists.xenproject.org with esmtp (Exim 4.84_2) (envelope-from ) id 1dxwnt-0003hQ-Sz; Fri, 29 Sep 2017 15:02:13 +0000 Received: from mail6.bemta6.messagelabs.com ([193.109.254.103]) by lists.xenproject.org with esmtp (Exim 4.84_2) (envelope-from ) id 1dxwns-0003gI-Hj for xen-devel@lists.xenproject.org; Fri, 29 Sep 2017 15:02:12 +0000 Received: from [193.109.254.147] by server-2.bemta-6.messagelabs.com id 50/45-10804-3706EC95; Fri, 29 Sep 2017 15:02:11 +0000 X-Brightmail-Tracker: H4sIAAAAAAAAA+NgFupnkeJIrShJLcpLzFFi42JxWrrBXrc44Vy kwe79whbft0xmcmD0OPzhCksAYxRrZl5SfkUCa8bFK0vYCx47VByYyNXA+N6wi5GTQ0LAX2Ll g5dMIDabgJ7EvONfWboYOThEBFQkbu816GLk4mAWeM8o0fPgLgtIjbBAnMTK2xeYQWpYBFQl/ k2MAQnzCthILHn9hR1ipLzEuQe3mUFsTgFbiU393WC2EFDNs2t32SFsVYnFD46yQ/QKSpyc+Q RsPLOAhMTBFy+YJzDyzkKSmoUktYCRaRWjRnFqUVlqka6RqV5SUWZ6RkluYmaOrqGBmV5uanF xYnpqTmJSsV5yfu4mRmDgMADBDsZVCwIPMUpyMCmJ8grEnosU4kvKT6nMSCzOiC8qzUktPsQo w8GhJMFbFA+UEyxKTU+tSMvMAYYwTFqCg0dJhJcTJM1bXJCYW5yZDpE6xajL0XHz7h8mIZa8/ LxUKXFeUZAiAZCijNI8uBGweLrEKCslzMsIdJQQT0FqUW5mCar8K0ZxDkYlYd5WkCk8mXklcJ teAR3BBHTE5IlnQI4oSURISTUwujyRWtboorVs0ZFG/ZsX/63ZekpmsoDcNLXlf0WiHmut+Wc TKXu1evu9J++//r3+ZvsF7leqyc1XDITkuObmdO3bvZbTI0ovdU3bejNe6bVbV8lKNs7c+CN8 SWdDwD2pdrM/NUl3boSznNYX60zTy7srGOso8I7R7cGtvHsJDGovmntD7i6JUmIpzkg01GIuK k4EAFLvKlKiAgAA X-Env-Sender: prvs=438a79e6a=George.Dunlap@citrix.com X-Msg-Ref: server-13.tower-27.messagelabs.com!1506697326!109688636!3 X-Originating-IP: [66.165.176.63] X-SpamReason: No, hits=0.0 required=7.0 tests=sa_preprocessor: VHJ1c3RlZCBJUDogNjYuMTY1LjE3Ni42MyA9PiAzMDYwNDg=\n, received_headers: No Received headers X-StarScan-Received: X-StarScan-Version: 9.4.45; banners=-,-,- X-VirusChecked: Checked Received: (qmail 40052 invoked from network); 29 Sep 2017 15:02:11 -0000 Received: from smtp02.citrix.com (HELO SMTP02.CITRIX.COM) (66.165.176.63) by server-13.tower-27.messagelabs.com with RC4-SHA encrypted SMTP; 29 Sep 2017 15:02:11 -0000 X-IronPort-AV: E=Sophos;i="5.42,452,1500940800"; d="scan'208";a="449785863" From: George Dunlap To: Date: Fri, 29 Sep 2017 16:01:41 +0100 Message-ID: <20170929150144.7602-6-george.dunlap@citrix.com> X-Mailer: git-send-email 2.14.1 In-Reply-To: <20170929150144.7602-1-george.dunlap@citrix.com> References: <20170929150144.7602-1-george.dunlap@citrix.com> MIME-Version: 1.0 Cc: Sergey Dyasli , Kevin Tian , Jan Beulich , Andrew Cooper , George Dunlap , Jun Nakajima Subject: [Xen-devel] [PATCH 6/9] x86/np2m: Send flush IPIs only when a vcpu is actively using a shadow p2m X-BeenThere: xen-devel@lists.xen.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: Xen developer discussion List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Errors-To: xen-devel-bounces@lists.xen.org Sender: "Xen-devel" X-Virus-Scanned: ClamAV using ClamSMTP Flush IPIs are sent to all cpus in a shadow p2m's dirty_cpumask when updated. This mask however is far to broad. A pcpu's bit is set in the cpumask when a vcpu runs on that pcpu, but is only cleared when a flush happens. This means that the IPI includes the current pcpu of vcpus that are not currently running, and also includes any pcpu that has ever had a vcpu use this p2m since the last flush (which in turn will cause spurious invalidations if a different vcpu is using a shadow p2m). Avoid these IPIs by keeping closer track of where a p2m is being used, and when a vcpu needs to be flushed: - On schedule-out, clear v->processor in p2m->dirty_cpumask - Add a 'generation' counter to the p2m and nestedvcpu structs to detect changes that would require re-loads on re-entry - On schedule-in or p2m change: - Set v->processor in p2m->dirty_cpumask - flush the vcpu's nested p2m pointer (and update nv->generation) if the generation changed Signed-off-by: Sergey Dyasli Signed-off-by: George Dunlap --- Changes since v1: - Combine patches 5 and 8, and the scheduling bits of patch 11 ("x86/np2m: add np2m_generation", "x86/np2m: add np2m_schedule()", and "x86/np2m: implement sharing of np2m between vCPUs") - Reword commit message CC: Andrew Cooper CC: Jan Beulich CC: Jun Nakajima CC: Kevin Tian --- xen/arch/x86/domain.c | 2 ++ xen/arch/x86/hvm/nestedhvm.c | 1 + xen/arch/x86/hvm/vmx/vvmx.c | 3 +++ xen/arch/x86/mm/p2m.c | 55 +++++++++++++++++++++++++++++++++++++++++- xen/include/asm-x86/hvm/vcpu.h | 1 + xen/include/asm-x86/p2m.h | 6 +++++ 6 files changed, 67 insertions(+), 1 deletion(-) diff --git a/xen/arch/x86/domain.c b/xen/arch/x86/domain.c index 466a1a2fac..35ea0d2418 100644 --- a/xen/arch/x86/domain.c +++ b/xen/arch/x86/domain.c @@ -1668,6 +1668,7 @@ void context_switch(struct vcpu *prev, struct vcpu *next) { _update_runstate_area(prev); vpmu_switch_from(prev); + np2m_schedule(NP2M_SCHEDLE_OUT); } if ( is_hvm_domain(prevd) && !list_empty(&prev->arch.hvm_vcpu.tm_list) ) @@ -1716,6 +1717,7 @@ void context_switch(struct vcpu *prev, struct vcpu *next) /* Must be done with interrupts enabled */ vpmu_switch_to(next); + np2m_schedule(NP2M_SCHEDLE_IN); } /* Ensure that the vcpu has an up-to-date time base. */ diff --git a/xen/arch/x86/hvm/nestedhvm.c b/xen/arch/x86/hvm/nestedhvm.c index 74a464d162..ab50b2ab98 100644 --- a/xen/arch/x86/hvm/nestedhvm.c +++ b/xen/arch/x86/hvm/nestedhvm.c @@ -57,6 +57,7 @@ nestedhvm_vcpu_reset(struct vcpu *v) nv->nv_flushp2m = 0; nv->nv_p2m = NULL; nv->stale_np2m = false; + nv->np2m_generation = 0; hvm_asid_flush_vcpu_asid(&nv->nv_n2asid); diff --git a/xen/arch/x86/hvm/vmx/vvmx.c b/xen/arch/x86/hvm/vmx/vvmx.c index 48e37158af..a6a558b460 100644 --- a/xen/arch/x86/hvm/vmx/vvmx.c +++ b/xen/arch/x86/hvm/vmx/vvmx.c @@ -1367,6 +1367,9 @@ static void virtual_vmexit(struct cpu_user_regs *regs) !(v->arch.hvm_vcpu.guest_efer & EFER_LMA) ) shadow_to_vvmcs_bulk(v, ARRAY_SIZE(gpdpte_fields), gpdpte_fields); + /* This will clear current pCPU bit in p2m->dirty_cpumask */ + np2m_schedule(NP2M_SCHEDLE_OUT); + vmx_vmcs_switch(v->arch.hvm_vmx.vmcs_pa, nvcpu->nv_n1vmcx_pa); nestedhvm_vcpu_exit_guestmode(v); diff --git a/xen/arch/x86/mm/p2m.c b/xen/arch/x86/mm/p2m.c index fd48a3b9db..3c6c486c00 100644 --- a/xen/arch/x86/mm/p2m.c +++ b/xen/arch/x86/mm/p2m.c @@ -73,6 +73,7 @@ static int p2m_initialise(struct domain *d, struct p2m_domain *p2m) p2m->p2m_class = p2m_host; p2m->np2m_base = P2M_BASE_EADDR; + p2m->np2m_generation = 0; for ( i = 0; i < ARRAY_SIZE(p2m->pod.mrp.list); ++i ) p2m->pod.mrp.list[i] = gfn_x(INVALID_GFN); @@ -1735,6 +1736,7 @@ p2m_flush_table_locked(struct p2m_domain *p2m) /* This is no longer a valid nested p2m for any address space */ p2m->np2m_base = P2M_BASE_EADDR; + p2m->np2m_generation++; /* Make sure nobody else is using this p2m table */ nestedhvm_vmcx_flushtlb(p2m); @@ -1809,6 +1811,7 @@ static void assign_np2m(struct vcpu *v, struct p2m_domain *p2m) nv->nv_flushp2m = 0; nv->nv_p2m = p2m; + nv->np2m_generation = p2m->np2m_generation; cpumask_set_cpu(v->processor, p2m->dirty_cpumask); } @@ -1840,7 +1843,9 @@ p2m_get_nestedp2m_locked(struct vcpu *v) p2m_lock(p2m); if ( p2m->np2m_base == np2m_base || p2m->np2m_base == P2M_BASE_EADDR ) { - if ( p2m->np2m_base == P2M_BASE_EADDR ) + /* Check if np2m was flushed just before the lock */ + if ( p2m->np2m_base == P2M_BASE_EADDR || + nv->np2m_generation != p2m->np2m_generation ) nvcpu_flush(v); p2m->np2m_base = np2m_base; assign_np2m(v, p2m); @@ -1848,6 +1853,11 @@ p2m_get_nestedp2m_locked(struct vcpu *v) return p2m; } + else + { + /* vCPU is switching from some other valid np2m */ + cpumask_clear_cpu(v->processor, p2m->dirty_cpumask); + } p2m_unlock(p2m); } @@ -1881,6 +1891,49 @@ p2m_get_p2m(struct vcpu *v) return p2m_get_nestedp2m(v); } +void np2m_schedule(int dir) +{ + struct nestedvcpu *nv = &vcpu_nestedhvm(current); + struct p2m_domain *p2m; + + ASSERT(dir == NP2M_SCHEDLE_IN || dir == NP2M_SCHEDLE_OUT); + + if ( !nestedhvm_enabled(current->domain) || + !nestedhvm_vcpu_in_guestmode(current) || + !nestedhvm_paging_mode_hap(current) ) + return; + + p2m = nv->nv_p2m; + if ( p2m ) + { + bool np2m_valid; + + p2m_lock(p2m); + np2m_valid = p2m->np2m_base == nhvm_vcpu_p2m_base(current) && + nv->np2m_generation == p2m->np2m_generation; + if ( dir == NP2M_SCHEDLE_OUT && np2m_valid ) + { + /* + * The np2m is up to date but this vCPU will no longer use it, + * which means there are no reasons to send a flush IPI. + */ + cpumask_clear_cpu(current->processor, p2m->dirty_cpumask); + } + else if ( dir == NP2M_SCHEDLE_IN ) + { + if ( !np2m_valid ) + { + /* This vCPU's np2m was flushed while it was not runnable */ + hvm_asid_flush_core(); + vcpu_nestedhvm(current).nv_p2m = NULL; + } + else + cpumask_set_cpu(current->processor, p2m->dirty_cpumask); + } + p2m_unlock(p2m); + } +} + unsigned long paging_gva_to_gfn(struct vcpu *v, unsigned long va, uint32_t *pfec) diff --git a/xen/include/asm-x86/hvm/vcpu.h b/xen/include/asm-x86/hvm/vcpu.h index 5cfa4b4aa4..afe5ffc6b3 100644 --- a/xen/include/asm-x86/hvm/vcpu.h +++ b/xen/include/asm-x86/hvm/vcpu.h @@ -116,6 +116,7 @@ struct nestedvcpu { bool_t nv_flushp2m; /* True, when p2m table must be flushed */ struct p2m_domain *nv_p2m; /* used p2m table for this vcpu */ bool stale_np2m; /* True when p2m_base in VMCX02 is no longer valid */ + uint64_t np2m_generation; struct hvm_vcpu_asid nv_n2asid; diff --git a/xen/include/asm-x86/p2m.h b/xen/include/asm-x86/p2m.h index 4a1c10c130..8d4aa8c6bf 100644 --- a/xen/include/asm-x86/p2m.h +++ b/xen/include/asm-x86/p2m.h @@ -209,6 +209,7 @@ struct p2m_domain { * to set it to any other value. */ #define P2M_BASE_EADDR (~0ULL) uint64_t np2m_base; + uint64_t np2m_generation; /* Nested p2ms: linked list of n2pms allocated to this domain. * The host p2m hasolds the head of the list and the np2ms are @@ -371,6 +372,11 @@ struct p2m_domain *p2m_get_nestedp2m_locked(struct vcpu *v); */ struct p2m_domain *p2m_get_p2m(struct vcpu *v); +#define NP2M_SCHEDLE_IN 0 +#define NP2M_SCHEDLE_OUT 1 + +void np2m_schedule(int dir); + static inline bool_t p2m_is_hostp2m(const struct p2m_domain *p2m) { return p2m->p2m_class == p2m_host;