
[v2] KVM: x86: directly call wbinvd for local cpu when emulate wbinvd

Message ID 1634118172-32699-1-git-send-email-lirongqing@baidu.com (mailing list archive)
State New, archived

Commit Message

Li RongQing Oct. 13, 2021, 9:42 a.m. UTC
Call wbinvd() directly on the local CPU, instead of atomically setting
the local CPU in wbinvd_dirty_mask with cpumask_set_cpu() and then
letting on_each_cpu_mask() check whether the local CPU needs to run
the function.

on_each_cpu_mask() is less efficient than smp_call_function_many(),
since it disables preemption again and runs the function locally by
checking the SCF_RUN_LOCAL flag, whereas here wbinvd() can simply be
called directly.

In effect, this change reverts commit 2eec73437487 ("KVM: x86: Avoid
issuing wbinvd twice"): since smp_call_function_many() skips the
local CPU (as described in c2162e13d6e2f), wbinvd is not issued
twice.

It also reverts commit c2162e13d6e2f ("KVM: X86: Fix missing local pCPU
when executing wbinvd on all dirty pCPUs"), which fixed the previous
commit; once that commit is reverted, the fix is no longer needed.

Signed-off-by: Li RongQing <lirongqing@baidu.com>
---
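
As a reading aid for the commit message, the following is a minimal userspace sketch of which CPUs end up running wbinvd under the old and the proposed paths. It is not kernel code: `cpumask_model_t` and every function name below are hypothetical stand-ins, the mask is a plain 64-bit integer, and offline CPUs are ignored. It only models the documented behavior that smp_call_function_many() skips the calling CPU while on_each_cpu_mask() also runs the function locally when the local bit is set.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical model: bit N set means CPU N is in the mask. */
typedef uint64_t cpumask_model_t;

/* Models smp_call_function_many(): every CPU in the mask except the caller. */
static inline cpumask_model_t smp_call_function_many_targets(cpumask_model_t mask,
                                                             unsigned int this_cpu)
{
    assert(this_cpu < 64); /* a shift by 64 or more would be undefined */
    return mask & ~((cpumask_model_t)1 << this_cpu);
}

/* Models on_each_cpu_mask(): every CPU in the mask, the caller included. */
static inline cpumask_model_t on_each_cpu_mask_targets(cpumask_model_t mask)
{
    return mask;
}

/* Old path: set the local bit in the dirty mask, then on_each_cpu_mask(). */
static inline cpumask_model_t flushed_old(cpumask_model_t dirty, unsigned int this_cpu)
{
    assert(this_cpu < 64);
    dirty |= (cpumask_model_t)1 << this_cpu;
    return on_each_cpu_mask_targets(dirty);
}

/* Proposed path: smp_call_function_many(), then an unconditional local wbinvd(). */
static inline cpumask_model_t flushed_new(cpumask_model_t dirty, unsigned int this_cpu)
{
    cpumask_model_t remote = smp_call_function_many_targets(dirty, this_cpu);
    return remote | ((cpumask_model_t)1 << this_cpu);
}
```

In this model both paths flush the same set of CPUs exactly once each; the difference between them is ordering and overhead, not coverage.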

Comments

Li RongQing Oct. 22, 2021, 4:16 a.m. UTC | #1
Ping 

-Li

> -----Original Message-----
> From: Li,Rongqing <lirongqing@baidu.com>
> Sent: October 13, 2021 17:43
> To: x86@kernel.org; kvm@vger.kernel.org; Li,Rongqing
> <lirongqing@baidu.com>
> Subject: [PATCH][v2] KVM: x86: directly call wbinvd for local cpu when emulate
> wbinvd
> 
> directly call wbinvd for local cpu, instead of calling atomic cpumask_set_cpu to
> set local cpu, and then check if local cpu needs to run in on_each_cpu_mask
> 
> on_each_cpu_mask is less efficient than smp_call_function_many, since it will
> close preempt again and running call function by checking flag with
> SCF_RUN_LOCAL. and here wbinvd can be called directly
> 
> In fact, This change reverts commit 2eec73437487 ("KVM: x86: Avoid issuing
> wbinvd twice"), since smp_call_function_many is skiping the local cpu (as
> description of c2162e13d6e2f), wbinvd is not issued twice
> 
> and reverts commit c2162e13d6e2f ("KVM: X86: Fix missing local pCPU when
> executing wbinvd on all dirty pCPUs") too, which fixed the previous patch, when
> revert previous patch, it is not needed.
> 
> Signed-off-by: Li RongQing <lirongqing@baidu.com>
> ---
> diff v2: rewrite commit log
> 
>  arch/x86/kvm/x86.c |   13 ++++++-------
>  1 files changed, 6 insertions(+), 7 deletions(-)
> 
> diff --git a/arch/x86/kvm/x86.c b/arch/x86/kvm/x86.c index aabd3a2..28c4c72
> 100644
> --- a/arch/x86/kvm/x86.c
> +++ b/arch/x86/kvm/x86.c
> @@ -6991,15 +6991,14 @@ static int kvm_emulate_wbinvd_noskip(struct
> kvm_vcpu *vcpu)
>  		return X86EMUL_CONTINUE;
> 
>  	if (static_call(kvm_x86_has_wbinvd_exit)()) {
> -		int cpu = get_cpu();
> -
> -		cpumask_set_cpu(cpu, vcpu->arch.wbinvd_dirty_mask);
> -		on_each_cpu_mask(vcpu->arch.wbinvd_dirty_mask,
> +		preempt_disable();
> +		smp_call_function_many(vcpu->arch.wbinvd_dirty_mask,
>  				wbinvd_ipi, NULL, 1);
> -		put_cpu();
> +		preempt_enable();
>  		cpumask_clear(vcpu->arch.wbinvd_dirty_mask);
> -	} else
> -		wbinvd();
> +	}
> +
> +	wbinvd();
>  	return X86EMUL_CONTINUE;
>  }
> 
> --
> 1.7.1
Nadav Amit Oct. 22, 2021, 5:28 a.m. UTC | #2
> On Oct 21, 2021, at 9:16 PM, Li,Rongqing <lirongqing@baidu.com> wrote:
> 
> Ping 
> 
> -Li
> 
>> -----Original Message-----
>> From: Li,Rongqing <lirongqing@baidu.com>
>> Sent: October 13, 2021 17:43
>> To: x86@kernel.org; kvm@vger.kernel.org; Li,Rongqing
>> <lirongqing@baidu.com>
>> Subject: [PATCH][v2] KVM: x86: directly call wbinvd for local cpu when emulate
>> wbinvd
>> 
>> directly call wbinvd for local cpu, instead of calling atomic cpumask_set_cpu to
>> set local cpu, and then check if local cpu needs to run in on_each_cpu_mask
>> 
>> on_each_cpu_mask is less efficient than smp_call_function_many, since it will
>> close preempt again and running call function by checking flag with
>> SCF_RUN_LOCAL. and here wbinvd can be called directly
>> 
>> In fact, This change reverts commit 2eec73437487 ("KVM: x86: Avoid issuing
>> wbinvd twice"), since smp_call_function_many is skiping the local cpu (as
>> description of c2162e13d6e2f), wbinvd is not issued twice
>> 
>> and reverts commit c2162e13d6e2f ("KVM: X86: Fix missing local pCPU when
>> executing wbinvd on all dirty pCPUs") too, which fixed the previous patch, when
>> revert previous patch, it is not needed.
>> 
>> Signed-off-by: Li RongQing <lirongqing@baidu.com>
>> ---
>> diff v2: rewrite commit log
>> 
>> arch/x86/kvm/x86.c |   13 ++++++-------
>> 1 files changed, 6 insertions(+), 7 deletions(-)
>> 
>> diff --git a/arch/x86/kvm/x86.c b/arch/x86/kvm/x86.c index aabd3a2..28c4c72
>> 100644
>> --- a/arch/x86/kvm/x86.c
>> +++ b/arch/x86/kvm/x86.c
>> @@ -6991,15 +6991,14 @@ static int kvm_emulate_wbinvd_noskip(struct
>> kvm_vcpu *vcpu)
>> 		return X86EMUL_CONTINUE;
>> 
>> 	if (static_call(kvm_x86_has_wbinvd_exit)()) {
>> -		int cpu = get_cpu();
>> -
>> -		cpumask_set_cpu(cpu, vcpu->arch.wbinvd_dirty_mask);
>> -		on_each_cpu_mask(vcpu->arch.wbinvd_dirty_mask,
>> +		preempt_disable();
>> +		smp_call_function_many(vcpu->arch.wbinvd_dirty_mask,
>> 				wbinvd_ipi, NULL, 1);
>> -		put_cpu();
>> +		preempt_enable();
>> 		cpumask_clear(vcpu->arch.wbinvd_dirty_mask);
>> -	} else
>> -		wbinvd();
>> +	}
>> +
>> +	wbinvd();
>> 	return X86EMUL_CONTINUE;
>> }

KVM is none of my business, but on_each_cpu_mask() should be more
efficient since it would run wbinvd() concurrently locally and
remotely (this is a relatively recent change I made). wbinvd() is
an expensive operation, and preempt_enable() is cheap, so there
should not be a complicated tradeoff here.

The proposed change prevents running wbinvd() concurrently so
theoretically it should cause a 2x slowdown (for this specific
piece of code).
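
The 2x figure can be illustrated with a toy cost model. This is only a sketch under stated assumptions: the function names are hypothetical, costs are in arbitrary time units rather than measurements, IPI and preemption overhead are ignored, and the caller is assumed to wait for the remote flushes to finish (wait = 1) before issuing its own wbinvd() in the proposed code.

```c
/* Hypothetical cost model in arbitrary time units; not measured data. */

/* Proposed path: wait for the remote flushes, then flush locally. */
static inline unsigned long serial_cost(unsigned long local, unsigned long remote)
{
    return local + remote;
}

/* on_each_cpu_mask() path: local and remote flushes overlap in time. */
static inline unsigned long concurrent_cost(unsigned long local, unsigned long remote)
{
    return local > remote ? local : remote;
}
```

With equal local and remote costs of 100 units, serial_cost() gives 200 and concurrent_cost() gives 100, which is the theoretical factor of two; when one side dominates, the gap narrows.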
Li RongQing Oct. 22, 2021, 6:18 a.m. UTC | #3
> KVM is none of my business, but on_each_cpu_mask() should be more efficient
> since it would run wbinvd() concurrently locally and remotely (this is a relatively
> recent change I made). 

Thanks, I see

-Li

Patch

diff v2: rewrite commit log

 arch/x86/kvm/x86.c |   13 ++++++-------
 1 files changed, 6 insertions(+), 7 deletions(-)

diff --git a/arch/x86/kvm/x86.c b/arch/x86/kvm/x86.c
index aabd3a2..28c4c72 100644
--- a/arch/x86/kvm/x86.c
+++ b/arch/x86/kvm/x86.c
@@ -6991,15 +6991,14 @@  static int kvm_emulate_wbinvd_noskip(struct kvm_vcpu *vcpu)
 		return X86EMUL_CONTINUE;
 
 	if (static_call(kvm_x86_has_wbinvd_exit)()) {
-		int cpu = get_cpu();
-
-		cpumask_set_cpu(cpu, vcpu->arch.wbinvd_dirty_mask);
-		on_each_cpu_mask(vcpu->arch.wbinvd_dirty_mask,
+		preempt_disable();
+		smp_call_function_many(vcpu->arch.wbinvd_dirty_mask,
 				wbinvd_ipi, NULL, 1);
-		put_cpu();
+		preempt_enable();
 		cpumask_clear(vcpu->arch.wbinvd_dirty_mask);
-	} else
-		wbinvd();
+	}
+
+	wbinvd();
 	return X86EMUL_CONTINUE;
 }