[v3] KVM: x86: refine kvm_vcpu_is_preempted

Message ID 1642397842-46318-1-git-send-email-lirongqing@baidu.com (mailing list archive)
State New, archived

Commit Message

Li RongQing Jan. 17, 2022, 5:37 a.m. UTC
After supporting paravirtualized TLB shootdowns, steal_time.preempted
includes not only KVM_VCPU_PREEMPTED but also KVM_VCPU_FLUSH_TLB,
so kvm_vcpu_is_preempted() should test only KVM_VCPU_PREEMPTED.
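
Both flags live in the same steal_time.preempted byte; for reference,
their definitions in arch/x86/include/uapi/asm/kvm_para.h are:

  #define KVM_VCPU_PREEMPTED    (1 << 0)
  #define KVM_VCPU_FLUSH_TLB    (1 << 1)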

Co-developed-by: Sean Christopherson <seanjc@google.com>
Signed-off-by: Sean Christopherson <seanjc@google.com>
Signed-off-by: Li RongQing <lirongqing@baidu.com>
---

Comments

Peter Zijlstra Feb. 2, 2022, 2:54 p.m. UTC | #1
On Mon, Jan 17, 2022 at 01:37:22PM +0800, Li RongQing wrote:
> After supporting paravirtualized TLB shootdowns, steal_time.preempted
> includes not only KVM_VCPU_PREEMPTED but also KVM_VCPU_FLUSH_TLB,
> so kvm_vcpu_is_preempted() should test only KVM_VCPU_PREEMPTED.

This still fails to actually explain what the problem is. Why did you
write this patch?

> 
> Co-developed-by: Sean Christopherson <seanjc@google.com>
> Signed-off-by: Sean Christopherson <seanjc@google.com>
> Signed-off-by: Li RongQing <lirongqing@baidu.com>
> ---
> diff with v2: using andl and setnz
> diff with v1: clear 64bit rax
> 
>  arch/x86/kernel/kvm.c | 5 +++--
>  1 file changed, 3 insertions(+), 2 deletions(-)
> 
> diff --git a/arch/x86/kernel/kvm.c b/arch/x86/kernel/kvm.c
> index b061d17..fe0aead 100644
> --- a/arch/x86/kernel/kvm.c
> +++ b/arch/x86/kernel/kvm.c
> @@ -1025,8 +1025,9 @@ asm(
>  ".type __raw_callee_save___kvm_vcpu_is_preempted, @function;"
>  "__raw_callee_save___kvm_vcpu_is_preempted:"
>  "movq	__per_cpu_offset(,%rdi,8), %rax;"
> -"cmpb	$0, " __stringify(KVM_STEAL_TIME_preempted) "+steal_time(%rax);"
> -"setne	%al;"
> +"movb	" __stringify(KVM_STEAL_TIME_preempted) "+steal_time(%rax), %al;"
> +"andl	$" __stringify(KVM_VCPU_PREEMPTED) ", %eax;"
> +"setnz	%al;"

Isn't the below a simpler way of writing the same thing?

diff --git a/arch/x86/kernel/kvm.c b/arch/x86/kernel/kvm.c
index a438217cbfac..bc79adcf59ff 100644
--- a/arch/x86/kernel/kvm.c
+++ b/arch/x86/kernel/kvm.c
@@ -1025,7 +1025,7 @@ asm(
 ".type __raw_callee_save___kvm_vcpu_is_preempted, @function;"
 "__raw_callee_save___kvm_vcpu_is_preempted:"
 "movq	__per_cpu_offset(,%rdi,8), %rax;"
-"cmpb	$0, " __stringify(KVM_STEAL_TIME_preempted) "+steal_time(%rax);"
+"testb	$" __stringify(KVM_VCPU_PREEMPTED) ", " __stringify(KVM_STEAL_TIME_preempted) "+steal_time(%rax);"
 "setne	%al;"
 "ret;"
 ".size __raw_callee_save___kvm_vcpu_is_preempted, .-__raw_callee_save___kvm_vcpu_is_preempted;"
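
(For completeness: testb computes the bitwise AND of the immediate and
the memory operand, updates the flags, and discards the result, so it
folds the movb+andl pair into a single instruction; the setne then keys
off ZF exactly as setnz does, the two being encodings of the same
opcode.)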
Sean Christopherson Feb. 2, 2022, 4:48 p.m. UTC | #2
On Wed, Feb 02, 2022, Peter Zijlstra wrote:
> On Mon, Jan 17, 2022 at 01:37:22PM +0800, Li RongQing wrote:
> > After supporting paravirtualized TLB shootdowns, steal_time.preempted
> > includes not only KVM_VCPU_PREEMPTED but also KVM_VCPU_FLUSH_TLB,
> > so kvm_vcpu_is_preempted() should test only KVM_VCPU_PREEMPTED.
> 
> This still fails to actually explain what the problem is. Why did you
> write this patch?

Ya, it definitely is lacking details.  I think this captures everything...

  Tweak the assembly code for detecting KVM_VCPU_PREEMPTED to future-proof
  it against new features, code refactorings, and theoretical compiler
  behavior.

  Explicitly test only the KVM_VCPU_PREEMPTED flag; steal_time.preempted
  has already been overloaded once for KVM_VCPU_FLUSH_TLB, and checking the
  entire byte for a non-zero value could yield a false positive.  This
  currently isn't problematic as PREEMPTED and FLUSH_TLB are mutually
  exclusive, but that may not hold true for future flags.

  Use AND instead of TEST for querying PREEMPTED to clear RAX[63:8] before
  returning to avoid a potential false positive in the caller due to leaving
  the address (non-zero value) in the upper bits.  Compilers are technically
  allowed to use more than a byte for storing _Bool, and it would be all too
  easy for someone to refactor the return type to something larger.

  Keep the SETcc (but change it to setnz for sanity's sake) as the fact that
  KVM_VCPU_PREEMPTED happens to be bit 0, i.e. the AND will yield 0/1 as
  needed for _Bool, is pure coincidence.
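
  For reference, the resulting sequence with the __stringify() wrappers
  expanded, annotated:

    movq __per_cpu_offset(,%rdi,8), %rax                 # rax = vCPU rdi's per-CPU base
    movb KVM_STEAL_TIME_preempted+steal_time(%rax), %al  # load the flags byte
    andl $KVM_VCPU_PREEMPTED, %eax                       # isolate bit 0, clear rax[63:8]
    setnz %al                                            # al = 1 iff PREEMPTED was set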

> > Co-developed-by: Sean Christopherson <seanjc@google.com>
> > Signed-off-by: Sean Christopherson <seanjc@google.com>
> > Signed-off-by: Li RongQing <lirongqing@baidu.com>
> > ---
> > diff with v2: using andl and setnz
> > diff with v1: clear 64bit rax
> > 
> >  arch/x86/kernel/kvm.c | 5 +++--
> >  1 file changed, 3 insertions(+), 2 deletions(-)
> > 
> > diff --git a/arch/x86/kernel/kvm.c b/arch/x86/kernel/kvm.c
> > index b061d17..fe0aead 100644
> > --- a/arch/x86/kernel/kvm.c
> > +++ b/arch/x86/kernel/kvm.c
> > @@ -1025,8 +1025,9 @@ asm(
> >  ".type __raw_callee_save___kvm_vcpu_is_preempted, @function;"
> >  "__raw_callee_save___kvm_vcpu_is_preempted:"
> >  "movq	__per_cpu_offset(,%rdi,8), %rax;"
> > -"cmpb	$0, " __stringify(KVM_STEAL_TIME_preempted) "+steal_time(%rax);"
> > -"setne	%al;"
> > +"movb	" __stringify(KVM_STEAL_TIME_preempted) "+steal_time(%rax), %al;"
> > +"andl	$" __stringify(KVM_VCPU_PREEMPTED) ", %eax;"
> > +"setnz	%al;"
> 
> Isn't the below a simpler way of writing the same thing?

The AND is needed to clear RAX[63:8], e.g. to avoid breakage if the return value
is changed from a bool to something larger, or the compiler uses more than a byte
for _Bool.
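
Concretely, the movq leaves the per-CPU base address in rax[63:8], and
TEST doesn't touch it.  A hypothetical future caller could then misread
the result:

  /* hypothetical: return type widened from bool to long */
  long preempted = __kvm_vcpu_is_preempted(cpu);
  if (preempted)  /* tests all of rax, sees the stale address bits, */
      ...         /* and reports "preempted" even when %al is 0     */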

> diff --git a/arch/x86/kernel/kvm.c b/arch/x86/kernel/kvm.c
> index a438217cbfac..bc79adcf59ff 100644
> --- a/arch/x86/kernel/kvm.c
> +++ b/arch/x86/kernel/kvm.c
> @@ -1025,7 +1025,7 @@ asm(
>  ".type __raw_callee_save___kvm_vcpu_is_preempted, @function;"
>  "__raw_callee_save___kvm_vcpu_is_preempted:"
>  "movq	__per_cpu_offset(,%rdi,8), %rax;"
> -"cmpb	$0, " __stringify(KVM_STEAL_TIME_preempted) "+steal_time(%rax);"
> +"testb	$" __stringify(KVM_VCPU_PREEMPTED) ", " __stringify(KVM_STEAL_TIME_preempted) "+steal_time(%rax);"
>  "setne	%al;"
>  "ret;"
>  ".size __raw_callee_save___kvm_vcpu_is_preempted, .-__raw_callee_save___kvm_vcpu_is_preempted;"
Peter Zijlstra Feb. 2, 2022, 10:17 p.m. UTC | #3
On Wed, Feb 02, 2022 at 04:48:52PM +0000, Sean Christopherson wrote:

> > > +"movb	" __stringify(KVM_STEAL_TIME_preempted) "+steal_time(%rax), %al;"
> > > +"andl	$" __stringify(KVM_VCPU_PREEMPTED) ", %eax;"
> > > +"setnz	%al;"
> > 
> > Isn't the below a simpler way of writing the same thing?
> 
> The AND is needed to clear RAX[63:8], e.g. to avoid breakage if the return value
> is changed from a bool to something larger, or the compiler uses more than a byte
> for _Bool.

While C doesn't specify _Bool, the arch ABI most certainly does, and the
x86_64 ABI guarantees a byte there; changing the return type is the only
possible option here.
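
(For a _Bool return the psABI makes only %al meaningful, so callers
test just that byte; gcc typically emits something like:

  call __raw_callee_save___kvm_vcpu_is_preempted
  testb %al, %al
  jne .Lpreempted  # hypothetical label

and the upper bits of %rax stay dead until the prototype changes.)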

Anyway, I don't mind the patch and the proposed Changelog does clarify
that this is an exercise in paranoia.
Li RongQing Feb. 6, 2022, 11:26 a.m. UTC | #4
> On Wed, Feb 02, 2022, Peter Zijlstra wrote:
> > On Mon, Jan 17, 2022 at 01:37:22PM +0800, Li RongQing wrote:
> > > After supporting paravirtualized TLB shootdowns, steal_time.preempted
> > > includes not only KVM_VCPU_PREEMPTED but also KVM_VCPU_FLUSH_TLB,
> > > so kvm_vcpu_is_preempted() should test only KVM_VCPU_PREEMPTED.
> >
> > This still fails to actually explain what the problem is. Why did you
> > write this patch?
> 
> Ya, it definitely is lacking details.  I think this captures everything...
> 
>   Tweak the assembly code for detecting KVM_VCPU_PREEMPTED to future-proof
>   it against new features, code refactorings, and theoretical compiler
>   behavior.
> 
>   Explicitly test only the KVM_VCPU_PREEMPTED flag; steal_time.preempted
>   has already been overloaded once for KVM_VCPU_FLUSH_TLB, and checking the
>   entire byte for a non-zero value could yield a false positive.  This
>   currently isn't problematic as PREEMPTED and FLUSH_TLB are mutually
>   exclusive, but that may not hold true for future flags.
> 
>   Use AND instead of TEST for querying PREEMPTED to clear RAX[63:8] before
>   returning to avoid a potential false positive in the caller due to leaving
>   the address (non-zero value) in the upper bits.  Compilers are technically
>   allowed to use more than a byte for storing _Bool, and it would be all too
>   easy for someone to refactor the return type to something larger.
> 
>   Keep the SETcc (but change it to setnz for sanity's sake) as the fact that
>   KVM_VCPU_PREEMPTED happens to be bit 0, i.e. the AND will yield 0/1 as
>   needed for _Bool, is pure coincidence.


Sean:

Could you resubmit this patch? Thanks.

-Li

Patch

diff with v2: using andl and setnz
diff with v1: clear 64bit rax

 arch/x86/kernel/kvm.c | 5 +++--
 1 file changed, 3 insertions(+), 2 deletions(-)

diff --git a/arch/x86/kernel/kvm.c b/arch/x86/kernel/kvm.c
index b061d17..fe0aead 100644
--- a/arch/x86/kernel/kvm.c
+++ b/arch/x86/kernel/kvm.c
@@ -1025,8 +1025,9 @@ asm(
 ".type __raw_callee_save___kvm_vcpu_is_preempted, @function;"
 "__raw_callee_save___kvm_vcpu_is_preempted:"
 "movq	__per_cpu_offset(,%rdi,8), %rax;"
-"cmpb	$0, " __stringify(KVM_STEAL_TIME_preempted) "+steal_time(%rax);"
-"setne	%al;"
+"movb	" __stringify(KVM_STEAL_TIME_preempted) "+steal_time(%rax), %al;"
+"andl	$" __stringify(KVM_VCPU_PREEMPTED) ", %eax;"
+"setnz	%al;"
 "ret;"
 ".size __raw_callee_save___kvm_vcpu_is_preempted, .-__raw_callee_save___kvm_vcpu_is_preempted;"
 ".popsection");