xen/x86: lock cacheline for add_sized()

Message ID	20190806070000.13718-1-jgross@suse.com (mailing list archive)
State	Superseded
Headers	show Return-Path: <xen-devel-bounces@lists.xenproject.org> From: Juergen Gross <jgross@suse.com> To: xen-devel@lists.xenproject.org Date: Tue, 6 Aug 2019 09:00:00 +0200 Message-Id: <20190806070000.13718-1-jgross@suse.com> Subject: [Xen-devel] [PATCH] xen/x86: lock cacheline for add_sized() Precedence: list Cc: Juergen Gross <jgross@suse.com>, Andrew Cooper <andrew.cooper3@citrix.com>, Wei Liu <wl@xen.org>, Jan Beulich <jbeulich@suse.com>, =?utf-8?q?Roger_Pau_Monn=C3=A9?= <roger.pau@citrix.com> MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: base64 Errors-To: xen-devel-bounces@lists.xenproject.org Sender: "Xen-devel" <xen-devel-bounces@lists.xenproject.org>
Series	xen/x86: lock cacheline for add_sized() \| expand xen/x86: lock cacheline for add_sized()

Message ID

20190806070000.13718-1-jgross@suse.com (mailing list archive)

State

Superseded

Headers

From: Juergen Gross <jgross@suse.com>
To: xen-devel@lists.xenproject.org
Date: Tue,  6 Aug 2019 09:00:00 +0200
Message-Id: <20190806070000.13718-1-jgross@suse.com>
Subject: [Xen-devel] [PATCH] xen/x86: lock cacheline for add_sized()
Precedence: list
Cc: Juergen Gross <jgross@suse.com>,
 Andrew Cooper <andrew.cooper3@citrix.com>, Wei Liu <wl@xen.org>,
 Jan Beulich <jbeulich@suse.com>,
 =?utf-8?q?Roger_Pau_Monn=C3=A9?= <roger.pau@citrix.com>
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: base64
Errors-To: xen-devel-bounces@lists.xenproject.org
Sender: "Xen-devel" <xen-devel-bounces@lists.xenproject.org>

Series

xen/x86: lock cacheline for add_sized() | expand

Commit Message

Jürgen Groß Aug. 6, 2019, 7 a.m. UTC

add_sized() should use an atomic update of the memory word, as it is
used by spin_unlock().

With ticket locks unlocking needs to be atomic in order to avoid very
rare races when someone tries to get the lock while the lock holder
is just releasing it.

Signed-off-by: Juergen Gross <jgross@suse.com>
---
 xen/include/asm-x86/atomic.h | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

Comments

Jan Beulich Aug. 6, 2019, 8:26 a.m. UTC | #1

On 06.08.2019 09:00, Juergen Gross wrote:
> add_sized() should use an atomic update of the memory word, as it is
> used by spin_unlock().
> 
> With ticket locks unlocking needs to be atomic in order to avoid very
> rare races when someone tries to get the lock while the lock holder
> is just releasing it.

Considering the commit introducing the function (3c694aec08)
explicitly saying "The add is /not/ atomic" this needs more detail.
The idea behind not using a LOCKed access here was, iirc, that
no-one else could ever update the head pointer; someone trying to
acquire the lock would only write to the tail portion. But yes, I
think I can see how this was wrong: The arch_fetch_and_add() acts
on head_tail after all, not just tail.

> --- a/xen/include/asm-x86/atomic.h
> +++ b/xen/include/asm-x86/atomic.h
> @@ -21,7 +21,7 @@ static inline void name(volatile type *addr, type val) \
>   #define build_add_sized(name, size, type, reg) \
>       static inline void name(volatile type *addr, type val)              \
>       {                                                                   \
> -        asm volatile("add" size " %1,%0"                                \
> +        asm volatile("lock; add" size " %1,%0"                          \

I realize pre-existing code in our tree uses this not fully correct
form of specifying prefixes (there shouldn't really be a line
separator between prefix and main mnemonic, unless gas would refuse
assembling the result because of it wrongly thinking the prefix was
inapplicable to the insn, like is e.g. the case here and there for
the REP prefixes), but I think we should try to avoid widening the
issue. See e.g. gnttab_clear_flags() where we already have no
semicolon, and this has gone fine since around 4.2. I seem to vaguely
recall that Linux has been using this construct to avoid build issues
with some specific (Sun?) assembler.

Jan

Jürgen Groß Aug. 6, 2019, 8:34 a.m. UTC | #2

On 06.08.19 10:26, Jan Beulich wrote:
> On 06.08.2019 09:00, Juergen Gross wrote:
>> add_sized() should use an atomic update of the memory word, as it is
>> used by spin_unlock().
>>
>> With ticket locks unlocking needs to be atomic in order to avoid very
>> rare races when someone tries to get the lock while the lock holder
>> is just releasing it.
> 
> Considering the commit introducing the function (3c694aec08)
> explicitly saying "The add is /not/ atomic" this needs more detail.
> The idea behind not using a LOCKed access here was, iirc, that
> no-one else could ever update the head pointer; someone trying to
> acquire the lock would only write to the tail portion. But yes, I
> think I can see how this was wrong: The arch_fetch_and_add() acts
> on head_tail after all, not just tail.

I'll update the commit message.

> 
>> --- a/xen/include/asm-x86/atomic.h
>> +++ b/xen/include/asm-x86/atomic.h
>> @@ -21,7 +21,7 @@ static inline void name(volatile type *addr, type val) \
>>    #define build_add_sized(name, size, type, reg) \
>>        static inline void name(volatile type *addr, type val)              \
>>        {                                                                   \
>> -        asm volatile("add" size " %1,%0"                                \
>> +        asm volatile("lock; add" size " %1,%0"                          \
> 
> I realize pre-existing code in our tree uses this not fully correct
> form of specifying prefixes (there shouldn't really be a line
> separator between prefix and main mnemonic, unless gas would refuse
> assembling the result because of it wrongly thinking the prefix was
> inapplicable to the insn, like is e.g. the case here and there for
> the REP prefixes), but I think we should try to avoid widening the
> issue. See e.g. gnttab_clear_flags() where we already have no
> semicolon, and this has gone fine since around 4.2. I seem to vaguely
> recall that Linux has been using this construct to avoid build issues
> with some specific (Sun?) assembler.

Okay, I'll remove the semicolon.


Juergen

diff --git a/xen/include/asm-x86/atomic.h b/xen/include/asm-x86/atomic.h
index 682bcf91b1..0ef6a60e69 100644
--- a/xen/include/asm-x86/atomic.h
+++ b/xen/include/asm-x86/atomic.h
@@ -21,7 +21,7 @@  static inline void name(volatile type *addr, type val) \
 #define build_add_sized(name, size, type, reg) \
     static inline void name(volatile type *addr, type val)              \
     {                                                                   \
-        asm volatile("add" size " %1,%0"                                \
+        asm volatile("lock; add" size " %1,%0"                          \
                      : "=m" (*addr)                                     \
                      : reg (val));                                      \
     }

xen/x86: lock cacheline for add_sized()

Commit Message

Comments

Patch