diff mbox series

[net] net/tls: fix slab-out-of-bounds bug in decrypt_internal

Message ID 20220330085009.1011614-1-william.xuanziyang@huawei.com (mailing list archive)
State Superseded
Delegated to: Netdev Maintainers
Headers show
Series [net] net/tls: fix slab-out-of-bounds bug in decrypt_internal | expand

Checks

Context Check Description
netdev/tree_selection success Clearly marked for net
netdev/fixes_present success Fixes tag present in non-next series
netdev/subject_prefix success Link
netdev/cover_letter success Single patches do not need cover letters
netdev/patch_count success Link
netdev/header_inline success No static functions without inline keyword in header files
netdev/build_32bit success Errors and warnings before: 0 this patch: 0
netdev/cc_maintainers success CCed 8 of 8 maintainers
netdev/build_clang success Errors and warnings before: 0 this patch: 0
netdev/module_param success Was 0 now: 0
netdev/verify_signedoff success Signed-off-by tag matches author and committer
netdev/verify_fixes success Fixes tag looks correct
netdev/build_allmodconfig_warn success Errors and warnings before: 0 this patch: 0
netdev/checkpatch success total: 0 errors, 0 warnings, 0 checks, 22 lines checked
netdev/kdoc success Errors and warnings before: 0 this patch: 0
netdev/source_inline success Was 0 now: 0

Commit Message

Ziyang Xuan (William) March 30, 2022, 8:50 a.m. UTC
The memory size of tls_ctx->rx.iv for AES128-CCM is 12 setting in
tls_set_sw_offload(). The return value of crypto_aead_ivsize()
for "ccm(aes)" is 16. So memcpy() require 16 bytes from 12 bytes
memory space will trigger slab-out-of-bounds bug as following:

==================================================================
BUG: KASAN: slab-out-of-bounds in decrypt_internal+0x385/0xc40 [tls]
Read of size 16 at addr ffff888114e84e60 by task tls/10911

Call Trace:
 <TASK>
 dump_stack_lvl+0x34/0x44
 print_report.cold+0x5e/0x5db
 ? decrypt_internal+0x385/0xc40 [tls]
 kasan_report+0xab/0x120
 ? decrypt_internal+0x385/0xc40 [tls]
 kasan_check_range+0xf9/0x1e0
 memcpy+0x20/0x60
 decrypt_internal+0x385/0xc40 [tls]
 ? tls_get_rec+0x2e0/0x2e0 [tls]
 ? process_rx_list+0x1a5/0x420 [tls]
 ? tls_setup_from_iter.constprop.0+0x2e0/0x2e0 [tls]
 decrypt_skb_update+0x9d/0x400 [tls]
 tls_sw_recvmsg+0x3c8/0xb50 [tls]

Allocated by task 10911:
 kasan_save_stack+0x1e/0x40
 __kasan_kmalloc+0x81/0xa0
 tls_set_sw_offload+0x2eb/0xa20 [tls]
 tls_setsockopt+0x68c/0x700 [tls]
 __sys_setsockopt+0xfe/0x1b0

Reserve MAX_IV_SIZE memory space for iv to be compatible with all
ciphers. And do iv and salt copy like done in tls_do_encryption().

Fixes: f295b3ae9f59 ("net/tls: Add support of AES128-CCM based ciphers")
Signed-off-by: Ziyang Xuan <william.xuanziyang@huawei.com>
---
 net/tls/tls_sw.c | 10 +++-------
 1 file changed, 3 insertions(+), 7 deletions(-)

Comments

Jakub Kicinski March 30, 2022, 4:39 p.m. UTC | #1
On Wed, 30 Mar 2022 16:50:09 +0800 Ziyang Xuan wrote:
> The memory size of tls_ctx->rx.iv for AES128-CCM is 12 setting in
> tls_set_sw_offload(). The return value of crypto_aead_ivsize()
> for "ccm(aes)" is 16. So memcpy() require 16 bytes from 12 bytes
> memory space will trigger slab-out-of-bounds bug as following:
> 
> ==================================================================
> BUG: KASAN: slab-out-of-bounds in decrypt_internal+0x385/0xc40 [tls]
> Read of size 16 at addr ffff888114e84e60 by task tls/10911
> 
> Call Trace:
>  <TASK>
>  dump_stack_lvl+0x34/0x44
>  print_report.cold+0x5e/0x5db
>  ? decrypt_internal+0x385/0xc40 [tls]
>  kasan_report+0xab/0x120
>  ? decrypt_internal+0x385/0xc40 [tls]
>  kasan_check_range+0xf9/0x1e0
>  memcpy+0x20/0x60
>  decrypt_internal+0x385/0xc40 [tls]
>  ? tls_get_rec+0x2e0/0x2e0 [tls]
>  ? process_rx_list+0x1a5/0x420 [tls]
>  ? tls_setup_from_iter.constprop.0+0x2e0/0x2e0 [tls]
>  decrypt_skb_update+0x9d/0x400 [tls]
>  tls_sw_recvmsg+0x3c8/0xb50 [tls]
> 
> Allocated by task 10911:
>  kasan_save_stack+0x1e/0x40
>  __kasan_kmalloc+0x81/0xa0
>  tls_set_sw_offload+0x2eb/0xa20 [tls]
>  tls_setsockopt+0x68c/0x700 [tls]
>  __sys_setsockopt+0xfe/0x1b0

Interesting, are you running on non-x86 platform or with some crypto
accelerator? I wonder why we're not hitting it with KASAN and the
selftest we have.

> Reserve MAX_IV_SIZE memory space for iv to be compatible with all
> ciphers. And do iv and salt copy like done in tls_do_encryption().
> 
> Fixes: f295b3ae9f59 ("net/tls: Add support of AES128-CCM based ciphers")
> Signed-off-by: Ziyang Xuan <william.xuanziyang@huawei.com>
> ---
>  net/tls/tls_sw.c | 10 +++-------
>  1 file changed, 3 insertions(+), 7 deletions(-)
> 
> diff --git a/net/tls/tls_sw.c b/net/tls/tls_sw.c
> index 0024a692f0f8..6b858f995b23 100644
> --- a/net/tls/tls_sw.c
> +++ b/net/tls/tls_sw.c
> @@ -1456,7 +1456,7 @@ static int decrypt_internal(struct sock *sk, struct sk_buff *skb,
>  	aead_size = sizeof(*aead_req) + crypto_aead_reqsize(ctx->aead_recv);
>  	mem_size = aead_size + (nsg * sizeof(struct scatterlist));
>  	mem_size = mem_size + prot->aad_size;
> -	mem_size = mem_size + crypto_aead_ivsize(ctx->aead_recv);
> +	mem_size = mem_size + MAX_IV_SIZE;

This change is not strictly required for the patch, right?
Can we drop it, and perhaps send as an optimization separately later?

>  	/* Allocate a single block of memory which contains
>  	 * aead_req || sgin[] || sgout[] || aad || iv.
> @@ -1493,12 +1493,8 @@ static int decrypt_internal(struct sock *sk, struct sk_buff *skb,
>  		kfree(mem);
>  		return err;
>  	}
> -	if (prot->version == TLS_1_3_VERSION ||
> -	    prot->cipher_type == TLS_CIPHER_CHACHA20_POLY1305)
> -		memcpy(iv + iv_offset, tls_ctx->rx.iv,
> -		       crypto_aead_ivsize(ctx->aead_recv));
> -	else
> -		memcpy(iv + iv_offset, tls_ctx->rx.iv, prot->salt_size);
> +	memcpy(iv + iv_offset, tls_ctx->rx.iv,
> +	       prot->iv_size + prot->salt_size);

If the IV really is 16B then we're passing 4 bytes of uninitialized
data at the end of the buffer, right?

>  	xor_iv_with_seq(prot, iv + iv_offset, tls_ctx->rx.rec_seq);
>
Jakub Kicinski March 30, 2022, 8:24 p.m. UTC | #2
On Wed, 30 Mar 2022 09:39:25 -0700 Jakub Kicinski wrote:
> On Wed, 30 Mar 2022 16:50:09 +0800 Ziyang Xuan wrote:
> > The memory size of tls_ctx->rx.iv for AES128-CCM is 12 setting in
> > tls_set_sw_offload(). The return value of crypto_aead_ivsize()
> > for "ccm(aes)" is 16. So memcpy() require 16 bytes from 12 bytes
> > memory space will trigger slab-out-of-bounds bug as following:
> > 
> > ==================================================================
> > BUG: KASAN: slab-out-of-bounds in decrypt_internal+0x385/0xc40 [tls]
> > Read of size 16 at addr ffff888114e84e60 by task tls/10911
> > 
> > Call Trace:
> >  <TASK>
> >  dump_stack_lvl+0x34/0x44
> >  print_report.cold+0x5e/0x5db
> >  ? decrypt_internal+0x385/0xc40 [tls]
> >  kasan_report+0xab/0x120
> >  ? decrypt_internal+0x385/0xc40 [tls]
> >  kasan_check_range+0xf9/0x1e0
> >  memcpy+0x20/0x60
> >  decrypt_internal+0x385/0xc40 [tls]
> >  ? tls_get_rec+0x2e0/0x2e0 [tls]
> >  ? process_rx_list+0x1a5/0x420 [tls]
> >  ? tls_setup_from_iter.constprop.0+0x2e0/0x2e0 [tls]
> >  decrypt_skb_update+0x9d/0x400 [tls]
> >  tls_sw_recvmsg+0x3c8/0xb50 [tls]
> > 
> > Allocated by task 10911:
> >  kasan_save_stack+0x1e/0x40
> >  __kasan_kmalloc+0x81/0xa0
> >  tls_set_sw_offload+0x2eb/0xa20 [tls]
> >  tls_setsockopt+0x68c/0x700 [tls]
> >  __sys_setsockopt+0xfe/0x1b0  
> 
> Interesting, are you running on non-x86 platform or with some crypto
> accelerator? I wonder why we're not hitting it with KASAN and the
> selftest we have.

I take that back, I can repro on x86 and 5.17, not sure why we're only
discovering this now.

Noob question for crypto folks, ivsize for AES CCM is reported 
as 16, but the real nonce size is 13 for TLS (q == 2, n == 13
using NIST's variable names AFAICT). Are we required to zero out 
the rest of the buffer?

In particular I think I've seen transient crypto failures with
SM4 CCM in the past and zeroing the tail of the iv buffer seems
to make the tests pass reliably.

> > Reserve MAX_IV_SIZE memory space for iv to be compatible with all
> > ciphers. And do iv and salt copy like done in tls_do_encryption().
> > 
> > Fixes: f295b3ae9f59 ("net/tls: Add support of AES128-CCM based ciphers")
> > Signed-off-by: Ziyang Xuan <william.xuanziyang@huawei.com>
> > ---
> >  net/tls/tls_sw.c | 10 +++-------
> >  1 file changed, 3 insertions(+), 7 deletions(-)
> > 
> > diff --git a/net/tls/tls_sw.c b/net/tls/tls_sw.c
> > index 0024a692f0f8..6b858f995b23 100644
> > --- a/net/tls/tls_sw.c
> > +++ b/net/tls/tls_sw.c
> > @@ -1456,7 +1456,7 @@ static int decrypt_internal(struct sock *sk, struct sk_buff *skb,
> >  	aead_size = sizeof(*aead_req) + crypto_aead_reqsize(ctx->aead_recv);
> >  	mem_size = aead_size + (nsg * sizeof(struct scatterlist));
> >  	mem_size = mem_size + prot->aad_size;
> > -	mem_size = mem_size + crypto_aead_ivsize(ctx->aead_recv);
> > +	mem_size = mem_size + MAX_IV_SIZE;  
> 
> This change is not strictly required for the patch, right?
> Can we drop it, and perhaps send as an optimization separately later?
> 
> >  	/* Allocate a single block of memory which contains
> >  	 * aead_req || sgin[] || sgout[] || aad || iv.
> > @@ -1493,12 +1493,8 @@ static int decrypt_internal(struct sock *sk, struct sk_buff *skb,
> >  		kfree(mem);
> >  		return err;
> >  	}
> > -	if (prot->version == TLS_1_3_VERSION ||
> > -	    prot->cipher_type == TLS_CIPHER_CHACHA20_POLY1305)
> > -		memcpy(iv + iv_offset, tls_ctx->rx.iv,
> > -		       crypto_aead_ivsize(ctx->aead_recv));
> > -	else
> > -		memcpy(iv + iv_offset, tls_ctx->rx.iv, prot->salt_size);
> > +	memcpy(iv + iv_offset, tls_ctx->rx.iv,
> > +	       prot->iv_size + prot->salt_size);  
> 
> If the IV really is 16B then we're passing 4 bytes of uninitialized
> data at the end of the buffer, right?
> 
> >  	xor_iv_with_seq(prot, iv + iv_offset, tls_ctx->rx.rec_seq);

FWIW this is the fix I tested:

diff --git a/net/tls/tls_sw.c b/net/tls/tls_sw.c
index 0024a692f0f8..dbc6bce01898 100644
--- a/net/tls/tls_sw.c
+++ b/net/tls/tls_sw.c
@@ -1473,6 +1473,8 @@ static int decrypt_internal(struct sock *sk, struct sk_buff *skb,
 	aad = (u8 *)(sgout + n_sgout);
 	iv = aad + prot->aad_size;
 
+	/* Prepare IV */
+	memset(iv, 0, crypto_aead_ivsize(ctx->aead_recv));
 	/* For CCM based ciphers, first byte of nonce+iv is a constant */
 	switch (prot->cipher_type) {
 	case TLS_CIPHER_AES_CCM_128:
@@ -1485,21 +1487,20 @@ static int decrypt_internal(struct sock *sk, struct sk_buff *skb,
 		break;
 	}
 
-	/* Prepare IV */
-	err = skb_copy_bits(skb, rxm->offset + TLS_HEADER_SIZE,
-			    iv + iv_offset + prot->salt_size,
-			    prot->iv_size);
-	if (err < 0) {
-		kfree(mem);
-		return err;
-	}
 	if (prot->version == TLS_1_3_VERSION ||
-	    prot->cipher_type == TLS_CIPHER_CHACHA20_POLY1305)
+	    prot->cipher_type == TLS_CIPHER_CHACHA20_POLY1305) {
 		memcpy(iv + iv_offset, tls_ctx->rx.iv,
-		       crypto_aead_ivsize(ctx->aead_recv));
-	else
+		       prot->iv_size + prot->salt_size);
+	} else {
+		err = skb_copy_bits(skb, rxm->offset + TLS_HEADER_SIZE,
+				    iv + iv_offset + prot->salt_size,
+				    prot->iv_size);
+		if (err < 0) {
+			kfree(mem);
+			return err;
+		}
 		memcpy(iv + iv_offset, tls_ctx->rx.iv, prot->salt_size);
-
+	}
 	xor_iv_with_seq(prot, iv + iv_offset, tls_ctx->rx.rec_seq);
 
 	/* Prepare AAD */
Jakub Kicinski March 30, 2022, 9:43 p.m. UTC | #3
On Wed, 30 Mar 2022 13:24:06 -0700 Jakub Kicinski wrote:
> Noob question for crypto folks, ivsize for AES CCM is reported 
> as 16, but the real nonce size is 13 for TLS (q == 2, n == 13
> using NIST's variable names AFAICT). Are we required to zero out 
> the rest of the buffer?

I guess we don't, set_msg_len() explicitly clears the tail of 
the buffer. Hopefully KASAN won't be upset about the uninit
read in format_input(), since it memcpy()s the entire 16B of iv.

> In particular I think I've seen transient crypto failures with
> SM4 CCM in the past and zeroing the tail of the iv buffer seems
> to make the tests pass reliably.
Ard Biesheuvel March 30, 2022, 9:44 p.m. UTC | #4
On Wed, 30 Mar 2022 at 22:24, Jakub Kicinski <kuba@kernel.org> wrote:
>
> On Wed, 30 Mar 2022 09:39:25 -0700 Jakub Kicinski wrote:
> > On Wed, 30 Mar 2022 16:50:09 +0800 Ziyang Xuan wrote:
> > > The memory size of tls_ctx->rx.iv for AES128-CCM is 12 setting in
> > > tls_set_sw_offload(). The return value of crypto_aead_ivsize()
> > > for "ccm(aes)" is 16. So memcpy() require 16 bytes from 12 bytes
> > > memory space will trigger slab-out-of-bounds bug as following:
> > >
> > > ==================================================================
> > > BUG: KASAN: slab-out-of-bounds in decrypt_internal+0x385/0xc40 [tls]
> > > Read of size 16 at addr ffff888114e84e60 by task tls/10911
> > >
> > > Call Trace:
> > >  <TASK>
> > >  dump_stack_lvl+0x34/0x44
> > >  print_report.cold+0x5e/0x5db
> > >  ? decrypt_internal+0x385/0xc40 [tls]
> > >  kasan_report+0xab/0x120
> > >  ? decrypt_internal+0x385/0xc40 [tls]
> > >  kasan_check_range+0xf9/0x1e0
> > >  memcpy+0x20/0x60
> > >  decrypt_internal+0x385/0xc40 [tls]
> > >  ? tls_get_rec+0x2e0/0x2e0 [tls]
> > >  ? process_rx_list+0x1a5/0x420 [tls]
> > >  ? tls_setup_from_iter.constprop.0+0x2e0/0x2e0 [tls]
> > >  decrypt_skb_update+0x9d/0x400 [tls]
> > >  tls_sw_recvmsg+0x3c8/0xb50 [tls]
> > >
> > > Allocated by task 10911:
> > >  kasan_save_stack+0x1e/0x40
> > >  __kasan_kmalloc+0x81/0xa0
> > >  tls_set_sw_offload+0x2eb/0xa20 [tls]
> > >  tls_setsockopt+0x68c/0x700 [tls]
> > >  __sys_setsockopt+0xfe/0x1b0
> >
> > Interesting, are you running on non-x86 platform or with some crypto
> > accelerator? I wonder why we're not hitting it with KASAN and the
> > selftest we have.
>
> I take that back, I can repro on x86 and 5.17, not sure why we're only
> discovering this now.
>
> Noob question for crypto folks, ivsize for AES CCM is reported
> as 16, but the real nonce size is 13 for TLS (q == 2, n == 13
> using NIST's variable names AFAICT). Are we required to zero out
> the rest of the buffer?
>

Looking at crypto/ccm.c and the arm64 accelerated implementation, it
appears the driver takes care of this: the first byte of the IV (q in
your example, but L in the crypto code) is the number of bytes minus
one that will be used for the counter, which starts at 0x1 for the CTR
cipher stream generation but is reset to 0x0 to encrypt the
authentication tag. Both drivers do a memset() to zero the last q+1
bytes of the IV.

> In particular I think I've seen transient crypto failures with
> SM4 CCM in the past and zeroing the tail of the iv buffer seems
> to make the tests pass reliably.
>

Yes, that seems like a bug, although there is only a single
implementation of the combined SM4-CCM transform in the tree, and
generic SM4 in C would be combined with the CCM chaining mode driver,
which is also used for generic AES.


> > > Reserve MAX_IV_SIZE memory space for iv to be compatible with all
> > > ciphers. And do iv and salt copy like done in tls_do_encryption().
> > >
> > > Fixes: f295b3ae9f59 ("net/tls: Add support of AES128-CCM based ciphers")
> > > Signed-off-by: Ziyang Xuan <william.xuanziyang@huawei.com>
> > > ---
> > >  net/tls/tls_sw.c | 10 +++-------
> > >  1 file changed, 3 insertions(+), 7 deletions(-)
> > >
> > > diff --git a/net/tls/tls_sw.c b/net/tls/tls_sw.c
> > > index 0024a692f0f8..6b858f995b23 100644
> > > --- a/net/tls/tls_sw.c
> > > +++ b/net/tls/tls_sw.c
> > > @@ -1456,7 +1456,7 @@ static int decrypt_internal(struct sock *sk, struct sk_buff *skb,
> > >     aead_size = sizeof(*aead_req) + crypto_aead_reqsize(ctx->aead_recv);
> > >     mem_size = aead_size + (nsg * sizeof(struct scatterlist));
> > >     mem_size = mem_size + prot->aad_size;
> > > -   mem_size = mem_size + crypto_aead_ivsize(ctx->aead_recv);
> > > +   mem_size = mem_size + MAX_IV_SIZE;
> >
> > This change is not strictly required for the patch, right?
> > Can we drop it, and perhaps send as an optimization separately later?
> >
> > >     /* Allocate a single block of memory which contains
> > >      * aead_req || sgin[] || sgout[] || aad || iv.
> > > @@ -1493,12 +1493,8 @@ static int decrypt_internal(struct sock *sk, struct sk_buff *skb,
> > >             kfree(mem);
> > >             return err;
> > >     }
> > > -   if (prot->version == TLS_1_3_VERSION ||
> > > -       prot->cipher_type == TLS_CIPHER_CHACHA20_POLY1305)
> > > -           memcpy(iv + iv_offset, tls_ctx->rx.iv,
> > > -                  crypto_aead_ivsize(ctx->aead_recv));
> > > -   else
> > > -           memcpy(iv + iv_offset, tls_ctx->rx.iv, prot->salt_size);
> > > +   memcpy(iv + iv_offset, tls_ctx->rx.iv,
> > > +          prot->iv_size + prot->salt_size);
> >
> > If the IV really is 16B then we're passing 4 bytes of uninitialized
> > data at the end of the buffer, right?
> >
> > >     xor_iv_with_seq(prot, iv + iv_offset, tls_ctx->rx.rec_seq);
>
> FWIW this is the fix I tested:
>
> diff --git a/net/tls/tls_sw.c b/net/tls/tls_sw.c
> index 0024a692f0f8..dbc6bce01898 100644
> --- a/net/tls/tls_sw.c
> +++ b/net/tls/tls_sw.c
> @@ -1473,6 +1473,8 @@ static int decrypt_internal(struct sock *sk, struct sk_buff *skb,
>         aad = (u8 *)(sgout + n_sgout);
>         iv = aad + prot->aad_size;
>
> +       /* Prepare IV */
> +       memset(iv, 0, crypto_aead_ivsize(ctx->aead_recv));
>         /* For CCM based ciphers, first byte of nonce+iv is a constant */
>         switch (prot->cipher_type) {
>         case TLS_CIPHER_AES_CCM_128:
> @@ -1485,21 +1487,20 @@ static int decrypt_internal(struct sock *sk, struct sk_buff *skb,
>                 break;
>         }
>
> -       /* Prepare IV */
> -       err = skb_copy_bits(skb, rxm->offset + TLS_HEADER_SIZE,
> -                           iv + iv_offset + prot->salt_size,
> -                           prot->iv_size);
> -       if (err < 0) {
> -               kfree(mem);
> -               return err;
> -       }
>         if (prot->version == TLS_1_3_VERSION ||
> -           prot->cipher_type == TLS_CIPHER_CHACHA20_POLY1305)
> +           prot->cipher_type == TLS_CIPHER_CHACHA20_POLY1305) {
>                 memcpy(iv + iv_offset, tls_ctx->rx.iv,
> -                      crypto_aead_ivsize(ctx->aead_recv));
> -       else
> +                      prot->iv_size + prot->salt_size);
> +       } else {
> +               err = skb_copy_bits(skb, rxm->offset + TLS_HEADER_SIZE,
> +                                   iv + iv_offset + prot->salt_size,
> +                                   prot->iv_size);
> +               if (err < 0) {
> +                       kfree(mem);
> +                       return err;
> +               }
>                 memcpy(iv + iv_offset, tls_ctx->rx.iv, prot->salt_size);
> -
> +       }
>         xor_iv_with_seq(prot, iv + iv_offset, tls_ctx->rx.rec_seq);
>
>         /* Prepare AAD */
> --
> 2.34.1
>
>
Ziyang Xuan (William) March 31, 2022, 2:35 a.m. UTC | #5
> On Wed, 30 Mar 2022 09:39:25 -0700 Jakub Kicinski wrote:
>> On Wed, 30 Mar 2022 16:50:09 +0800 Ziyang Xuan wrote:
>>> The memory size of tls_ctx->rx.iv for AES128-CCM is 12 setting in
>>> tls_set_sw_offload(). The return value of crypto_aead_ivsize()
>>> for "ccm(aes)" is 16. So memcpy() require 16 bytes from 12 bytes
>>> memory space will trigger slab-out-of-bounds bug as following:
>>>
>>> ==================================================================
>>> BUG: KASAN: slab-out-of-bounds in decrypt_internal+0x385/0xc40 [tls]
>>> Read of size 16 at addr ffff888114e84e60 by task tls/10911
>>>
>>> Call Trace:
>>>  <TASK>
>>>  dump_stack_lvl+0x34/0x44
>>>  print_report.cold+0x5e/0x5db
>>>  ? decrypt_internal+0x385/0xc40 [tls]
>>>  kasan_report+0xab/0x120
>>>  ? decrypt_internal+0x385/0xc40 [tls]
>>>  kasan_check_range+0xf9/0x1e0
>>>  memcpy+0x20/0x60
>>>  decrypt_internal+0x385/0xc40 [tls]
>>>  ? tls_get_rec+0x2e0/0x2e0 [tls]
>>>  ? process_rx_list+0x1a5/0x420 [tls]
>>>  ? tls_setup_from_iter.constprop.0+0x2e0/0x2e0 [tls]
>>>  decrypt_skb_update+0x9d/0x400 [tls]
>>>  tls_sw_recvmsg+0x3c8/0xb50 [tls]
>>>
>>> Allocated by task 10911:
>>>  kasan_save_stack+0x1e/0x40
>>>  __kasan_kmalloc+0x81/0xa0
>>>  tls_set_sw_offload+0x2eb/0xa20 [tls]
>>>  tls_setsockopt+0x68c/0x700 [tls]
>>>  __sys_setsockopt+0xfe/0x1b0  
>>
>> Interesting, are you running on non-x86 platform or with some crypto
>> accelerator? I wonder why we're not hitting it with KASAN and the
>> selftest we have.
> 
> I take that back, I can repro on x86 and 5.17, not sure why we're only
> discovering this now.
> 
> Noob question for crypto folks, ivsize for AES CCM is reported 
> as 16, but the real nonce size is 13 for TLS (q == 2, n == 13
> using NIST's variable names AFAICT). Are we required to zero out 
> the rest of the buffer?
> 
> In particular I think I've seen transient crypto failures with
> SM4 CCM in the past and zeroing the tail of the iv buffer seems
> to make the tests pass reliably.
> 
>>> Reserve MAX_IV_SIZE memory space for iv to be compatible with all
>>> ciphers. And do iv and salt copy like done in tls_do_encryption().
>>>
>>> Fixes: f295b3ae9f59 ("net/tls: Add support of AES128-CCM based ciphers")
>>> Signed-off-by: Ziyang Xuan <william.xuanziyang@huawei.com>
>>> ---
>>>  net/tls/tls_sw.c | 10 +++-------
>>>  1 file changed, 3 insertions(+), 7 deletions(-)
>>>
>>> diff --git a/net/tls/tls_sw.c b/net/tls/tls_sw.c
>>> index 0024a692f0f8..6b858f995b23 100644
>>> --- a/net/tls/tls_sw.c
>>> +++ b/net/tls/tls_sw.c
>>> @@ -1456,7 +1456,7 @@ static int decrypt_internal(struct sock *sk, struct sk_buff *skb,
>>>  	aead_size = sizeof(*aead_req) + crypto_aead_reqsize(ctx->aead_recv);
>>>  	mem_size = aead_size + (nsg * sizeof(struct scatterlist));
>>>  	mem_size = mem_size + prot->aad_size;
>>> -	mem_size = mem_size + crypto_aead_ivsize(ctx->aead_recv);
>>> +	mem_size = mem_size + MAX_IV_SIZE;  
>>
>> This change is not strictly required for the patch, right?
>> Can we drop it, and perhaps send as an optimization separately later?
>>
Yes, it is not required for the problem.

>>>  	/* Allocate a single block of memory which contains
>>>  	 * aead_req || sgin[] || sgout[] || aad || iv.
>>> @@ -1493,12 +1493,8 @@ static int decrypt_internal(struct sock *sk, struct sk_buff *skb,
>>>  		kfree(mem);
>>>  		return err;
>>>  	}
>>> -	if (prot->version == TLS_1_3_VERSION ||
>>> -	    prot->cipher_type == TLS_CIPHER_CHACHA20_POLY1305)
>>> -		memcpy(iv + iv_offset, tls_ctx->rx.iv,
>>> -		       crypto_aead_ivsize(ctx->aead_recv));
>>> -	else
>>> -		memcpy(iv + iv_offset, tls_ctx->rx.iv, prot->salt_size);
>>> +	memcpy(iv + iv_offset, tls_ctx->rx.iv,
>>> +	       prot->iv_size + prot->salt_size);  
>>
>> If the IV really is 16B then we're passing 4 bytes of uninitialized
>> data at the end of the buffer, right?
>>
>>>  	xor_iv_with_seq(prot, iv + iv_offset, tls_ctx->rx.rec_seq);
> 
> FWIW this is the fix I tested:
> 
> diff --git a/net/tls/tls_sw.c b/net/tls/tls_sw.c
> index 0024a692f0f8..dbc6bce01898 100644
> --- a/net/tls/tls_sw.c
> +++ b/net/tls/tls_sw.c
> @@ -1473,6 +1473,8 @@ static int decrypt_internal(struct sock *sk, struct sk_buff *skb,
>  	aad = (u8 *)(sgout + n_sgout);
>  	iv = aad + prot->aad_size;
>  
> +	/* Prepare IV */
> +	memset(iv, 0, crypto_aead_ivsize(ctx->aead_recv));
>  	/* For CCM based ciphers, first byte of nonce+iv is a constant */
>  	switch (prot->cipher_type) {
>  	case TLS_CIPHER_AES_CCM_128:
> @@ -1485,21 +1487,20 @@ static int decrypt_internal(struct sock *sk, struct sk_buff *skb,
>  		break;
>  	}
>  
> -	/* Prepare IV */
> -	err = skb_copy_bits(skb, rxm->offset + TLS_HEADER_SIZE,
> -			    iv + iv_offset + prot->salt_size,
> -			    prot->iv_size);
> -	if (err < 0) {
> -		kfree(mem);
> -		return err;
> -	}
>  	if (prot->version == TLS_1_3_VERSION ||
> -	    prot->cipher_type == TLS_CIPHER_CHACHA20_POLY1305)
> +	    prot->cipher_type == TLS_CIPHER_CHACHA20_POLY1305) {
>  		memcpy(iv + iv_offset, tls_ctx->rx.iv,
> -		       crypto_aead_ivsize(ctx->aead_recv));
> -	else
> +		       prot->iv_size + prot->salt_size);
> +	} else {
> +		err = skb_copy_bits(skb, rxm->offset + TLS_HEADER_SIZE,
> +				    iv + iv_offset + prot->salt_size,
> +				    prot->iv_size);
> +		if (err < 0) {
> +			kfree(mem);
> +			return err;
> +		}
>  		memcpy(iv + iv_offset, tls_ctx->rx.iv, prot->salt_size);
> -
> +	}
I am thinking about is skb_copy_bits() necessary in non-TLS_1_3_VERSION
and non-TLS_CIPHER_CHACHA20_POLY1305 scenarios?

If the inital iv+salt negotiated configuration for tx/rx offload is right
and reliable, what is the reason why we have to extract the iv value from
received skb instead if using the negotiated iv value? Does it can be
modified or just follow spec that versions below TLS_1_3_VERSION?

Without skb_copy_bits(), tls selftest all passed.

Forgive my noob question.

>  	xor_iv_with_seq(prot, iv + iv_offset, tls_ctx->rx.rec_seq);
>  
>  	/* Prepare AAD */
>
Jakub Kicinski March 31, 2022, 2:52 a.m. UTC | #6
On Thu, 31 Mar 2022 10:35:41 +0800 Ziyang Xuan (William) wrote:
> I am thinking about is skb_copy_bits() necessary in non-TLS_1_3_VERSION
> and non-TLS_CIPHER_CHACHA20_POLY1305 scenarios?

It's not necessary there, but we should not make that change be part of
the fix, the fix should be minimal. I'll send a separate patch to move
the skb_copy_bits() call later on.

I think for the fix all you should do is replace the
	crypto_aead_ivsize(ctx->aead_recv));
line with
	prot->iv_size + prot->salt_size);

> If the inital iv+salt negotiated configuration for tx/rx offload is right
> and reliable, what is the reason why we have to extract the iv value from
> received skb instead if using the negotiated iv value? Does it can be
> modified or just follow spec that versions below TLS_1_3_VERSION?

TLS 1.3 does not send the nonce as part of the record. Instead 
the record number is always used as nonce in crypto.
diff mbox series

Patch

diff --git a/net/tls/tls_sw.c b/net/tls/tls_sw.c
index 0024a692f0f8..6b858f995b23 100644
--- a/net/tls/tls_sw.c
+++ b/net/tls/tls_sw.c
@@ -1456,7 +1456,7 @@  static int decrypt_internal(struct sock *sk, struct sk_buff *skb,
 	aead_size = sizeof(*aead_req) + crypto_aead_reqsize(ctx->aead_recv);
 	mem_size = aead_size + (nsg * sizeof(struct scatterlist));
 	mem_size = mem_size + prot->aad_size;
-	mem_size = mem_size + crypto_aead_ivsize(ctx->aead_recv);
+	mem_size = mem_size + MAX_IV_SIZE;
 
 	/* Allocate a single block of memory which contains
 	 * aead_req || sgin[] || sgout[] || aad || iv.
@@ -1493,12 +1493,8 @@  static int decrypt_internal(struct sock *sk, struct sk_buff *skb,
 		kfree(mem);
 		return err;
 	}
-	if (prot->version == TLS_1_3_VERSION ||
-	    prot->cipher_type == TLS_CIPHER_CHACHA20_POLY1305)
-		memcpy(iv + iv_offset, tls_ctx->rx.iv,
-		       crypto_aead_ivsize(ctx->aead_recv));
-	else
-		memcpy(iv + iv_offset, tls_ctx->rx.iv, prot->salt_size);
+	memcpy(iv + iv_offset, tls_ctx->rx.iv,
+	       prot->iv_size + prot->salt_size);
 
 	xor_iv_with_seq(prot, iv + iv_offset, tls_ctx->rx.rec_seq);