diff mbox series

[rdma-next,2/3] RDMA/core: Fix check of device in rdma_listen()

Message ID b925e11d639726afbaaeea5aeaa58572b3aacf8e.1618753862.git.leonro@nvidia.com (mailing list archive)
State Changes Requested
Delegated to: Jason Gunthorpe
Headers show
Series CMA fixes | expand

Commit Message

Leon Romanovsky April 18, 2021, 1:55 p.m. UTC
From: Shay Drory <shayd@nvidia.com>

rdma_listen() checks if device already attached to rdma_id_priv,
based on the response the its decide to what to listen, however
this is different when the listeners are canceled.

This leads to a mismatch between rdma_listen() and cma_cancel_operation(),
and causes to bellow wild-memory-access. Fix it by aligning rdma_listen()
according to the cma_cancel_operation().

Found by syzcaller:
BUG: KASAN: wild-memory-access in __list_del include/linux/list.h:112 [inline]
BUG: KASAN: wild-memory-access in __list_del_entry include/linux/list.h:135 [inline]
BUG: KASAN: wild-memory-access in list_del include/linux/list.h:146 [inline]
BUG: KASAN: wild-memory-access in cma_cancel_listens drivers/infiniband/core/cma.c:1767 [inline]
BUG: KASAN: wild-memory-access in cma_cancel_operation drivers/infiniband/core/cma.c:1795 [inline]
BUG: KASAN: wild-memory-access in cma_cancel_operation+0x1f4/0x4b0 drivers/infiniband/core/cma.c:1783
Write of size 8 at addr dead000000000108 by task syz-executor716/334

CPU: 0 PID: 334 Comm: syz-executor716 Not tainted 5.11.0+ #271
Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS
rel-1.13.0-0-gf21b5a4aeb02-prebuilt.qemu.org 04/01/2014
Call Trace:
 __dump_stack lib/dump_stack.c:79 [inline]
 dump_stack+0xbe/0xf9 lib/dump_stack.c:120
 __kasan_report mm/kasan/report.c:400 [inline]
 kasan_report.cold+0x5f/0xd5 mm/kasan/report.c:413
 __list_del include/linux/list.h:112 [inline]
 __list_del_entry include/linux/list.h:135 [inline]
 list_del include/linux/list.h:146 [inline]
 cma_cancel_listens drivers/infiniband/core/cma.c:1767 [inline]
 cma_cancel_operation drivers/infiniband/core/cma.c:1795 [inline]
 cma_cancel_operation+0x1f4/0x4b0 drivers/infiniband/core/cma.c:1783
 _destroy_id+0x29/0x460 drivers/infiniband/core/cma.c:1862
 ucma_close_id+0x36/0x50 drivers/infiniband/core/ucma.c:185
 ucma_destroy_private_ctx+0x58d/0x5b0 drivers/infiniband/core/ucma.c:576
 ucma_close+0x91/0xd0 drivers/infiniband/core/ucma.c:1797
 __fput+0x169/0x540 fs/file_table.c:280
 task_work_run+0xb7/0x100 kernel/task_work.c:140
 exit_task_work include/linux/task_work.h:30 [inline]
 do_exit+0x7da/0x17f0 kernel/exit.c:825
 do_group_exit+0x9e/0x190 kernel/exit.c:922
 __do_sys_exit_group kernel/exit.c:933 [inline]
 __se_sys_exit_group kernel/exit.c:931 [inline]
 __x64_sys_exit_group+0x2d/0x30 kernel/exit.c:931
 do_syscall_64+0x2d/0x40 arch/x86/entry/common.c:46
 entry_SYSCALL_64_after_hwframe+0x44/0xa9
RIP: 0033:0x44a326
Code: Unable to access opcode bytes at RIP 0x44a2fc.
RSP: 002b:00007ffd14306748 EFLAGS: 00000246 ORIG_RAX: 00000000000000e7
RAX: ffffffffffffffda RBX: 00000000006c4490 RCX: 000000000044a326
RDX: 0000000000000001 RSI: 000000000000003c RDI: 0000000000000001
RBP: 0000000000000001 R08: 00000000000000e7 R09: ffffffffffffffc0
R10: bb1414ac000000c2 R11: 0000000000000246 R12: 00000000006c4490
R13: 0000000000000001 R14: 0000000000000000 R15: 0000000000000001
==================================================================

Fixes: 255d0c14b375 ("RDMA/cma: rdma_bind_addr() leaks a cma_dev reference count")
Signed-off-by: Shay Drory <shayd@nvidia.com>
Signed-off-by: Leon Romanovsky <leonro@nvidia.com>
---
 drivers/infiniband/core/cma.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

Comments

Jason Gunthorpe April 22, 2021, 11:28 a.m. UTC | #1
On Sun, Apr 18, 2021 at 04:55:53PM +0300, Leon Romanovsky wrote:
> From: Shay Drory <shayd@nvidia.com>
> 
> rdma_listen() checks if device already attached to rdma_id_priv,
> based on the response the its decide to what to listen, however
> this is different when the listeners are canceled.
> 
> This leads to a mismatch between rdma_listen() and cma_cancel_operation(),
> and causes to bellow wild-memory-access. Fix it by aligning rdma_listen()
> according to the cma_cancel_operation().

So this is happening because the error unwind in rdma_bind_addr() is
taking the exit path and calling cma_release_dev()?

This allows rdma_listen() to be called with a bogus device pointer
which precipitates this UAF during destroy.

However, I think rdma_bind_addr() should not allow the bogus device
pointer to leak out at all, since the ULP could see it. It really is
invalid to have it present no matter what.

This would make cma_release_dev() and _cma_attach_to_dev()
symmetrical - what do you think?

diff --git a/drivers/infiniband/core/cma.c b/drivers/infiniband/core/cma.c
index 2dc302a83014ae..91f6d968b46f65 100644
--- a/drivers/infiniband/core/cma.c
+++ b/drivers/infiniband/core/cma.c
@@ -474,6 +474,7 @@ static void cma_release_dev(struct rdma_id_private *id_priv)
 	list_del(&id_priv->list);
 	cma_dev_put(id_priv->cma_dev);
 	id_priv->cma_dev = NULL;
+	id_priv->id.device = NULL;
 	if (id_priv->id.route.addr.dev_addr.sgid_attr) {
 		rdma_put_gid_attr(id_priv->id.route.addr.dev_addr.sgid_attr);
 		id_priv->id.route.addr.dev_addr.sgid_attr = NULL;
Shay Drori April 22, 2021, 12:44 p.m. UTC | #2
On 4/22/2021 14:28, Jason Gunthorpe wrote:

> On Sun, Apr 18, 2021 at 04:55:53PM +0300, Leon Romanovsky wrote:
>> From: Shay Drory <shayd@nvidia.com>
>>
>> rdma_listen() checks if device already attached to rdma_id_priv,
>> based on the response the its decide to what to listen, however
>> this is different when the listeners are canceled.
>>
>> This leads to a mismatch between rdma_listen() and cma_cancel_operation(),
>> and causes to bellow wild-memory-access. Fix it by aligning rdma_listen()
>> according to the cma_cancel_operation().
> So this is happening because the error unwind in rdma_bind_addr() is
> taking the exit path and calling cma_release_dev()?
>
> This allows rdma_listen() to be called with a bogus device pointer
> which precipitates this UAF during destroy.
>
> However, I think rdma_bind_addr() should not allow the bogus device
> pointer to leak out at all, since the ULP could see it. It really is
> invalid to have it present no matter what.
>
> This would make cma_release_dev() and _cma_attach_to_dev()
> symmetrical - what do you think?
>
> diff --git a/drivers/infiniband/core/cma.c b/drivers/infiniband/core/cma.c
> index 2dc302a83014ae..91f6d968b46f65 100644
> --- a/drivers/infiniband/core/cma.c
> +++ b/drivers/infiniband/core/cma.c
> @@ -474,6 +474,7 @@ static void cma_release_dev(struct rdma_id_private *id_priv)
>   	list_del(&id_priv->list);
>   	cma_dev_put(id_priv->cma_dev);
>   	id_priv->cma_dev = NULL;
> +	id_priv->id.device = NULL;
>   	if (id_priv->id.route.addr.dev_addr.sgid_attr) {
>   		rdma_put_gid_attr(id_priv->id.route.addr.dev_addr.sgid_attr);
>   		id_priv->id.route.addr.dev_addr.sgid_attr = NULL;

I try that. this will break restrack_del() since restrack_del() is using id_priv->id.device
and is being called before restrack_del():
_destroy_id <https://elixir.bootlin.com/linux/v5.11/C/ident/_destroy_id>() {
if (id_priv 
<https://elixir.bootlin.com/linux/v5.11/C/ident/id_priv>->cma_dev 
<https://elixir.bootlin.com/linux/v5.11/C/ident/cma_dev>) 
cma_release_dev 
<https://elixir.bootlin.com/linux/v5.11/C/ident/cma_release_dev>(id_priv 
<https://elixir.bootlin.com/linux/v5.11/C/ident/id_priv>); ... 
rdma_restrack_del 
<https://elixir.bootlin.com/linux/v5.11/C/ident/rdma_restrack_del>(&id_priv 
<https://elixir.bootlin.com/linux/v5.11/C/ident/id_priv>->res); }
Jason Gunthorpe April 22, 2021, 12:51 p.m. UTC | #3
On Thu, Apr 22, 2021 at 03:44:55PM +0300, Shay Drory wrote:
> On 4/22/2021 14:28, Jason Gunthorpe wrote:
> 
> > On Sun, Apr 18, 2021 at 04:55:53PM +0300, Leon Romanovsky wrote:
> > > From: Shay Drory <shayd@nvidia.com>
> > > 
> > > rdma_listen() checks if device already attached to rdma_id_priv,
> > > based on the response the its decide to what to listen, however
> > > this is different when the listeners are canceled.
> > > 
> > > This leads to a mismatch between rdma_listen() and cma_cancel_operation(),
> > > and causes to bellow wild-memory-access. Fix it by aligning rdma_listen()
> > > according to the cma_cancel_operation().
> > So this is happening because the error unwind in rdma_bind_addr() is
> > taking the exit path and calling cma_release_dev()?
> > 
> > This allows rdma_listen() to be called with a bogus device pointer
> > which precipitates this UAF during destroy.
> > 
> > However, I think rdma_bind_addr() should not allow the bogus device
> > pointer to leak out at all, since the ULP could see it. It really is
> > invalid to have it present no matter what.
> > 
> > This would make cma_release_dev() and _cma_attach_to_dev()
> > symmetrical - what do you think?
> > 
> > diff --git a/drivers/infiniband/core/cma.c b/drivers/infiniband/core/cma.c
> > index 2dc302a83014ae..91f6d968b46f65 100644
> > +++ b/drivers/infiniband/core/cma.c
> > @@ -474,6 +474,7 @@ static void cma_release_dev(struct rdma_id_private *id_priv)
> >   	list_del(&id_priv->list);
> >   	cma_dev_put(id_priv->cma_dev);
> >   	id_priv->cma_dev = NULL;
> > +	id_priv->id.device = NULL;
> >   	if (id_priv->id.route.addr.dev_addr.sgid_attr) {
> >   		rdma_put_gid_attr(id_priv->id.route.addr.dev_addr.sgid_attr);
> >   		id_priv->id.route.addr.dev_addr.sgid_attr = NULL;
> 
> I try that. this will break restrack_del() since restrack_del() is
> using id_priv->id.device and is being called before restrack_del():

Oh that is another bug, once cma_release_dev() is called there is no
refcount protecting the id.device and any access to it is invalid.

The order of rdma_restrack_del should be moved to be ahead of the
cma_release_dev, and we also can't have a restrack without a cma_dev
in the first place

Jason
Leon Romanovsky April 22, 2021, 1:01 p.m. UTC | #4
On Thu, Apr 22, 2021 at 09:51:35AM -0300, Jason Gunthorpe wrote:
> On Thu, Apr 22, 2021 at 03:44:55PM +0300, Shay Drory wrote:
> > On 4/22/2021 14:28, Jason Gunthorpe wrote:
> > 
> > > On Sun, Apr 18, 2021 at 04:55:53PM +0300, Leon Romanovsky wrote:
> > > > From: Shay Drory <shayd@nvidia.com>
> > > > 
> > > > rdma_listen() checks if device already attached to rdma_id_priv,
> > > > based on the response the its decide to what to listen, however
> > > > this is different when the listeners are canceled.
> > > > 
> > > > This leads to a mismatch between rdma_listen() and cma_cancel_operation(),
> > > > and causes to bellow wild-memory-access. Fix it by aligning rdma_listen()
> > > > according to the cma_cancel_operation().
> > > So this is happening because the error unwind in rdma_bind_addr() is
> > > taking the exit path and calling cma_release_dev()?
> > > 
> > > This allows rdma_listen() to be called with a bogus device pointer
> > > which precipitates this UAF during destroy.
> > > 
> > > However, I think rdma_bind_addr() should not allow the bogus device
> > > pointer to leak out at all, since the ULP could see it. It really is
> > > invalid to have it present no matter what.
> > > 
> > > This would make cma_release_dev() and _cma_attach_to_dev()
> > > symmetrical - what do you think?
> > > 
> > > diff --git a/drivers/infiniband/core/cma.c b/drivers/infiniband/core/cma.c
> > > index 2dc302a83014ae..91f6d968b46f65 100644
> > > +++ b/drivers/infiniband/core/cma.c
> > > @@ -474,6 +474,7 @@ static void cma_release_dev(struct rdma_id_private *id_priv)
> > >   	list_del(&id_priv->list);
> > >   	cma_dev_put(id_priv->cma_dev);
> > >   	id_priv->cma_dev = NULL;
> > > +	id_priv->id.device = NULL;
> > >   	if (id_priv->id.route.addr.dev_addr.sgid_attr) {
> > >   		rdma_put_gid_attr(id_priv->id.route.addr.dev_addr.sgid_attr);
> > >   		id_priv->id.route.addr.dev_addr.sgid_attr = NULL;
> > 
> > I try that. this will break restrack_del() since restrack_del() is
> > using id_priv->id.device and is being called before restrack_del():
> 
> Oh that is another bug, once cma_release_dev() is called there is no
> refcount protecting the id.device and any access to it is invalid.
> 
> The order of rdma_restrack_del should be moved to be ahead of the
> cma_release_dev, and we also can't have a restrack without a cma_dev
> in the first place

We have restrack per-cmd_id and not per-cma_dev.

> 
> Jason
Jason Gunthorpe April 22, 2021, 1:02 p.m. UTC | #5
On Thu, Apr 22, 2021 at 04:01:20PM +0300, Leon Romanovsky wrote:
> On Thu, Apr 22, 2021 at 09:51:35AM -0300, Jason Gunthorpe wrote:
> > On Thu, Apr 22, 2021 at 03:44:55PM +0300, Shay Drory wrote:
> > > On 4/22/2021 14:28, Jason Gunthorpe wrote:
> > > 
> > > > On Sun, Apr 18, 2021 at 04:55:53PM +0300, Leon Romanovsky wrote:
> > > > > From: Shay Drory <shayd@nvidia.com>
> > > > > 
> > > > > rdma_listen() checks if device already attached to rdma_id_priv,
> > > > > based on the response the its decide to what to listen, however
> > > > > this is different when the listeners are canceled.
> > > > > 
> > > > > This leads to a mismatch between rdma_listen() and cma_cancel_operation(),
> > > > > and causes to bellow wild-memory-access. Fix it by aligning rdma_listen()
> > > > > according to the cma_cancel_operation().
> > > > So this is happening because the error unwind in rdma_bind_addr() is
> > > > taking the exit path and calling cma_release_dev()?
> > > > 
> > > > This allows rdma_listen() to be called with a bogus device pointer
> > > > which precipitates this UAF during destroy.
> > > > 
> > > > However, I think rdma_bind_addr() should not allow the bogus device
> > > > pointer to leak out at all, since the ULP could see it. It really is
> > > > invalid to have it present no matter what.
> > > > 
> > > > This would make cma_release_dev() and _cma_attach_to_dev()
> > > > symmetrical - what do you think?
> > > > 
> > > > diff --git a/drivers/infiniband/core/cma.c b/drivers/infiniband/core/cma.c
> > > > index 2dc302a83014ae..91f6d968b46f65 100644
> > > > +++ b/drivers/infiniband/core/cma.c
> > > > @@ -474,6 +474,7 @@ static void cma_release_dev(struct rdma_id_private *id_priv)
> > > >   	list_del(&id_priv->list);
> > > >   	cma_dev_put(id_priv->cma_dev);
> > > >   	id_priv->cma_dev = NULL;
> > > > +	id_priv->id.device = NULL;
> > > >   	if (id_priv->id.route.addr.dev_addr.sgid_attr) {
> > > >   		rdma_put_gid_attr(id_priv->id.route.addr.dev_addr.sgid_attr);
> > > >   		id_priv->id.route.addr.dev_addr.sgid_attr = NULL;
> > > 
> > > I try that. this will break restrack_del() since restrack_del() is
> > > using id_priv->id.device and is being called before restrack_del():
> > 
> > Oh that is another bug, once cma_release_dev() is called there is no
> > refcount protecting the id.device and any access to it is invalid.
> > 
> > The order of rdma_restrack_del should be moved to be ahead of the
> > cma_release_dev, and we also can't have a restrack without a cma_dev
> > in the first place
> 
> We have restrack per-cmd_id and not per-cma_dev.

No, restrack has this:

	dev = res_to_dev(res);
	if (WARN_ON(!dev))

And here dev will be NULL if cma_dev isn't set

Jasno
Leon Romanovsky April 22, 2021, 2:58 p.m. UTC | #6
On Thu, Apr 22, 2021 at 10:02:18AM -0300, Jason Gunthorpe wrote:
> On Thu, Apr 22, 2021 at 04:01:20PM +0300, Leon Romanovsky wrote:
> > On Thu, Apr 22, 2021 at 09:51:35AM -0300, Jason Gunthorpe wrote:
> > > On Thu, Apr 22, 2021 at 03:44:55PM +0300, Shay Drory wrote:
> > > > On 4/22/2021 14:28, Jason Gunthorpe wrote:
> > > > 
> > > > > On Sun, Apr 18, 2021 at 04:55:53PM +0300, Leon Romanovsky wrote:
> > > > > > From: Shay Drory <shayd@nvidia.com>
> > > > > > 
> > > > > > rdma_listen() checks if device already attached to rdma_id_priv,
> > > > > > based on the response the its decide to what to listen, however
> > > > > > this is different when the listeners are canceled.
> > > > > > 
> > > > > > This leads to a mismatch between rdma_listen() and cma_cancel_operation(),
> > > > > > and causes to bellow wild-memory-access. Fix it by aligning rdma_listen()
> > > > > > according to the cma_cancel_operation().
> > > > > So this is happening because the error unwind in rdma_bind_addr() is
> > > > > taking the exit path and calling cma_release_dev()?
> > > > > 
> > > > > This allows rdma_listen() to be called with a bogus device pointer
> > > > > which precipitates this UAF during destroy.
> > > > > 
> > > > > However, I think rdma_bind_addr() should not allow the bogus device
> > > > > pointer to leak out at all, since the ULP could see it. It really is
> > > > > invalid to have it present no matter what.
> > > > > 
> > > > > This would make cma_release_dev() and _cma_attach_to_dev()
> > > > > symmetrical - what do you think?
> > > > > 
> > > > > diff --git a/drivers/infiniband/core/cma.c b/drivers/infiniband/core/cma.c
> > > > > index 2dc302a83014ae..91f6d968b46f65 100644
> > > > > +++ b/drivers/infiniband/core/cma.c
> > > > > @@ -474,6 +474,7 @@ static void cma_release_dev(struct rdma_id_private *id_priv)
> > > > >   	list_del(&id_priv->list);
> > > > >   	cma_dev_put(id_priv->cma_dev);
> > > > >   	id_priv->cma_dev = NULL;
> > > > > +	id_priv->id.device = NULL;
> > > > >   	if (id_priv->id.route.addr.dev_addr.sgid_attr) {
> > > > >   		rdma_put_gid_attr(id_priv->id.route.addr.dev_addr.sgid_attr);
> > > > >   		id_priv->id.route.addr.dev_addr.sgid_attr = NULL;
> > > > 
> > > > I try that. this will break restrack_del() since restrack_del() is
> > > > using id_priv->id.device and is being called before restrack_del():
> > > 
> > > Oh that is another bug, once cma_release_dev() is called there is no
> > > refcount protecting the id.device and any access to it is invalid.
> > > 
> > > The order of rdma_restrack_del should be moved to be ahead of the
> > > cma_release_dev, and we also can't have a restrack without a cma_dev
> > > in the first place
> > 
> > We have restrack per-cmd_id and not per-cma_dev.
> 
> No, restrack has this:
> 
> 	dev = res_to_dev(res);
> 	if (WARN_ON(!dev))
> 
> And here dev will be NULL if cma_dev isn't set

  127 static struct ib_device *res_to_dev(struct rdma_restrack_entry *res)
  128 {

<...>

  136         case RDMA_RESTRACK_CM_ID:
  137                 return container_of(res, struct rdma_id_private,
  138                                     res)->id.device;
                                                ^^^^^ it is not cma_dev
Thanks
Jason Gunthorpe April 22, 2021, 4:18 p.m. UTC | #7
On Thu, Apr 22, 2021 at 05:58:18PM +0300, Leon Romanovsky wrote:
> On Thu, Apr 22, 2021 at 10:02:18AM -0300, Jason Gunthorpe wrote:
> > On Thu, Apr 22, 2021 at 04:01:20PM +0300, Leon Romanovsky wrote:
> > > On Thu, Apr 22, 2021 at 09:51:35AM -0300, Jason Gunthorpe wrote:
> > > > On Thu, Apr 22, 2021 at 03:44:55PM +0300, Shay Drory wrote:
> > > > > On 4/22/2021 14:28, Jason Gunthorpe wrote:
> > > > > 
> > > > > > On Sun, Apr 18, 2021 at 04:55:53PM +0300, Leon Romanovsky wrote:
> > > > > > > From: Shay Drory <shayd@nvidia.com>
> > > > > > > 
> > > > > > > rdma_listen() checks if device already attached to rdma_id_priv,
> > > > > > > based on the response the its decide to what to listen, however
> > > > > > > this is different when the listeners are canceled.
> > > > > > > 
> > > > > > > This leads to a mismatch between rdma_listen() and cma_cancel_operation(),
> > > > > > > and causes to bellow wild-memory-access. Fix it by aligning rdma_listen()
> > > > > > > according to the cma_cancel_operation().
> > > > > > So this is happening because the error unwind in rdma_bind_addr() is
> > > > > > taking the exit path and calling cma_release_dev()?
> > > > > > 
> > > > > > This allows rdma_listen() to be called with a bogus device pointer
> > > > > > which precipitates this UAF during destroy.
> > > > > > 
> > > > > > However, I think rdma_bind_addr() should not allow the bogus device
> > > > > > pointer to leak out at all, since the ULP could see it. It really is
> > > > > > invalid to have it present no matter what.
> > > > > > 
> > > > > > This would make cma_release_dev() and _cma_attach_to_dev()
> > > > > > symmetrical - what do you think?
> > > > > > 
> > > > > > diff --git a/drivers/infiniband/core/cma.c b/drivers/infiniband/core/cma.c
> > > > > > index 2dc302a83014ae..91f6d968b46f65 100644
> > > > > > +++ b/drivers/infiniband/core/cma.c
> > > > > > @@ -474,6 +474,7 @@ static void cma_release_dev(struct rdma_id_private *id_priv)
> > > > > >   	list_del(&id_priv->list);
> > > > > >   	cma_dev_put(id_priv->cma_dev);
> > > > > >   	id_priv->cma_dev = NULL;
> > > > > > +	id_priv->id.device = NULL;
> > > > > >   	if (id_priv->id.route.addr.dev_addr.sgid_attr) {
> > > > > >   		rdma_put_gid_attr(id_priv->id.route.addr.dev_addr.sgid_attr);
> > > > > >   		id_priv->id.route.addr.dev_addr.sgid_attr = NULL;
> > > > > 
> > > > > I try that. this will break restrack_del() since restrack_del() is
> > > > > using id_priv->id.device and is being called before restrack_del():
> > > > 
> > > > Oh that is another bug, once cma_release_dev() is called there is no
> > > > refcount protecting the id.device and any access to it is invalid.
> > > > 
> > > > The order of rdma_restrack_del should be moved to be ahead of the
> > > > cma_release_dev, and we also can't have a restrack without a cma_dev
> > > > in the first place
> > > 
> > > We have restrack per-cmd_id and not per-cma_dev.
> > 
> > No, restrack has this:
> > 
> > 	dev = res_to_dev(res);
> > 	if (WARN_ON(!dev))
> > 
> > And here dev will be NULL if cma_dev isn't set
> 
>   127 static struct ib_device *res_to_dev(struct rdma_restrack_entry *res)
>   128 {
> 
> <...>
> 
>   136         case RDMA_RESTRACK_CM_ID:
>   137                 return container_of(res, struct rdma_id_private,
>   138                                     res)->id.device;
>                                                 ^^^^^ it is not cma_dev

The invariant is that 

   priv.id.device == priv.cma_dev->device

(this de-normalization of data exists only to allow priv to be in a
private header)

If cma_dev == NULL then id.device == NULL as cma_Dev was the thing
preventing the pointer from being free'd.

Jason
Leon Romanovsky April 25, 2021, 1:17 p.m. UTC | #8
On Thu, Apr 22, 2021 at 01:18:09PM -0300, Jason Gunthorpe wrote:
> On Thu, Apr 22, 2021 at 05:58:18PM +0300, Leon Romanovsky wrote:
> > On Thu, Apr 22, 2021 at 10:02:18AM -0300, Jason Gunthorpe wrote:
> > > On Thu, Apr 22, 2021 at 04:01:20PM +0300, Leon Romanovsky wrote:
> > > > On Thu, Apr 22, 2021 at 09:51:35AM -0300, Jason Gunthorpe wrote:
> > > > > On Thu, Apr 22, 2021 at 03:44:55PM +0300, Shay Drory wrote:
> > > > > > On 4/22/2021 14:28, Jason Gunthorpe wrote:
> > > > > > 
> > > > > > > On Sun, Apr 18, 2021 at 04:55:53PM +0300, Leon Romanovsky wrote:
> > > > > > > > From: Shay Drory <shayd@nvidia.com>
> > > > > > > > 
> > > > > > > > rdma_listen() checks if device already attached to rdma_id_priv,
> > > > > > > > based on the response the its decide to what to listen, however
> > > > > > > > this is different when the listeners are canceled.
> > > > > > > > 
> > > > > > > > This leads to a mismatch between rdma_listen() and cma_cancel_operation(),
> > > > > > > > and causes to bellow wild-memory-access. Fix it by aligning rdma_listen()
> > > > > > > > according to the cma_cancel_operation().
> > > > > > > So this is happening because the error unwind in rdma_bind_addr() is
> > > > > > > taking the exit path and calling cma_release_dev()?
> > > > > > > 
> > > > > > > This allows rdma_listen() to be called with a bogus device pointer
> > > > > > > which precipitates this UAF during destroy.
> > > > > > > 
> > > > > > > However, I think rdma_bind_addr() should not allow the bogus device
> > > > > > > pointer to leak out at all, since the ULP could see it. It really is
> > > > > > > invalid to have it present no matter what.
> > > > > > > 
> > > > > > > This would make cma_release_dev() and _cma_attach_to_dev()
> > > > > > > symmetrical - what do you think?
> > > > > > > 
> > > > > > > diff --git a/drivers/infiniband/core/cma.c b/drivers/infiniband/core/cma.c
> > > > > > > index 2dc302a83014ae..91f6d968b46f65 100644
> > > > > > > +++ b/drivers/infiniband/core/cma.c
> > > > > > > @@ -474,6 +474,7 @@ static void cma_release_dev(struct rdma_id_private *id_priv)
> > > > > > >   	list_del(&id_priv->list);
> > > > > > >   	cma_dev_put(id_priv->cma_dev);
> > > > > > >   	id_priv->cma_dev = NULL;
> > > > > > > +	id_priv->id.device = NULL;
> > > > > > >   	if (id_priv->id.route.addr.dev_addr.sgid_attr) {
> > > > > > >   		rdma_put_gid_attr(id_priv->id.route.addr.dev_addr.sgid_attr);
> > > > > > >   		id_priv->id.route.addr.dev_addr.sgid_attr = NULL;
> > > > > > 
> > > > > > I try that. this will break restrack_del() since restrack_del() is
> > > > > > using id_priv->id.device and is being called before restrack_del():
> > > > > 
> > > > > Oh that is another bug, once cma_release_dev() is called there is no
> > > > > refcount protecting the id.device and any access to it is invalid.
> > > > > 
> > > > > The order of rdma_restrack_del should be moved to be ahead of the
> > > > > cma_release_dev, and we also can't have a restrack without a cma_dev
> > > > > in the first place
> > > > 
> > > > We have restrack per-cmd_id and not per-cma_dev.
> > > 
> > > No, restrack has this:
> > > 
> > > 	dev = res_to_dev(res);
> > > 	if (WARN_ON(!dev))
> > > 
> > > And here dev will be NULL if cma_dev isn't set
> > 
> >   127 static struct ib_device *res_to_dev(struct rdma_restrack_entry *res)
> >   128 {
> > 
> > <...>
> > 
> >   136         case RDMA_RESTRACK_CM_ID:
> >   137                 return container_of(res, struct rdma_id_private,
> >   138                                     res)->id.device;
> >                                                 ^^^^^ it is not cma_dev
> 
> The invariant is that 
> 
>    priv.id.device == priv.cma_dev->device
> 
> (this de-normalization of data exists only to allow priv to be in a
> private header)
> 
> If cma_dev == NULL then id.device == NULL as cma_Dev was the thing
> preventing the pointer from being free'd.

Let's see what I can do here.

Thanks

> 
> Jason
diff mbox series

Patch

diff --git a/drivers/infiniband/core/cma.c b/drivers/infiniband/core/cma.c
index 2dc302a83014..cc990adaf2b5 100644
--- a/drivers/infiniband/core/cma.c
+++ b/drivers/infiniband/core/cma.c
@@ -3768,7 +3768,7 @@  int rdma_listen(struct rdma_cm_id *id, int backlog)
 	}
 
 	id_priv->backlog = backlog;
-	if (id->device) {
+	if (id_priv->cma_dev) {
 		if (rdma_cap_ib_cm(id->device, 1)) {
 			ret = cma_ib_listen(id_priv);
 			if (ret)