diff mbox series

[wireless] wifi: wilc1000: Fix UAF in wilc_netdev_cleanup() when iterator the RCU list

Message ID 20221124151349.2386077-1-zhangxiaoxu5@huawei.com (mailing list archive)
State Changes Requested
Delegated to: Kalle Valo
Headers show
Series [wireless] wifi: wilc1000: Fix UAF in wilc_netdev_cleanup() when iterator the RCU list | expand

Commit Message

Zhang Xiaoxu Nov. 24, 2022, 3:13 p.m. UTC
There is a UAF read when remove the wilc1000_spi module:

 BUG: KASAN: use-after-free in wilc_netdev_cleanup.cold+0xc4/0xe0 [wilc1000]
 Read of size 8 at addr ffff888116846900 by task rmmod/386

 CPU: 2 PID: 386 Comm: rmmod Tainted: G                 N 6.1.0-rc6+ #8
 Call Trace:
  dump_stack_lvl+0x68/0x85
  print_report+0x16c/0x4a3
  kasan_report+0x95/0x190
  wilc_netdev_cleanup.cold+0xc4/0xe0
  wilc_bus_remove+0x52/0x60
  spi_remove+0x46/0x60
  device_remove+0x73/0xc0
  device_release_driver_internal+0x12d/0x210
  driver_detach+0x84/0x100
  bus_remove_driver+0x90/0x120
  driver_unregister+0x4f/0x80
  __x64_sys_delete_module+0x2fc/0x440
  do_syscall_64+0x38/0x90
  entry_SYSCALL_64_after_hwframe+0x63/0xcd

Since set 'needs_free_netdev=true' when initialize the net device, the
net device will be freed when unregister, then use the freed 'vif' to
find the next will UAF read.

Let's unregister the net device after removed from wilc list to avoid
the UAF.

Fixes: 8399918f3056 ("staging: wilc1000: use RCU list to maintain vif interfaces list")
Signed-off-by: Zhang Xiaoxu <zhangxiaoxu5@huawei.com>
---
 drivers/net/wireless/microchip/wilc1000/netdev.c | 13 +++++++++----
 1 file changed, 9 insertions(+), 4 deletions(-)

Comments

Kalle Valo Nov. 25, 2022, 4:42 p.m. UTC | #1
<Ajay.Kathat@microchip.com> writes:

> On 24/11/22 20:43, Zhang Xiaoxu wrote:
>
>  EXTERNAL EMAIL: Do not click links or open attachments unless you know the
>  content is safe
>
> There is a UAF read when remove the wilc1000_spi module:
>
>  BUG: KASAN: use-after-free in wilc_netdev_cleanup.cold+0xc4/0xe0 [wilc1000]
>  Read of size 8 at addr ffff888116846900 by task rmmod/386
>
>  CPU: 2 PID: 386 Comm: rmmod Tainted: G                 N 6.1.0-rc6+ #8
>  Call Trace:
>   dump_stack_lvl+0x68/0x85
>   print_report+0x16c/0x4a3
>   kasan_report+0x95/0x190
>   wilc_netdev_cleanup.cold+0xc4/0xe0
>   wilc_bus_remove+0x52/0x60
>   spi_remove+0x46/0x60
>   device_remove+0x73/0xc0
>   device_release_driver_internal+0x12d/0x210
>   driver_detach+0x84/0x100
>   bus_remove_driver+0x90/0x120
>   driver_unregister+0x4f/0x80
>   __x64_sys_delete_module+0x2fc/0x440
>   do_syscall_64+0x38/0x90
>   entry_SYSCALL_64_after_hwframe+0x63/0xcd
>
> Since set 'needs_free_netdev=true' when initialize the net device, the
> net device will be freed when unregister, then use the freed 'vif' to
> find the next will UAF read.
>
> Did you test this behaviour on the real device. I am seeing a kernel
> crash when the module is unloaded after the connection with an AP. As
> I see, "vif_list" is used to maintain the interface list, so even when
> one interface is removed, another element is fetched from the
> "vif_list", not using the freed "vif"

Ajay, please don't use HTML as our lists drop those.
Zhang Xiaoxu Nov. 26, 2022, 8:14 a.m. UTC | #2
On 2022/11/26 0:17, Ajay.Kathat@microchip.com wrote:
> 
> On 24/11/22 20:43, Zhang Xiaoxu wrote:
>> EXTERNAL EMAIL: Do not click links or open attachments unless you know the content is safe
>>
>> There is a UAF read when remove the wilc1000_spi module:
>>
>>   BUG: KASAN: use-after-free in wilc_netdev_cleanup.cold+0xc4/0xe0 [wilc1000]
>>   Read of size 8 at addr ffff888116846900 by task rmmod/386
>>
>>   CPU: 2 PID: 386 Comm: rmmod Tainted: G                 N 6.1.0-rc6+ #8
>>   Call Trace:
>>    dump_stack_lvl+0x68/0x85
>>    print_report+0x16c/0x4a3
>>    kasan_report+0x95/0x190
>>    wilc_netdev_cleanup.cold+0xc4/0xe0
>>    wilc_bus_remove+0x52/0x60
>>    spi_remove+0x46/0x60
>>    device_remove+0x73/0xc0
>>    device_release_driver_internal+0x12d/0x210
>>    driver_detach+0x84/0x100
>>    bus_remove_driver+0x90/0x120
>>    driver_unregister+0x4f/0x80
>>    __x64_sys_delete_module+0x2fc/0x440
>>    do_syscall_64+0x38/0x90
>>    entry_SYSCALL_64_after_hwframe+0x63/0xcd
>>
>> Since set 'needs_free_netdev=true' when initialize the net device, the
>> net device will be freed when unregister, then use the freed 'vif' to
>> find the next will UAF read.
> 
> 
> Did you test this behaviour on the real device. I am seeing a kernel crash when the module is unloaded after the connection with an AP.
Thanks Ajay,
I have no real device, what kind of crash about your scenario?
> As I see, "vif_list" is used to maintain the interface list, so even when one interface is removed, another element is fetched from the "vif_list", not using the freed "vif"
For example if the "vif_list" has device A and device B, just like:
   A->next = &B
   B->prev = &A

When iterator on the vif_list,
   1st: Got A and unregister A, A will be freed since needs_free_netdev=true
   2nd: Try get B from A->next, A already freed in the first step, UAF occurred.

rcu list no implement the interface like "list_for_each_entry_safe".
> 
> Regards,
> Ajay
>
Kalle Valo Nov. 28, 2022, 11:14 a.m. UTC | #3
"zhangxiaoxu (A)" <zhangxiaoxu5@huawei.com> writes:

> On 2022/11/26 0:17, Ajay.Kathat@microchip.com wrote:
>>
>> On 24/11/22 20:43, Zhang Xiaoxu wrote:
>>> EXTERNAL EMAIL: Do not click links or open attachments unless you know the content is safe
>>>
>>> There is a UAF read when remove the wilc1000_spi module:
>>>
>>>   BUG: KASAN: use-after-free in wilc_netdev_cleanup.cold+0xc4/0xe0 [wilc1000]
>>>   Read of size 8 at addr ffff888116846900 by task rmmod/386
>>>
>>>   CPU: 2 PID: 386 Comm: rmmod Tainted: G                 N 6.1.0-rc6+ #8
>>>   Call Trace:
>>>    dump_stack_lvl+0x68/0x85
>>>    print_report+0x16c/0x4a3
>>>    kasan_report+0x95/0x190
>>>    wilc_netdev_cleanup.cold+0xc4/0xe0
>>>    wilc_bus_remove+0x52/0x60
>>>    spi_remove+0x46/0x60
>>>    device_remove+0x73/0xc0
>>>    device_release_driver_internal+0x12d/0x210
>>>    driver_detach+0x84/0x100
>>>    bus_remove_driver+0x90/0x120
>>>    driver_unregister+0x4f/0x80
>>>    __x64_sys_delete_module+0x2fc/0x440
>>>    do_syscall_64+0x38/0x90
>>>    entry_SYSCALL_64_after_hwframe+0x63/0xcd
>>>
>>> Since set 'needs_free_netdev=true' when initialize the net device, the
>>> net device will be freed when unregister, then use the freed 'vif' to
>>> find the next will UAF read.
>>
>>
>> Did you test this behaviour on the real device. I am seeing a kernel
>> crash when the module is unloaded after the connection with an AP.
>
> Thanks Ajay, I have no real device, what kind of crash about your
> scenario?

If you don't have a real device to test on, please state that clearly in
the commit log. For example, "Compile tested only" or something like
that.

We get way too much untested patches where there's no indication that
they have had no testing. I'm really concerned about this trend, I'm
even considering should I just start dropping these kind of untested
cleanup patches?
Zhang Xiaoxu Nov. 28, 2022, 2:01 p.m. UTC | #4
On 2022/11/28 19:14, Kalle Valo wrote:
> "zhangxiaoxu (A)" <zhangxiaoxu5@huawei.com> writes:
> 
>> On 2022/11/26 0:17, Ajay.Kathat@microchip.com wrote:
>>>
>>> On 24/11/22 20:43, Zhang Xiaoxu wrote:
>>>> EXTERNAL EMAIL: Do not click links or open attachments unless you know the content is safe
>>>>
>>>> There is a UAF read when remove the wilc1000_spi module:
>>>>
>>>>    BUG: KASAN: use-after-free in wilc_netdev_cleanup.cold+0xc4/0xe0 [wilc1000]
>>>>    Read of size 8 at addr ffff888116846900 by task rmmod/386
>>>>
>>>>    CPU: 2 PID: 386 Comm: rmmod Tainted: G                 N 6.1.0-rc6+ #8
>>>>    Call Trace:
>>>>     dump_stack_lvl+0x68/0x85
>>>>     print_report+0x16c/0x4a3
>>>>     kasan_report+0x95/0x190
>>>>     wilc_netdev_cleanup.cold+0xc4/0xe0
>>>>     wilc_bus_remove+0x52/0x60
>>>>     spi_remove+0x46/0x60
>>>>     device_remove+0x73/0xc0
>>>>     device_release_driver_internal+0x12d/0x210
>>>>     driver_detach+0x84/0x100
>>>>     bus_remove_driver+0x90/0x120
>>>>     driver_unregister+0x4f/0x80
>>>>     __x64_sys_delete_module+0x2fc/0x440
>>>>     do_syscall_64+0x38/0x90
>>>>     entry_SYSCALL_64_after_hwframe+0x63/0xcd
>>>>
>>>> Since set 'needs_free_netdev=true' when initialize the net device, the
>>>> net device will be freed when unregister, then use the freed 'vif' to
>>>> find the next will UAF read.
>>>
>>>
>>> Did you test this behaviour on the real device. I am seeing a kernel
>>> crash when the module is unloaded after the connection with an AP.
>>
>> Thanks Ajay, I have no real device, what kind of crash about your
>> scenario?
> 
> If you don't have a real device to test on, please state that clearly in
> the commit log. For example, "Compile tested only" or something like
> that.
Thanks Kalle,

I found this problem with a bpf mock device, and test this patch use
the same way.
> 
> We get way too much untested patches where there's no indication that
> they have had no testing. I'm really concerned about this trend, I'm
> even considering should I just start dropping these kind of untested
> cleanup patches?
>
Kalle Valo Nov. 28, 2022, 2:05 p.m. UTC | #5
"zhangxiaoxu (A)" <zhangxiaoxu5@huawei.com> writes:

> On 2022/11/28 19:14, Kalle Valo wrote:
>> "zhangxiaoxu (A)" <zhangxiaoxu5@huawei.com> writes:
>>
>>> On 2022/11/26 0:17, Ajay.Kathat@microchip.com wrote:
>>>>
>>>> On 24/11/22 20:43, Zhang Xiaoxu wrote:
>>>>> EXTERNAL EMAIL: Do not click links or open attachments unless you know the content is safe
>>>>>
>>>>> There is a UAF read when remove the wilc1000_spi module:
>>>>>
>>>>>    BUG: KASAN: use-after-free in wilc_netdev_cleanup.cold+0xc4/0xe0 [wilc1000]
>>>>>    Read of size 8 at addr ffff888116846900 by task rmmod/386
>>>>>
>>>>>    CPU: 2 PID: 386 Comm: rmmod Tainted: G                 N 6.1.0-rc6+ #8
>>>>>    Call Trace:
>>>>>     dump_stack_lvl+0x68/0x85
>>>>>     print_report+0x16c/0x4a3
>>>>>     kasan_report+0x95/0x190
>>>>>     wilc_netdev_cleanup.cold+0xc4/0xe0
>>>>>     wilc_bus_remove+0x52/0x60
>>>>>     spi_remove+0x46/0x60
>>>>>     device_remove+0x73/0xc0
>>>>>     device_release_driver_internal+0x12d/0x210
>>>>>     driver_detach+0x84/0x100
>>>>>     bus_remove_driver+0x90/0x120
>>>>>     driver_unregister+0x4f/0x80
>>>>>     __x64_sys_delete_module+0x2fc/0x440
>>>>>     do_syscall_64+0x38/0x90
>>>>>     entry_SYSCALL_64_after_hwframe+0x63/0xcd
>>>>>
>>>>> Since set 'needs_free_netdev=true' when initialize the net device, the
>>>>> net device will be freed when unregister, then use the freed 'vif' to
>>>>> find the next will UAF read.
>>>>
>>>>
>>>> Did you test this behaviour on the real device. I am seeing a kernel
>>>> crash when the module is unloaded after the connection with an AP.
>>>
>>> Thanks Ajay, I have no real device, what kind of crash about your
>>> scenario?
>>
>> If you don't have a real device to test on, please state that clearly in
>> the commit log. For example, "Compile tested only" or something like
>> that.
>
> Thanks Kalle,
>
> I found this problem with a bpf mock device, and test this patch use
> the same way.

That's very good info, please always include that information to the
commit log as well. That way there's a better understanding how this
patch is found.
diff mbox series

Patch

diff --git a/drivers/net/wireless/microchip/wilc1000/netdev.c b/drivers/net/wireless/microchip/wilc1000/netdev.c
index 9b319a455b96..d6374bfd7650 100644
--- a/drivers/net/wireless/microchip/wilc1000/netdev.c
+++ b/drivers/net/wireless/microchip/wilc1000/netdev.c
@@ -879,6 +879,7 @@  void wilc_netdev_cleanup(struct wilc *wilc)
 {
 	struct wilc_vif *vif;
 	int srcu_idx, ifc_cnt = 0;
+	LIST_HEAD(list_kill);
 
 	if (!wilc)
 		return;
@@ -888,16 +889,14 @@  void wilc_netdev_cleanup(struct wilc *wilc)
 		wilc->firmware = NULL;
 	}
 
+	rtnl_lock();
 	srcu_idx = srcu_read_lock(&wilc->srcu);
 	list_for_each_entry_rcu(vif, &wilc->vif_list, list) {
 		if (vif->ndev)
-			unregister_netdev(vif->ndev);
+			unregister_netdevice_queue(vif->ndev, &list_kill);
 	}
 	srcu_read_unlock(&wilc->srcu, srcu_idx);
 
-	wilc_wfi_deinit_mon_interface(wilc, false);
-	destroy_workqueue(wilc->hif_workqueue);
-
 	while (ifc_cnt < WILC_NUM_CONCURRENT_IFC) {
 		mutex_lock(&wilc->vif_mutex);
 		if (wilc->vif_num <= 0) {
@@ -914,6 +913,12 @@  void wilc_netdev_cleanup(struct wilc *wilc)
 		ifc_cnt++;
 	}
 
+	unregister_netdevice_many(&list_kill);
+	rtnl_unlock();
+
+	wilc_wfi_deinit_mon_interface(wilc, false);
+	destroy_workqueue(wilc->hif_workqueue);
+
 	wilc_wlan_cfg_deinit(wilc);
 	wlan_deinit_locks(wilc);
 	wiphy_unregister(wilc->wiphy);