diff mbox series

[net] ethtool: do not perform operations on net devices being unregistered

Message ID 20211203101318.435618-1-atenart@kernel.org (mailing list archive)
State Superseded
Commit dde91ccfa25fd58f64c397d91b81a4b393100ffa
Delegated to: Netdev Maintainers
Headers show
Series [net] ethtool: do not perform operations on net devices being unregistered | expand

Checks

Context Check Description
netdev/tree_selection success Clearly marked for net
netdev/fixes_present success Fixes tag present in non-next series
netdev/subject_prefix success Link
netdev/cover_letter success Single patches do not need cover letters
netdev/patch_count success Link
netdev/header_inline success No static functions without inline keyword in header files
netdev/build_32bit success Errors and warnings before: 2 this patch: 2
netdev/cc_maintainers fail 1 blamed authors not CCed: f.fainelli@gmail.com; 2 maintainers not CCed: hkallweit1@gmail.com f.fainelli@gmail.com
netdev/build_clang success Errors and warnings before: 20 this patch: 20
netdev/module_param success Was 0 now: 0
netdev/verify_signedoff success Signed-off-by tag matches author and committer
netdev/verify_fixes success Fixes tag looks correct
netdev/build_allmodconfig_warn success Errors and warnings before: 4 this patch: 4
netdev/checkpatch success total: 0 errors, 0 warnings, 0 checks, 9 lines checked
netdev/kdoc success Errors and warnings before: 0 this patch: 0
netdev/source_inline success Was 0 now: 0

Commit Message

Antoine Tenart Dec. 3, 2021, 10:13 a.m. UTC
There is a short period between a net device starts to be unregistered
and when it is actually gone. In that time frame ethtool operations
could still be performed, which might end up in unwanted or undefined
behaviours[1].

Do not allow ethtool operations after a net device starts its
unregistration. This patch targets the netlink part as the ioctl one
isn't affected: the reference to the net device is taken and the
operation is executed within an rtnl lock section and the net device
won't be found after unregister.

[1] For example adding Tx queues after unregister ends up in NULL
    pointer exceptions and UaFs, such as:

      BUG: KASAN: use-after-free in kobject_get+0x14/0x90
      Read of size 1 at addr ffff88801961248c by task ethtool/755

      CPU: 0 PID: 755 Comm: ethtool Not tainted 5.15.0-rc6+ #778
      Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.14.0-4.fc34 04/014
      Call Trace:
       dump_stack_lvl+0x57/0x72
       print_address_description.constprop.0+0x1f/0x140
       kasan_report.cold+0x7f/0x11b
       kobject_get+0x14/0x90
       kobject_add_internal+0x3d1/0x450
       kobject_init_and_add+0xba/0xf0
       netdev_queue_update_kobjects+0xcf/0x200
       netif_set_real_num_tx_queues+0xb4/0x310
       veth_set_channels+0x1c3/0x550
       ethnl_set_channels+0x524/0x610

Fixes: 041b1c5d4a53 ("ethtool: helper functions for netlink interface")
Suggested-by: Jakub Kicinski <kuba@kernel.org>
Signed-off-by: Antoine Tenart <atenart@kernel.org>
---

Following the discussions in those threads:
- https://lore.kernel.org/all/20211129154520.295823-1-atenart@kernel.org/T/
- https://lore.kernel.org/all/20211122162007.303623-1-atenart@kernel.org/T/

 net/ethtool/netlink.c | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

Comments

Julian Wiedmann Dec. 6, 2021, 9:46 a.m. UTC | #1
On 03.12.21 12:13, Antoine Tenart wrote:
> There is a short period between a net device starts to be unregistered
> and when it is actually gone. In that time frame ethtool operations
> could still be performed, which might end up in unwanted or undefined
> behaviours[1].
> 
> Do not allow ethtool operations after a net device starts its
> unregistration. This patch targets the netlink part as the ioctl one
> isn't affected: the reference to the net device is taken and the
> operation is executed within an rtnl lock section and the net device
> won't be found after unregister.
> 
> [1] For example adding Tx queues after unregister ends up in NULL
>     pointer exceptions and UaFs, such as:
> 
>       BUG: KASAN: use-after-free in kobject_get+0x14/0x90
>       Read of size 1 at addr ffff88801961248c by task ethtool/755
> 
>       CPU: 0 PID: 755 Comm: ethtool Not tainted 5.15.0-rc6+ #778
>       Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.14.0-4.fc34 04/014
>       Call Trace:
>        dump_stack_lvl+0x57/0x72
>        print_address_description.constprop.0+0x1f/0x140
>        kasan_report.cold+0x7f/0x11b
>        kobject_get+0x14/0x90
>        kobject_add_internal+0x3d1/0x450
>        kobject_init_and_add+0xba/0xf0
>        netdev_queue_update_kobjects+0xcf/0x200
>        netif_set_real_num_tx_queues+0xb4/0x310
>        veth_set_channels+0x1c3/0x550
>        ethnl_set_channels+0x524/0x610
> 
> Fixes: 041b1c5d4a53 ("ethtool: helper functions for netlink interface")
> Suggested-by: Jakub Kicinski <kuba@kernel.org>
> Signed-off-by: Antoine Tenart <atenart@kernel.org>
> ---
> 
> Following the discussions in those threads:
> - https://lore.kernel.org/all/20211129154520.295823-1-atenart@kernel.org/T/
> - https://lore.kernel.org/all/20211122162007.303623-1-atenart@kernel.org/T/
> 
>  net/ethtool/netlink.c | 3 ++-
>  1 file changed, 2 insertions(+), 1 deletion(-)
> 
> diff --git a/net/ethtool/netlink.c b/net/ethtool/netlink.c
> index 38b44c0291b1..96f4180aabd2 100644
> --- a/net/ethtool/netlink.c
> +++ b/net/ethtool/netlink.c
> @@ -40,7 +40,8 @@ int ethnl_ops_begin(struct net_device *dev)
>  	if (dev->dev.parent)
>  		pm_runtime_get_sync(dev->dev.parent);
>  
> -	if (!netif_device_present(dev)) {
> +	if (!netif_device_present(dev) ||
> +	    dev->reg_state == NETREG_UNREGISTERING) {
>  		ret = -ENODEV;
>  		goto err;
>  	}
> 

Wondering if other places would also benefit from a netif_device_detach()
in the unregistration sequence ...
Jakub Kicinski Dec. 6, 2021, 3:15 p.m. UTC | #2
On Mon, 6 Dec 2021 11:46:35 +0200 Julian Wiedmann wrote:
> On 03.12.21 12:13, Antoine Tenart wrote:
> > There is a short period between a net device starts to be unregistered
> > and when it is actually gone. In that time frame ethtool operations
> > could still be performed, which might end up in unwanted or undefined
> > behaviours[1].
> > 
> > Do not allow ethtool operations after a net device starts its
> > unregistration. This patch targets the netlink part as the ioctl one
> > isn't affected: the reference to the net device is taken and the
> > operation is executed within an rtnl lock section and the net device
> > won't be found after unregister.
> > [...]
> > +++ b/net/ethtool/netlink.c
> > @@ -40,7 +40,8 @@ int ethnl_ops_begin(struct net_device *dev)
> >  	if (dev->dev.parent)
> >  		pm_runtime_get_sync(dev->dev.parent);
> >  
> > -	if (!netif_device_present(dev)) {
> > +	if (!netif_device_present(dev) ||
> > +	    dev->reg_state == NETREG_UNREGISTERING) {
> >  		ret = -ENODEV;
> >  		goto err;
> >  	}
> >   
> 
> Wondering if other places would also benefit from a netif_device_detach()
> in the unregistration sequence ...

Sounds like a good idea but maybe as a follow up to net-next? 
The likelihood of that breaking things is low, but non-zero.
patchwork-bot+netdevbpf@kernel.org Dec. 7, 2021, 1 a.m. UTC | #3
Hello:

This patch was applied to netdev/net.git (master)
by Jakub Kicinski <kuba@kernel.org>:

On Fri,  3 Dec 2021 11:13:18 +0100 you wrote:
> There is a short period between a net device starts to be unregistered
> and when it is actually gone. In that time frame ethtool operations
> could still be performed, which might end up in unwanted or undefined
> behaviours[1].
> 
> Do not allow ethtool operations after a net device starts its
> unregistration. This patch targets the netlink part as the ioctl one
> isn't affected: the reference to the net device is taken and the
> operation is executed within an rtnl lock section and the net device
> won't be found after unregister.
> 
> [...]

Here is the summary with links:
  - [net] ethtool: do not perform operations on net devices being unregistered
    https://git.kernel.org/netdev/net/c/dde91ccfa25f

You are awesome, thank you!
Antoine Tenart Dec. 7, 2021, 8:05 a.m. UTC | #4
Quoting Jakub Kicinski (2021-12-06 16:15:20)
> On Mon, 6 Dec 2021 11:46:35 +0200 Julian Wiedmann wrote:
> > 
> > Wondering if other places would also benefit from a netif_device_detach()
> > in the unregistration sequence ...
> 
> Sounds like a good idea but maybe as a follow up to net-next? 
> The likelihood of that breaking things is low, but non-zero.

Might be good to have a look at this yes. I'm wondering, there are
multiple mechanisms to avoid using a net device after unregistration,
including netif_device_detach and unlinking it from the device chain
(unlist_netdevice). Currently it's possible to have a device unlinked
but not detached. Haven't looked at it, but there might be something to
do here.

(One the other way, detaching a device without unlinking it is valid for
PM).

Antoine
Julian Wiedmann Dec. 7, 2021, 9:05 a.m. UTC | #5
On 06.12.21 17:15, Jakub Kicinski wrote:
> On Mon, 6 Dec 2021 11:46:35 +0200 Julian Wiedmann wrote:
>> On 03.12.21 12:13, Antoine Tenart wrote:
>>> There is a short period between a net device starts to be unregistered
>>> and when it is actually gone. In that time frame ethtool operations
>>> could still be performed, which might end up in unwanted or undefined
>>> behaviours[1].
>>>
>>> Do not allow ethtool operations after a net device starts its
>>> unregistration. This patch targets the netlink part as the ioctl one
>>> isn't affected: the reference to the net device is taken and the
>>> operation is executed within an rtnl lock section and the net device
>>> won't be found after unregister.
>>> [...]
>>> +++ b/net/ethtool/netlink.c
>>> @@ -40,7 +40,8 @@ int ethnl_ops_begin(struct net_device *dev)
>>>  	if (dev->dev.parent)
>>>  		pm_runtime_get_sync(dev->dev.parent);
>>>  
>>> -	if (!netif_device_present(dev)) {
>>> +	if (!netif_device_present(dev) ||
>>> +	    dev->reg_state == NETREG_UNREGISTERING) {
>>>  		ret = -ENODEV;
>>>  		goto err;
>>>  	}
>>>   
>>
>> Wondering if other places would also benefit from a netif_device_detach()
>> in the unregistration sequence ...
> 
> Sounds like a good idea but maybe as a follow up to net-next? 
> The likelihood of that breaking things is low, but non-zero.
> 

Oh absolutely, only via net-next.
diff mbox series

Patch

diff --git a/net/ethtool/netlink.c b/net/ethtool/netlink.c
index 38b44c0291b1..96f4180aabd2 100644
--- a/net/ethtool/netlink.c
+++ b/net/ethtool/netlink.c
@@ -40,7 +40,8 @@  int ethnl_ops_begin(struct net_device *dev)
 	if (dev->dev.parent)
 		pm_runtime_get_sync(dev->dev.parent);
 
-	if (!netif_device_present(dev)) {
+	if (!netif_device_present(dev) ||
+	    dev->reg_state == NETREG_UNREGISTERING) {
 		ret = -ENODEV;
 		goto err;
 	}