mbox series

[net,v2,0/8] amt: fix validation and synchronization bugs

Message ID 20220717160910.19156-1-ap420073@gmail.com (mailing list archive)
Headers show
Series amt: fix validation and synchronization bugs | expand

Message

Taehee Yoo July 17, 2022, 4:09 p.m. UTC
There are some synchronization issues in the amt module.
Especially, an amt gateway doesn't well synchronize its own variables
and status(amt->status).
It tries to use a workqueue for handles in a single thread.
A global lock is also good, but it would occur complex locking complex.

In this patchset, only the gateway uses workqueue.
The reason why only gateway interface uses workqueue is that gateway
should manage its own states and variables a little bit statefully.
But relay doesn't need to manage tunnels statefully, stateless is okay.
So, relay side message handlers are okay to be called concurrently.
But it doesn't mean that no lock is needed.

Only amt multicast data message type will not be processed by the work
queue because It contains actual multicast data.
So, it should be processed immediately.

When any amt gateway events are triggered(sending discovery message by
delayed_work, sending request message by delayed_work and receiving
messages), it stores event and skb into the event queue(amt->events[16]).
Then, workqueue processes these events one by one.

The first patch is to use the work queue.

The second patch is to remove unnecessary lock due to a previous patch.

The third patch is to use READ_ONCE() in the amt module.
Even if the amt module uses a single thread, some variables (ready4,
ready6, amt->status) can be accessed concurrently.

The fourth patch is to add missing nonce generation logic when it sends a
new request message.

The fifth patch is to drop unexpected advertisement messages.
advertisement message should be received only after the gateway sends
a discovery message first.
So, the gateway should drop advertisement messages if it has never
sent a discovery message and it also should drop duplicate advertisement
messages.
Using nonce is good to distinguish whether a received message is an
expected message or not.

The sixth patch is to drop unexpected query messages.
This is the same behavior as the fourth patch.
Query messages should be received only after the gateway sends a request
message first.
The nonce variable is used to distinguish whether it is a reply to a
previous request message or not.
amt->ready4 and amt->ready6 are used to distinguish duplicate messages.

The seventh patch is to drop unexpected multicast data.
AMT gateway should not receive multicast data message type before
establish between gateway and relay.
In order to drop unexpected multicast data messages, it checks amt->status.

The last patch is to fix a locking problem on the relay side.
amt->nr_tunnels variable is protected by amt->lock.
But amt_request_handler() doesn't protect this variable.

v2:
 - Use local_bh_disable() instead of rcu_read_lock_bh() in
   amt_membership_query_handler.
 - Fix using uninitialized variables.
 - Fix unexpectedly start the event_wq after stopping.
 - Fix possible deadlock in amt_event_work().
 - Add a limit variable in amt_event_work() to prevent infinite working.
 - Rename amt_queue_events() to amt_queue_event().

Taehee Yoo (8):
  amt: use workqueue for gateway side message handling
  amt: remove unnecessary locks
  amt: use READ_ONCE() in amt module
  amt: add missing regeneration nonce logic in request logic
  amt: drop unexpected advertisement message
  amt: drop unexpected query message
  amt: drop unexpected multicast data
  amt: do not use amt->nr_tunnels outside of lock

 drivers/net/amt.c | 239 ++++++++++++++++++++++++++++++++++++----------
 include/net/amt.h |  20 ++++
 2 files changed, 207 insertions(+), 52 deletions(-)

Comments

patchwork-bot+netdevbpf@kernel.org July 19, 2022, 10:50 a.m. UTC | #1
Hello:

This series was applied to netdev/net.git (master)
by Paolo Abeni <pabeni@redhat.com>:

On Sun, 17 Jul 2022 16:09:02 +0000 you wrote:
> There are some synchronization issues in the amt module.
> Especially, an amt gateway doesn't well synchronize its own variables
> and status(amt->status).
> It tries to use a workqueue for handles in a single thread.
> A global lock is also good, but it would occur complex locking complex.
> 
> In this patchset, only the gateway uses workqueue.
> The reason why only gateway interface uses workqueue is that gateway
> should manage its own states and variables a little bit statefully.
> But relay doesn't need to manage tunnels statefully, stateless is okay.
> So, relay side message handlers are okay to be called concurrently.
> But it doesn't mean that no lock is needed.
> 
> [...]

Here is the summary with links:
  - [net,v2,1/8] amt: use workqueue for gateway side message handling
    https://git.kernel.org/netdev/net/c/30e22a6ebca0
  - [net,v2,2/8] amt: remove unnecessary locks
    https://git.kernel.org/netdev/net/c/9c343ea6185f
  - [net,v2,3/8] amt: use READ_ONCE() in amt module
    https://git.kernel.org/netdev/net/c/928f353cb867
  - [net,v2,4/8] amt: add missing regeneration nonce logic in request logic
    https://git.kernel.org/netdev/net/c/627f16931bf3
  - [net,v2,5/8] amt: drop unexpected advertisement message
    https://git.kernel.org/netdev/net/c/40185f359fba
  - [net,v2,6/8] amt: drop unexpected query message
    https://git.kernel.org/netdev/net/c/239d886601e3
  - [net,v2,7/8] amt: drop unexpected multicast data
    https://git.kernel.org/netdev/net/c/e882827d5b89
  - [net,v2,8/8] amt: do not use amt->nr_tunnels outside of lock
    https://git.kernel.org/netdev/net/c/989918482bbc

You are awesome, thank you!