diff mbox series

[net-next] sctp: fix busy polling

Message ID 20231219170017.73902-1-edumazet@google.com (mailing list archive)
State Accepted
Commit a562c0a2d651e040681b0bfce9b4d229ac3b0b8c
Delegated to: Netdev Maintainers
Headers show
Series [net-next] sctp: fix busy polling | expand

Checks

Context Check Description
netdev/series_format success Single patches do not need cover letters
netdev/tree_selection success Clearly marked for net-next
netdev/ynl success Generated files up to date; no warnings/errors; no diff in generated;
netdev/fixes_present success Fixes tag not required for -next series
netdev/header_inline success No static functions without inline keyword in header files
netdev/build_32bit success Errors and warnings before: 1116 this patch: 1116
netdev/cc_maintainers fail 1 blamed authors not CCed: daniel@iogearbox.net; 2 maintainers not CCed: linux-sctp@vger.kernel.org daniel@iogearbox.net
netdev/build_clang fail Errors and warnings before: 12 this patch: 12
netdev/verify_signedoff success Signed-off-by tag matches author and committer
netdev/deprecated_api success None detected
netdev/check_selftest success No net selftest shell script
netdev/verify_fixes success Fixes tag looks correct
netdev/build_allmodconfig_warn success Errors and warnings before: 1143 this patch: 1143
netdev/checkpatch warning WARNING: 'wont' may be misspelled - perhaps 'won't'?
netdev/build_clang_rust success No Rust files in patch. Skipping build
netdev/kdoc success Errors and warnings before: 0 this patch: 0
netdev/source_inline success Was 0 now: 0

Commit Message

Eric Dumazet Dec. 19, 2023, 5 p.m. UTC
Busy polling while holding the socket lock makes litle sense,
because incoming packets wont reach our receive queue.

Fixes: 8465a5fcd1ce ("sctp: add support for busy polling to sctp protocol")
Reported-by: Jacob Moroni <jmoroni@google.com>
Signed-off-by: Eric Dumazet <edumazet@google.com>
Cc: Marcelo Ricardo Leitner <marcelo.leitner@gmail.com>
Cc: Xin Long <lucien.xin@gmail.com>
---
 net/sctp/socket.c | 10 ++++------
 1 file changed, 4 insertions(+), 6 deletions(-)

Comments

Xin Long Dec. 22, 2023, 4:08 p.m. UTC | #1
On Tue, Dec 19, 2023 at 12:00 PM Eric Dumazet <edumazet@google.com> wrote:
>
> Busy polling while holding the socket lock makes litle sense,
> because incoming packets wont reach our receive queue.
>
> Fixes: 8465a5fcd1ce ("sctp: add support for busy polling to sctp protocol")
> Reported-by: Jacob Moroni <jmoroni@google.com>
> Signed-off-by: Eric Dumazet <edumazet@google.com>
> Cc: Marcelo Ricardo Leitner <marcelo.leitner@gmail.com>
> Cc: Xin Long <lucien.xin@gmail.com>
> ---
>  net/sctp/socket.c | 10 ++++------
>  1 file changed, 4 insertions(+), 6 deletions(-)
>
> diff --git a/net/sctp/socket.c b/net/sctp/socket.c
> index 5fb02bbb4b349ef9ab9c2790cccb30fb4c4e897c..6b9fcdb0952a0fe599ae5d1d1cc6fa9557a3a3bc 100644
> --- a/net/sctp/socket.c
> +++ b/net/sctp/socket.c
> @@ -2102,6 +2102,10 @@ static int sctp_recvmsg(struct sock *sk, struct msghdr *msg, size_t len,
>         if (unlikely(flags & MSG_ERRQUEUE))
>                 return inet_recv_error(sk, msg, len, addr_len);
>
> +       if (sk_can_busy_loop(sk) &&
> +           skb_queue_empty_lockless(&sk->sk_receive_queue))
> +               sk_busy_loop(sk, flags & MSG_DONTWAIT);
> +
Here is no any sk_state check, if the SCTP socket(TCP type) has been
already closed by peer, will sctp_recvmsg() block here?

Maybe here it needs a `!(sk->sk_shutdown & RCV_SHUTDOWN)` check,
which is set when it's closed by the peer.

Thanks

>         lock_sock(sk);
>
>         if (sctp_style(sk, TCP) && !sctp_sstate(sk, ESTABLISHED) &&
> @@ -9046,12 +9050,6 @@ struct sk_buff *sctp_skb_recv_datagram(struct sock *sk, int flags, int *err)
>                 if (sk->sk_shutdown & RCV_SHUTDOWN)
>                         break;
>
> -               if (sk_can_busy_loop(sk)) {
> -                       sk_busy_loop(sk, flags & MSG_DONTWAIT);
> -
> -                       if (!skb_queue_empty_lockless(&sk->sk_receive_queue))
> -                               continue;
> -               }
>
>                 /* User doesn't want to wait.  */
>                 error = -EAGAIN;
> --
> 2.43.0.472.g3155946c3a-goog
>
Eric Dumazet Dec. 22, 2023, 5:05 p.m. UTC | #2
On Fri, Dec 22, 2023 at 5:08 PM Xin Long <lucien.xin@gmail.com> wrote:
>
> On Tue, Dec 19, 2023 at 12:00 PM Eric Dumazet <edumazet@google.com> wrote:
> >
> > Busy polling while holding the socket lock makes litle sense,
> > because incoming packets wont reach our receive queue.
> >
> > Fixes: 8465a5fcd1ce ("sctp: add support for busy polling to sctp protocol")
> > Reported-by: Jacob Moroni <jmoroni@google.com>
> > Signed-off-by: Eric Dumazet <edumazet@google.com>
> > Cc: Marcelo Ricardo Leitner <marcelo.leitner@gmail.com>
> > Cc: Xin Long <lucien.xin@gmail.com>
> > ---
> >  net/sctp/socket.c | 10 ++++------
> >  1 file changed, 4 insertions(+), 6 deletions(-)
> >
> > diff --git a/net/sctp/socket.c b/net/sctp/socket.c
> > index 5fb02bbb4b349ef9ab9c2790cccb30fb4c4e897c..6b9fcdb0952a0fe599ae5d1d1cc6fa9557a3a3bc 100644
> > --- a/net/sctp/socket.c
> > +++ b/net/sctp/socket.c
> > @@ -2102,6 +2102,10 @@ static int sctp_recvmsg(struct sock *sk, struct msghdr *msg, size_t len,
> >         if (unlikely(flags & MSG_ERRQUEUE))
> >                 return inet_recv_error(sk, msg, len, addr_len);
> >
> > +       if (sk_can_busy_loop(sk) &&
> > +           skb_queue_empty_lockless(&sk->sk_receive_queue))
> > +               sk_busy_loop(sk, flags & MSG_DONTWAIT);
> > +
> Here is no any sk_state check, if the SCTP socket(TCP type) has been
> already closed by peer, will sctp_recvmsg() block here?

Busy polling is only polling the NIC queue, hoping to feed this socket
for incoming packets.

Using more than a lockless read of sk->sk_receive_queue is not really necessary,
and racy anyway.

Eliezer Tamir added a check against sk_state for no good reason in
TCP, my plan is to remove it.

There are other states where it still makes sense to allow busy polling.


>
> Maybe here it needs a `!(sk->sk_shutdown & RCV_SHUTDOWN)` check,
> which is set when it's closed by the peer.

See above. Keep this as simple as possible...


>
> Thanks
>
> >         lock_sock(sk);
> >
> >         if (sctp_style(sk, TCP) && !sctp_sstate(sk, ESTABLISHED) &&
> > @@ -9046,12 +9050,6 @@ struct sk_buff *sctp_skb_recv_datagram(struct sock *sk, int flags, int *err)
> >                 if (sk->sk_shutdown & RCV_SHUTDOWN)
> >                         break;
> >
> > -               if (sk_can_busy_loop(sk)) {
> > -                       sk_busy_loop(sk, flags & MSG_DONTWAIT);
> > -
> > -                       if (!skb_queue_empty_lockless(&sk->sk_receive_queue))
> > -                               continue;
> > -               }
> >
> >                 /* User doesn't want to wait.  */
> >                 error = -EAGAIN;
> > --
> > 2.43.0.472.g3155946c3a-goog
> >
Xin Long Dec. 22, 2023, 6:34 p.m. UTC | #3
On Fri, Dec 22, 2023 at 12:05 PM Eric Dumazet <edumazet@google.com> wrote:
>
> On Fri, Dec 22, 2023 at 5:08 PM Xin Long <lucien.xin@gmail.com> wrote:
> >
> > On Tue, Dec 19, 2023 at 12:00 PM Eric Dumazet <edumazet@google.com> wrote:
> > >
> > > Busy polling while holding the socket lock makes litle sense,
> > > because incoming packets wont reach our receive queue.
> > >
> > > Fixes: 8465a5fcd1ce ("sctp: add support for busy polling to sctp protocol")
> > > Reported-by: Jacob Moroni <jmoroni@google.com>
> > > Signed-off-by: Eric Dumazet <edumazet@google.com>
> > > Cc: Marcelo Ricardo Leitner <marcelo.leitner@gmail.com>
> > > Cc: Xin Long <lucien.xin@gmail.com>
> > > ---
> > >  net/sctp/socket.c | 10 ++++------
> > >  1 file changed, 4 insertions(+), 6 deletions(-)
> > >
> > > diff --git a/net/sctp/socket.c b/net/sctp/socket.c
> > > index 5fb02bbb4b349ef9ab9c2790cccb30fb4c4e897c..6b9fcdb0952a0fe599ae5d1d1cc6fa9557a3a3bc 100644
> > > --- a/net/sctp/socket.c
> > > +++ b/net/sctp/socket.c
> > > @@ -2102,6 +2102,10 @@ static int sctp_recvmsg(struct sock *sk, struct msghdr *msg, size_t len,
> > >         if (unlikely(flags & MSG_ERRQUEUE))
> > >                 return inet_recv_error(sk, msg, len, addr_len);
> > >
> > > +       if (sk_can_busy_loop(sk) &&
> > > +           skb_queue_empty_lockless(&sk->sk_receive_queue))
> > > +               sk_busy_loop(sk, flags & MSG_DONTWAIT);
> > > +
> > Here is no any sk_state check, if the SCTP socket(TCP type) has been
> > already closed by peer, will sctp_recvmsg() block here?
>
> Busy polling is only polling the NIC queue, hoping to feed this socket
> for incoming packets.
OK, will it block if there's no incoming packets on the NIC queue?

If yes, when sysctl net.core.busy_read=1, my concern is:

     client                server
     -------------------------------
                           listen()
     connect()
                           accept()
     close()
                           recvmsg() <----

recvmsg() is supposed to return right away as the connection is
already close(). With this patch, will recvmsg() be able to do
that if no more incoming packets in the NIC after close()?

Thanks.

>
> Using more than a lockless read of sk->sk_receive_queue is not really necessary,
> and racy anyway.
>
> Eliezer Tamir added a check against sk_state for no good reason in
> TCP, my plan is to remove it.
>
> There are other states where it still makes sense to allow busy polling.
>
>
> >
> > Maybe here it needs a `!(sk->sk_shutdown & RCV_SHUTDOWN)` check,
> > which is set when it's closed by the peer.
>
> See above. Keep this as simple as possible...
>
>
> >
> > Thanks
> >
> > >         lock_sock(sk);
> > >
> > >         if (sctp_style(sk, TCP) && !sctp_sstate(sk, ESTABLISHED) &&
> > > @@ -9046,12 +9050,6 @@ struct sk_buff *sctp_skb_recv_datagram(struct sock *sk, int flags, int *err)
> > >                 if (sk->sk_shutdown & RCV_SHUTDOWN)
> > >                         break;
> > >
> > > -               if (sk_can_busy_loop(sk)) {
> > > -                       sk_busy_loop(sk, flags & MSG_DONTWAIT);
> > > -
> > > -                       if (!skb_queue_empty_lockless(&sk->sk_receive_queue))
> > > -                               continue;
> > > -               }
> > >
> > >                 /* User doesn't want to wait.  */
> > >                 error = -EAGAIN;
> > > --
> > > 2.43.0.472.g3155946c3a-goog
> > >
Eric Dumazet Jan. 3, 2024, 10:51 a.m. UTC | #4
On Fri, Dec 22, 2023 at 7:34 PM Xin Long <lucien.xin@gmail.com> wrote:
>
> On Fri, Dec 22, 2023 at 12:05 PM Eric Dumazet <edumazet@google.com> wrote:
> >
> > On Fri, Dec 22, 2023 at 5:08 PM Xin Long <lucien.xin@gmail.com> wrote:
> > >
> > > On Tue, Dec 19, 2023 at 12:00 PM Eric Dumazet <edumazet@google.com> wrote:
> > > >
> > > > Busy polling while holding the socket lock makes litle sense,
> > > > because incoming packets wont reach our receive queue.
> > > >
> > > > Fixes: 8465a5fcd1ce ("sctp: add support for busy polling to sctp protocol")
> > > > Reported-by: Jacob Moroni <jmoroni@google.com>
> > > > Signed-off-by: Eric Dumazet <edumazet@google.com>
> > > > Cc: Marcelo Ricardo Leitner <marcelo.leitner@gmail.com>
> > > > Cc: Xin Long <lucien.xin@gmail.com>
> > > > ---
> > > >  net/sctp/socket.c | 10 ++++------
> > > >  1 file changed, 4 insertions(+), 6 deletions(-)
> > > >
> > > > diff --git a/net/sctp/socket.c b/net/sctp/socket.c
> > > > index 5fb02bbb4b349ef9ab9c2790cccb30fb4c4e897c..6b9fcdb0952a0fe599ae5d1d1cc6fa9557a3a3bc 100644
> > > > --- a/net/sctp/socket.c
> > > > +++ b/net/sctp/socket.c
> > > > @@ -2102,6 +2102,10 @@ static int sctp_recvmsg(struct sock *sk, struct msghdr *msg, size_t len,
> > > >         if (unlikely(flags & MSG_ERRQUEUE))
> > > >                 return inet_recv_error(sk, msg, len, addr_len);
> > > >
> > > > +       if (sk_can_busy_loop(sk) &&
> > > > +           skb_queue_empty_lockless(&sk->sk_receive_queue))
> > > > +               sk_busy_loop(sk, flags & MSG_DONTWAIT);
> > > > +
> > > Here is no any sk_state check, if the SCTP socket(TCP type) has been
> > > already closed by peer, will sctp_recvmsg() block here?
> >
> > Busy polling is only polling the NIC queue, hoping to feed this socket
> > for incoming packets.
> OK, will it block if there's no incoming packets on the NIC queue?
>
> If yes, when sysctl net.core.busy_read=1, my concern is:
>
>      client                server
>      -------------------------------
>                            listen()
>      connect()
>                            accept()
>      close()
>                            recvmsg() <----
>
> recvmsg() is supposed to return right away as the connection is
> already close(). With this patch, will recvmsg() be able to do
> that if no more incoming packets in the NIC after close()?


Answer is yes for a variety of reasons :

net.core.busy_read=1 means :

Busy poll will happen for
1) at most one usec, and
2) as long as there is no packet in sk->sk_receive_queue (see
sk_busy_loop_end())

But busy poll is only started on sockets that had established packets.

A listener will not engage this because sk->sk_napi_id does not
contain a valid NAPI ID.



>
> Thanks.
>
> >
> > Using more than a lockless read of sk->sk_receive_queue is not really necessary,
> > and racy anyway.
> >
> > Eliezer Tamir added a check against sk_state for no good reason in
> > TCP, my plan is to remove it.
> >
> > There are other states where it still makes sense to allow busy polling.
> >
> >
> > >
> > > Maybe here it needs a `!(sk->sk_shutdown & RCV_SHUTDOWN)` check,
> > > which is set when it's closed by the peer.
> >
> > See above. Keep this as simple as possible...
> >
> >
> > >
> > > Thanks
> > >
> > > >         lock_sock(sk);
> > > >
> > > >         if (sctp_style(sk, TCP) && !sctp_sstate(sk, ESTABLISHED) &&
> > > > @@ -9046,12 +9050,6 @@ struct sk_buff *sctp_skb_recv_datagram(struct sock *sk, int flags, int *err)
> > > >                 if (sk->sk_shutdown & RCV_SHUTDOWN)
> > > >                         break;
> > > >
> > > > -               if (sk_can_busy_loop(sk)) {
> > > > -                       sk_busy_loop(sk, flags & MSG_DONTWAIT);
> > > > -
> > > > -                       if (!skb_queue_empty_lockless(&sk->sk_receive_queue))
> > > > -                               continue;
> > > > -               }
> > > >
> > > >                 /* User doesn't want to wait.  */
> > > >                 error = -EAGAIN;
> > > > --
> > > > 2.43.0.472.g3155946c3a-goog
> > > >
Xin Long Jan. 3, 2024, 3:14 p.m. UTC | #5
On Wed, Jan 3, 2024 at 5:51 AM Eric Dumazet <edumazet@google.com> wrote:
>
> On Fri, Dec 22, 2023 at 7:34 PM Xin Long <lucien.xin@gmail.com> wrote:
> >
> > On Fri, Dec 22, 2023 at 12:05 PM Eric Dumazet <edumazet@google.com> wrote:
> > >
> > > On Fri, Dec 22, 2023 at 5:08 PM Xin Long <lucien.xin@gmail.com> wrote:
> > > >
> > > > On Tue, Dec 19, 2023 at 12:00 PM Eric Dumazet <edumazet@google.com> wrote:
> > > > >
> > > > > Busy polling while holding the socket lock makes litle sense,
> > > > > because incoming packets wont reach our receive queue.
> > > > >
> > > > > Fixes: 8465a5fcd1ce ("sctp: add support for busy polling to sctp protocol")
> > > > > Reported-by: Jacob Moroni <jmoroni@google.com>
> > > > > Signed-off-by: Eric Dumazet <edumazet@google.com>
> > > > > Cc: Marcelo Ricardo Leitner <marcelo.leitner@gmail.com>
> > > > > Cc: Xin Long <lucien.xin@gmail.com>
> > > > > ---
> > > > >  net/sctp/socket.c | 10 ++++------
> > > > >  1 file changed, 4 insertions(+), 6 deletions(-)
> > > > >
> > > > > diff --git a/net/sctp/socket.c b/net/sctp/socket.c
> > > > > index 5fb02bbb4b349ef9ab9c2790cccb30fb4c4e897c..6b9fcdb0952a0fe599ae5d1d1cc6fa9557a3a3bc 100644
> > > > > --- a/net/sctp/socket.c
> > > > > +++ b/net/sctp/socket.c
> > > > > @@ -2102,6 +2102,10 @@ static int sctp_recvmsg(struct sock *sk, struct msghdr *msg, size_t len,
> > > > >         if (unlikely(flags & MSG_ERRQUEUE))
> > > > >                 return inet_recv_error(sk, msg, len, addr_len);
> > > > >
> > > > > +       if (sk_can_busy_loop(sk) &&
> > > > > +           skb_queue_empty_lockless(&sk->sk_receive_queue))
> > > > > +               sk_busy_loop(sk, flags & MSG_DONTWAIT);
> > > > > +
> > > > Here is no any sk_state check, if the SCTP socket(TCP type) has been
> > > > already closed by peer, will sctp_recvmsg() block here?
> > >
> > > Busy polling is only polling the NIC queue, hoping to feed this socket
> > > for incoming packets.
> > OK, will it block if there's no incoming packets on the NIC queue?
> >
> > If yes, when sysctl net.core.busy_read=1, my concern is:
> >
> >      client                server
> >      -------------------------------
> >                            listen()
> >      connect()
> >                            accept()
> >      close()
> >                            recvmsg() <----
> >
> > recvmsg() is supposed to return right away as the connection is
> > already close(). With this patch, will recvmsg() be able to do
> > that if no more incoming packets in the NIC after close()?
>
>
> Answer is yes for a variety of reasons :
>
> net.core.busy_read=1 means :
>
> Busy poll will happen for
> 1) at most one usec, and
I see, never used busy polling, but what if the value is set to a large value,
like minutes, I might be just overthinking and no one will do this?

> 2) as long as there is no packet in sk->sk_receive_queue (see
> sk_busy_loop_end())
It's likely after being closed by peer, no packet at sk_receive_queue.

>
> But busy poll is only started on sockets that had established packets.
I think it won't be told to break when the socket is closed by peer.

>
> A listener will not engage this because sk->sk_napi_id does not
> contain a valid NAPI ID.
>
>
>
> >
> > Thanks.
> >
> > >
> > > Using more than a lockless read of sk->sk_receive_queue is not really necessary,
> > > and racy anyway.
> > >
> > > Eliezer Tamir added a check against sk_state for no good reason in
> > > TCP, my plan is to remove it.
> > >
> > > There are other states where it still makes sense to allow busy polling.
> > >
> > >
> > > >
> > > > Maybe here it needs a `!(sk->sk_shutdown & RCV_SHUTDOWN)` check,
> > > > which is set when it's closed by the peer.
> > >
> > > See above. Keep this as simple as possible...
> > >
> > >
> > > >
> > > > Thanks
> > > >
> > > > >         lock_sock(sk);
> > > > >
> > > > >         if (sctp_style(sk, TCP) && !sctp_sstate(sk, ESTABLISHED) &&
> > > > > @@ -9046,12 +9050,6 @@ struct sk_buff *sctp_skb_recv_datagram(struct sock *sk, int flags, int *err)
> > > > >                 if (sk->sk_shutdown & RCV_SHUTDOWN)
> > > > >                         break;
> > > > >
> > > > > -               if (sk_can_busy_loop(sk)) {
> > > > > -                       sk_busy_loop(sk, flags & MSG_DONTWAIT);
> > > > > -
> > > > > -                       if (!skb_queue_empty_lockless(&sk->sk_receive_queue))
> > > > > -                               continue;
> > > > > -               }
> > > > >
> > > > >                 /* User doesn't want to wait.  */
> > > > >                 error = -EAGAIN;
> > > > > --
> > > > > 2.43.0.472.g3155946c3a-goog
> > > > >
Eric Dumazet Jan. 3, 2024, 4:06 p.m. UTC | #6
On Wed, Jan 3, 2024 at 4:14 PM Xin Long <lucien.xin@gmail.com> wrote:
>
> On Wed, Jan 3, 2024 at 5:51 AM Eric Dumazet <edumazet@google.com> wrote:
> >
> > On Fri, Dec 22, 2023 at 7:34 PM Xin Long <lucien.xin@gmail.com> wrote:
> > >
> > > On Fri, Dec 22, 2023 at 12:05 PM Eric Dumazet <edumazet@google.com> wrote:
> > > >
> > > > On Fri, Dec 22, 2023 at 5:08 PM Xin Long <lucien.xin@gmail.com> wrote:
> > > > >
> > > > > On Tue, Dec 19, 2023 at 12:00 PM Eric Dumazet <edumazet@google.com> wrote:
> > > > > >
> > > > > > Busy polling while holding the socket lock makes litle sense,
> > > > > > because incoming packets wont reach our receive queue.
> > > > > >
> > > > > > Fixes: 8465a5fcd1ce ("sctp: add support for busy polling to sctp protocol")
> > > > > > Reported-by: Jacob Moroni <jmoroni@google.com>
> > > > > > Signed-off-by: Eric Dumazet <edumazet@google.com>
> > > > > > Cc: Marcelo Ricardo Leitner <marcelo.leitner@gmail.com>
> > > > > > Cc: Xin Long <lucien.xin@gmail.com>
> > > > > > ---
> > > > > >  net/sctp/socket.c | 10 ++++------
> > > > > >  1 file changed, 4 insertions(+), 6 deletions(-)
> > > > > >
> > > > > > diff --git a/net/sctp/socket.c b/net/sctp/socket.c
> > > > > > index 5fb02bbb4b349ef9ab9c2790cccb30fb4c4e897c..6b9fcdb0952a0fe599ae5d1d1cc6fa9557a3a3bc 100644
> > > > > > --- a/net/sctp/socket.c
> > > > > > +++ b/net/sctp/socket.c
> > > > > > @@ -2102,6 +2102,10 @@ static int sctp_recvmsg(struct sock *sk, struct msghdr *msg, size_t len,
> > > > > >         if (unlikely(flags & MSG_ERRQUEUE))
> > > > > >                 return inet_recv_error(sk, msg, len, addr_len);
> > > > > >
> > > > > > +       if (sk_can_busy_loop(sk) &&
> > > > > > +           skb_queue_empty_lockless(&sk->sk_receive_queue))
> > > > > > +               sk_busy_loop(sk, flags & MSG_DONTWAIT);
> > > > > > +
> > > > > Here is no any sk_state check, if the SCTP socket(TCP type) has been
> > > > > already closed by peer, will sctp_recvmsg() block here?
> > > >
> > > > Busy polling is only polling the NIC queue, hoping to feed this socket
> > > > for incoming packets.
> > > OK, will it block if there's no incoming packets on the NIC queue?
> > >
> > > If yes, when sysctl net.core.busy_read=1, my concern is:
> > >
> > >      client                server
> > >      -------------------------------
> > >                            listen()
> > >      connect()
> > >                            accept()
> > >      close()
> > >                            recvmsg() <----
> > >
> > > recvmsg() is supposed to return right away as the connection is
> > > already close(). With this patch, will recvmsg() be able to do
> > > that if no more incoming packets in the NIC after close()?
> >
> >
> > Answer is yes for a variety of reasons :
> >
> > net.core.busy_read=1 means :
> >
> > Busy poll will happen for
> > 1) at most one usec, and
> I see, never used busy polling, but what if the value is set to a large value,
> like minutes, I might be just overthinking and no one will do this?
>

No problem, you can look at
https://netdevconf.info/2.1/papers/BusyPollingNextGen.pdf for
a short introduction.

<quote>
Suggested settings are in the 50 to 100 us range
</quote>

> > 2) as long as there is no packet in sk->sk_receive_queue (see
> > sk_busy_loop_end())
> It's likely after being closed by peer, no packet at sk_receive_queue.
>
> >
> > But busy poll is only started on sockets that had established packets.
> I think it won't be told to break when the socket is closed by peer.

This is fine really.

sk_busy_loop_end() works fine as is for UDP/TCP sockets, and it does
not look at sk_state.

Keep in mind polling applications are using recvmsg() 20,000 times per second,
there is no point trying to optimize the last call.

>
> >
> > A listener will not engage this because sk->sk_napi_id does not
> > contain a valid NAPI ID.
> >
> >
> >
> > >
> > > Thanks.
> > >
> > > >
> > > > Using more than a lockless read of sk->sk_receive_queue is not really necessary,
> > > > and racy anyway.
> > > >
> > > > Eliezer Tamir added a check against sk_state for no good reason in
> > > > TCP, my plan is to remove it.
> > > >
> > > > There are other states where it still makes sense to allow busy polling.
> > > >
> > > >
> > > > >
> > > > > Maybe here it needs a `!(sk->sk_shutdown & RCV_SHUTDOWN)` check,
> > > > > which is set when it's closed by the peer.
> > > >
> > > > See above. Keep this as simple as possible...
> > > >
> > > >
> > > > >
> > > > > Thanks
> > > > >
> > > > > >         lock_sock(sk);
> > > > > >
> > > > > >         if (sctp_style(sk, TCP) && !sctp_sstate(sk, ESTABLISHED) &&
> > > > > > @@ -9046,12 +9050,6 @@ struct sk_buff *sctp_skb_recv_datagram(struct sock *sk, int flags, int *err)
> > > > > >                 if (sk->sk_shutdown & RCV_SHUTDOWN)
> > > > > >                         break;
> > > > > >
> > > > > > -               if (sk_can_busy_loop(sk)) {
> > > > > > -                       sk_busy_loop(sk, flags & MSG_DONTWAIT);
> > > > > > -
> > > > > > -                       if (!skb_queue_empty_lockless(&sk->sk_receive_queue))
> > > > > > -                               continue;
> > > > > > -               }
> > > > > >
> > > > > >                 /* User doesn't want to wait.  */
> > > > > >                 error = -EAGAIN;
> > > > > > --
> > > > > > 2.43.0.472.g3155946c3a-goog
> > > > > >
patchwork-bot+netdevbpf@kernel.org Jan. 4, 2024, 10:30 a.m. UTC | #7
Hello:

This patch was applied to netdev/net-next.git (main)
by David S. Miller <davem@davemloft.net>:

On Tue, 19 Dec 2023 17:00:17 +0000 you wrote:
> Busy polling while holding the socket lock makes litle sense,
> because incoming packets wont reach our receive queue.
> 
> Fixes: 8465a5fcd1ce ("sctp: add support for busy polling to sctp protocol")
> Reported-by: Jacob Moroni <jmoroni@google.com>
> Signed-off-by: Eric Dumazet <edumazet@google.com>
> Cc: Marcelo Ricardo Leitner <marcelo.leitner@gmail.com>
> Cc: Xin Long <lucien.xin@gmail.com>
> 
> [...]

Here is the summary with links:
  - [net-next] sctp: fix busy polling
    https://git.kernel.org/netdev/net-next/c/a562c0a2d651

You are awesome, thank you!
diff mbox series

Patch

diff --git a/net/sctp/socket.c b/net/sctp/socket.c
index 5fb02bbb4b349ef9ab9c2790cccb30fb4c4e897c..6b9fcdb0952a0fe599ae5d1d1cc6fa9557a3a3bc 100644
--- a/net/sctp/socket.c
+++ b/net/sctp/socket.c
@@ -2102,6 +2102,10 @@  static int sctp_recvmsg(struct sock *sk, struct msghdr *msg, size_t len,
 	if (unlikely(flags & MSG_ERRQUEUE))
 		return inet_recv_error(sk, msg, len, addr_len);
 
+	if (sk_can_busy_loop(sk) &&
+	    skb_queue_empty_lockless(&sk->sk_receive_queue))
+		sk_busy_loop(sk, flags & MSG_DONTWAIT);
+
 	lock_sock(sk);
 
 	if (sctp_style(sk, TCP) && !sctp_sstate(sk, ESTABLISHED) &&
@@ -9046,12 +9050,6 @@  struct sk_buff *sctp_skb_recv_datagram(struct sock *sk, int flags, int *err)
 		if (sk->sk_shutdown & RCV_SHUTDOWN)
 			break;
 
-		if (sk_can_busy_loop(sk)) {
-			sk_busy_loop(sk, flags & MSG_DONTWAIT);
-
-			if (!skb_queue_empty_lockless(&sk->sk_receive_queue))
-				continue;
-		}
 
 		/* User doesn't want to wait.  */
 		error = -EAGAIN;