[net,v3] tcp_cubic: fix incorrect HyStart round start detection

Message ID: 20250117213751.2404-1-ma.arghavani@yahoo.com (mailing list archive)
State: New
Delegated to: Netdev Maintainers
Series: [net,v3] tcp_cubic: fix incorrect HyStart round start detection

Checks

Context                         Check    Description
netdev/series_format            success  Single patches do not need cover letters
netdev/tree_selection           success  Clearly marked for net
netdev/ynl                      success  Generated files up to date; no warnings/errors; no diff in generated;
netdev/fixes_present            success  Fixes tag present in non-next series
netdev/header_inline            success  No static functions without inline keyword in header files
netdev/build_32bit              success  Errors and warnings before: 0 this patch: 0
netdev/build_tools              success  No tools touched, skip
netdev/cc_maintainers           fail     1 blamed authors not CCed: kuba@kernel.org; 4 maintainers not CCed: pabeni@redhat.com dsahern@kernel.org horms@kernel.org kuba@kernel.org
netdev/build_clang              success  Errors and warnings before: 1 this patch: 1
netdev/verify_signedoff         success  Signed-off-by tag matches author and committer
netdev/deprecated_api           success  None detected
netdev/check_selftest           success  No net selftest shell script
netdev/verify_fixes             success  Fixes tag looks correct
netdev/build_allmodconfig_warn  success  Errors and warnings before: 0 this patch: 0
netdev/checkpatch               success  total: 0 errors, 0 warnings, 0 checks, 20 lines checked
netdev/build_clang_rust         success  No Rust files in patch. Skipping build
netdev/kdoc                     success  Errors and warnings before: 0 this patch: 0
netdev/source_inline            success  Was 0 now: 0
netdev/contest                  pending  net-next-2025-01-18--00-00 (tests: 0)

Commit Message

Mahdi Arghavani Jan. 17, 2025, 9:37 p.m. UTC
I noticed that HyStart incorrectly marks the start of rounds,
which leads to inaccurate measurements of ACK train lengths and
to the `ca->sample_cnt` variable being reset at the wrong time.
This can impair HyStart's ability to terminate exponential cwnd
growth during Slow-Start, potentially degrading TCP performance.

The issue arises because commit 4e1fddc98d25
("tcp_cubic: fix spurious Hystart ACK train detections for not-cwnd-limited flows")
moved the call to `bictcp_hystart_reset` into the `hystart_update` function.
As a result, the call became subject to an additional condition:
(tcp_snd_cwnd(tp) >= hystart_low_window) must also be satisfied
before `bictcp_hystart_reset` is invoked.

This fix ensures that `bictcp_hystart_reset` is correctly called
at the start of a new round, regardless of the congestion window size.
This is achieved by moving the condition
(tcp_snd_cwnd(tp) >= hystart_low_window)
from before calling `bictcp_hystart_reset` to after it.
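
The reordering described above can be illustrated with a small user-space
model. This is only a sketch under stated assumptions, not the kernel code:
`struct hystart_model`, `seq_after`, `round_reset`, `on_ack_before` and
`on_ack_after` are hypothetical names, the reset is reduced to updating
`end_seq`, and the ACK-train and delay checks are replaced by a counter.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical, simplified stand-in for the per-connection state. */
struct hystart_model {
	uint32_t end_seq;     /* sequence number that ends the current round */
	uint32_t snd_una;     /* oldest unacknowledged sequence number */
	uint32_t snd_nxt;     /* next sequence number to be sent */
	uint32_t cwnd;        /* congestion window, in packets */
	uint32_t low_window;  /* plays the role of hystart_low_window */
	unsigned resets;      /* how many round starts were recorded */
	unsigned detect_runs; /* how many times the detection step ran */
};

/* Wrap-safe "a is after b" comparison on 32-bit sequence numbers. */
static bool seq_after(uint32_t a, uint32_t b)
{
	return (int32_t)(b - a) < 0;
}

/* Model of the round reset: only the end-of-round marker is tracked. */
static void round_reset(struct hystart_model *m)
{
	assert(m);
	m->end_seq = m->snd_nxt;
	m->resets++;
}

/* Ordering before the patch: the cwnd gate also guards the round reset. */
static void on_ack_before(struct hystart_model *m)
{
	if (m->cwnd < m->low_window)
		return;
	if (seq_after(m->snd_una, m->end_seq))
		round_reset(m);
	m->detect_runs++;
}

/* Ordering after the patch: reset first, then apply the cwnd gate. */
static void on_ack_after(struct hystart_model *m)
{
	if (seq_after(m->snd_una, m->end_seq))
		round_reset(m);
	if (m->cwnd < m->low_window)
		return;
	m->detect_runs++;
}
```

With cwnd below the threshold and `snd_una` past `end_seq`, `on_ack_before`
leaves `end_seq` stale, while `on_ack_after` records the new round but still
skips the detection step.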

I tested with a client and a server connected through two Linux software routers.
In this setup, the minimum RTT was 150 ms, the bottleneck bandwidth was 50 Mbps,
and the bottleneck buffer size was 1 BDP, calculated as (50M / 1514 / 8) * 0.150 = 619 packets.
I conducted the test twice, transferring data from the server to the client for 1.5 seconds.
Before the patch was applied, HYSTART-DELAY stopped the exponential growth of cwnd when
cwnd = 516, and the bottleneck link was not yet saturated (516 < 619).
After the patch was applied, HYSTART-ACK-TRAIN stopped the exponential growth of cwnd when
cwnd = 632, and the bottleneck link was saturated (632 > 619).
In this test, applying the patch resulted in 300 KB more data delivered.
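
The BDP figure used above can be reproduced with a few lines of C. This is a
minimal sketch: `bdp_packets` is a hypothetical helper, and it assumes the
1514-byte packet size from the calculation above and truncation to whole
packets.

```c
#include <assert.h>
#include <stdint.h>

/*
 * Bandwidth-delay product in whole packets (fractional part truncated).
 * link_bps:  bottleneck rate in bits per second
 * rtt_s:     minimum round-trip time in seconds
 * pkt_bytes: packet size in bytes (1514 in the test described above)
 */
static uint32_t bdp_packets(double link_bps, double rtt_s, uint32_t pkt_bytes)
{
	assert(pkt_bytes > 0);
	double pkts_per_sec = link_bps / 8.0 / (double)pkt_bytes;
	return (uint32_t)(pkts_per_sec * rtt_s);
}
```

Calling `bdp_packets(50e6, 0.150, 1514)` gives 619, the value that the two
observed exit points (cwnd = 516 and cwnd = 632) are compared against.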

Fixes: 4e1fddc98d25 ("tcp_cubic: fix spurious Hystart ACK train detections for not-cwnd-limited flows")
Signed-off-by: Mahdi Arghavani <ma.arghavani@yahoo.com>
Reviewed-by: Jason Xing <kerneljasonxing@gmail.com>
Cc: Neal Cardwell <ncardwell@google.com>
Cc: Eric Dumazet <edumazet@google.com>
Cc: Haibo Zhang <haibo.zhang@otago.ac.nz>
Cc: David Eyers <david.eyers@otago.ac.nz>
Cc: Abbas Arghavani <abbas.arghavani@mdu.se>
---
 net/ipv4/tcp_cubic.c | 8 +++++---
 1 file changed, 5 insertions(+), 3 deletions(-)

Comments

Neal Cardwell Jan. 17, 2025, 11:11 p.m. UTC | #1
On Fri, Jan 17, 2025 at 5:00 PM Eric Dumazet <edumazet@google.com> wrote:
>
> On Fri, Jan 17, 2025 at 10:38 PM Mahdi Arghavani <ma.arghavani@yahoo.com> wrote:
>
> SGTM thanks.
>
> Reviewed-by: Eric Dumazet <edumazet@google.com>

LGTM as well. Thanks for the fix!

Reviewed-by: Neal Cardwell <ncardwell@google.com>
Tested-by: Neal Cardwell <ncardwell@google.com>

neal

Patch

diff --git a/net/ipv4/tcp_cubic.c b/net/ipv4/tcp_cubic.c
index 5dbed91c6178..76c23675ae50 100644
--- a/net/ipv4/tcp_cubic.c
+++ b/net/ipv4/tcp_cubic.c
@@ -392,6 +392,10 @@  static void hystart_update(struct sock *sk, u32 delay)
 	if (after(tp->snd_una, ca->end_seq))
 		bictcp_hystart_reset(sk);
 
+	/* hystart triggers when cwnd is larger than some threshold */
+	if (tcp_snd_cwnd(tp) < hystart_low_window)
+		return;
+
 	if (hystart_detect & HYSTART_ACK_TRAIN) {
 		u32 now = bictcp_clock_us(sk);
 
@@ -467,9 +471,7 @@  __bpf_kfunc static void cubictcp_acked(struct sock *sk, const struct ack_sample
 	if (ca->delay_min == 0 || ca->delay_min > delay)
 		ca->delay_min = delay;
 
-	/* hystart triggers when cwnd is larger than some threshold */
-	if (!ca->found && tcp_in_slow_start(tp) && hystart &&
-	    tcp_snd_cwnd(tp) >= hystart_low_window)
+	if (!ca->found && tcp_in_slow_start(tp) && hystart)
 		hystart_update(sk, delay);
 }