Message ID | 20230929162121.1822900-1-bigeasy@linutronix.de (mailing list archive) |
---|---|
Headers | show |
Series | net: Use SMP threads for backlog NAPI (or optional). | expand |
On Fri, 29 Sep 2023 18:20:18 +0200 Sebastian Andrzej Siewior wrote: > - Patch #2 has been removed. Removing the warning is still an option. > > - There are two patches in the series: > - Patch #1 always creates backlog threads > - Patch #2 creates the backlog threads if requested at boot time, > mandatory on PREEMPT_RT. > So it is either or and I wanted to show how both look like. > > - The kernel test robot reported a performance regression with > loopback (stress-ng --udp X --udp-ops Y) against the RFC version. > The regression is now avoided by using local-NAPI if backlog > processing is requested on the local CPU. Not what we asked for, and it doesn't apply.
On 2023-10-04 15:46:09 [-0700], Jakub Kicinski wrote: > On Fri, 29 Sep 2023 18:20:18 +0200 Sebastian Andrzej Siewior wrote: > > - Patch #2 has been removed. Removing the warning is still an option. > > > > - There are two patches in the series: > > - Patch #1 always creates backlog threads > > - Patch #2 creates the backlog threads if requested at boot time, > > mandatory on PREEMPT_RT. > > So it is either or and I wanted to show how both look like. > > > > - The kernel test robot reported a performance regression with > > loopback (stress-ng --udp X --udp-ops Y) against the RFC version. > > The regression is now avoided by using local-NAPI if backlog > > processing is requested on the local CPU. > > Not what we asked for, and it doesn't apply. Apologies if I misunderstood. You said to make it optional which I did with the static key in the second patch of this series. The first patch is indeed not what we talked about I just to show what it would look like now that there is no "delay" for backlog-NAPI on the local CPU. If the optional part is okay then I can repost only that patch against current net-next. Sebastian
On Sat, 7 Oct 2023 17:59:57 +0200 Sebastian Andrzej Siewior wrote: > Apologies if I misunderstood. You said to make it optional which I did > with the static key in the second patch of this series. The first patch > is indeed not what we talked about I just to show what it would look > like now that there is no "delay" for backlog-NAPI on the local CPU. > > If the optional part is okay then I can repost only that patch against > current net-next. Do we have reason to believe nobody uses RPS?
Sorry, getting back that late, I was traveling the last two weeks… On 2023-10-09 18:09:37 [-0700], Jakub Kicinski wrote: > On Sat, 7 Oct 2023 17:59:57 +0200 Sebastian Andrzej Siewior wrote: > > Apologies if I misunderstood. You said to make it optional which I did > > with the static key in the second patch of this series. The first patch > > is indeed not what we talked about I just to show what it would look > > like now that there is no "delay" for backlog-NAPI on the local CPU. > > > > If the optional part is okay then I can repost only that patch against > > current net-next. > > Do we have reason to believe nobody uses RPS? Not sure what you relate to. I would assume that RPS is used in general on actual devices and not on loopback where backlog is used. But it is just an assumption. The performance drop, which I observed with RPS and stress-ng --udp, is within the same range with threads and IPIs (based on memory). I can re-run the test and provide actual numbers if you want. Sebastian
On Mon, 16 Oct 2023 11:53:21 +0200 Sebastian Andrzej Siewior wrote: > > Do we have reason to believe nobody uses RPS? > > Not sure what you relate to. I would assume that RPS is used in general > on actual devices and not on loopback where backlog is used. But it is > just an assumption. > The performance drop, which I observed with RPS and stress-ng --udp, is > within the same range with threads and IPIs (based on memory). I can > re-run the test and provide actual numbers if you want. I was asking about RPS because with your current series RPS processing is forced into threads. IDK how well you can simulate the kind of workload which requires RPS. I've seen it used mostly on proxyies and gateways. For proxies Meta's experiments with threaded NAPI show regressions across the board. So "force-threading" RPS will most likely also cause regressions.
On 2023-10-16 07:17:56 [-0700], Jakub Kicinski wrote: > On Mon, 16 Oct 2023 11:53:21 +0200 Sebastian Andrzej Siewior wrote: > > > Do we have reason to believe nobody uses RPS? > > > > Not sure what you relate to. I would assume that RPS is used in general > > on actual devices and not on loopback where backlog is used. But it is > > just an assumption. > > The performance drop, which I observed with RPS and stress-ng --udp, is > > within the same range with threads and IPIs (based on memory). I can > > re-run the test and provide actual numbers if you want. > > I was asking about RPS because with your current series RPS processing > is forced into threads. IDK how well you can simulate the kind of > workload which requires RPS. I've seen it used mostly on proxyies > and gateways. For proxies Meta's experiments with threaded NAPI show > regressions across the board. So "force-threading" RPS will most likely > also cause regressions. Understood. Wandere/ Juri: Do you have any benchmark/ workload where you would see whether RPS with IPI (now) vs RPS (this patch) shows any regression? Sebastian
On 2023-10-16 16:53:39 [+0200], To Jakub Kicinski wrote: > On 2023-10-16 07:17:56 [-0700], Jakub Kicinski wrote: > > On Mon, 16 Oct 2023 11:53:21 +0200 Sebastian Andrzej Siewior wrote: > > > > Do we have reason to believe nobody uses RPS? > > > > > > Not sure what you relate to. I would assume that RPS is used in general > > > on actual devices and not on loopback where backlog is used. But it is > > > just an assumption. > > > The performance drop, which I observed with RPS and stress-ng --udp, is > > > within the same range with threads and IPIs (based on memory). I can > > > re-run the test and provide actual numbers if you want. > > > > I was asking about RPS because with your current series RPS processing > > is forced into threads. IDK how well you can simulate the kind of > > workload which requires RPS. I've seen it used mostly on proxyies > > and gateways. For proxies Meta's experiments with threaded NAPI show > > regressions across the board. So "force-threading" RPS will most likely > > also cause regressions. > > Understood. > > Wandere/ Juri: Do you have any benchmark/ workload where you would see > whether RPS with IPI (now) vs RPS (this patch) shows any regression? So I poked offlist other RH people and I've been told that they hardly ever test RPS since the NICs these days have RSS in hardware. Sebastian
On Tue, Oct 31, 2023 at 7:14 AM Sebastian Andrzej Siewior <bigeasy@linutronix.de> wrote: > > On 2023-10-16 16:53:39 [+0200], To Jakub Kicinski wrote: > > On 2023-10-16 07:17:56 [-0700], Jakub Kicinski wrote: > > > On Mon, 16 Oct 2023 11:53:21 +0200 Sebastian Andrzej Siewior wrote: > > > > > Do we have reason to believe nobody uses RPS? > > > > > > > > Not sure what you relate to. I would assume that RPS is used in general > > > > on actual devices and not on loopback where backlog is used. But it is > > > > just an assumption. > > > > The performance drop, which I observed with RPS and stress-ng --udp, is > > > > within the same range with threads and IPIs (based on memory). I can > > > > re-run the test and provide actual numbers if you want. > > > > > > I was asking about RPS because with your current series RPS processing > > > is forced into threads. IDK how well you can simulate the kind of > > > workload which requires RPS. I've seen it used mostly on proxyies > > > and gateways. For proxies Meta's experiments with threaded NAPI show > > > regressions across the board. So "force-threading" RPS will most likely > > > also cause regressions. > > > > Understood. > > > > Wandere/ Juri: Do you have any benchmark/ workload where you would see > > whether RPS with IPI (now) vs RPS (this patch) shows any regression? > > So I poked offlist other RH people and I've been told that they hardly > ever test RPS since the NICs these days have RSS in hardware. Sorry, Juri is in PTO and I am just back from sick leave and still catching up. I've been contacting some QE people, but so far it is like you said, no stress test for RPS. If I have some news, I let you know. > > Sebastian >