diff mbox series

[1/1] RDMA/irdma: Make the source udp port vary

Message ID 20211214054227.1071338-1-yanjun.zhu@linux.dev (mailing list archive)
State Superseded

Commit Message

Zhu Yanjun Dec. 14, 2021, 5:42 a.m. UTC
From: Zhu Yanjun <yanjun.zhu@linux.dev>

Following the discussion at https://www.spinics.net/lists/linux-rdma/msg73735.html,
derive the source UDP port for a QP from its local QPN instead of using
the fixed value 0xc000. This provides a better spread of traffic across
NIC RX queues. The same method was introduced for rxe in commit
d3c04a3a6870 ("IB/rxe: vary the source udp port for receive scaling")
and has been stable, so adopt it here as well.

Signed-off-by: Zhu Yanjun <yanjun.zhu@linux.dev>
---
 drivers/infiniband/hw/irdma/verbs.c | 7 ++++++-
 1 file changed, 6 insertions(+), 1 deletion(-)

Comments

Tom Talpey Dec. 14, 2021, 5:27 p.m. UTC | #1
On 12/14/2021 12:42 AM, yanjun.zhu@linux.dev wrote:
> From: Zhu Yanjun <yanjun.zhu@linux.dev>
> 
> Based on the link https://www.spinics.net/lists/linux-rdma/msg73735.html,
> get the source udp port number for a QP based on the local QPN. This
> provides a better spread of traffic across NIC RX queues.  The method in
> the commit d3c04a3a6870 ("IB/rxe: vary the source udp port for receive
> scaling") is stable. So it is also adopted in this commit.
> 
> Signed-off-by: Zhu Yanjun <yanjun.zhu@linux.dev>
> ---
>   drivers/infiniband/hw/irdma/verbs.c | 7 ++++++-
>   1 file changed, 6 insertions(+), 1 deletion(-)
> 
> diff --git a/drivers/infiniband/hw/irdma/verbs.c b/drivers/infiniband/hw/irdma/verbs.c
> index 102dc9342f2a..2697b40a539e 100644
> --- a/drivers/infiniband/hw/irdma/verbs.c
> +++ b/drivers/infiniband/hw/irdma/verbs.c
> @@ -690,6 +690,11 @@ static int irdma_cqp_create_qp_cmd(struct irdma_qp *iwqp)
>   	return status ? -ENOMEM : 0;
>   }
>   
> +static inline u16 irdma_get_src_port(struct irdma_qp *iwqp)
> +{
> +	return 0xc000 + (hash_32_generic(iwqp->ibqp.qp_num, 14) & 0x3fff);
> +}

How do you ensure the resulting port number is not already in use?

Tom.

> +
>   static void irdma_roce_fill_and_set_qpctx_info(struct irdma_qp *iwqp,
>   					       struct irdma_qp_host_ctx_info *ctx_info)
>   {
> @@ -703,7 +708,7 @@ static void irdma_roce_fill_and_set_qpctx_info(struct irdma_qp *iwqp,
>   	udp_info->cwnd = iwdev->roce_cwnd;
>   	udp_info->rexmit_thresh = 2;
>   	udp_info->rnr_nak_thresh = 2;
> -	udp_info->src_port = 0xc000;
> +	udp_info->src_port = irdma_get_src_port(iwqp);
>   	udp_info->dst_port = ROCE_V2_UDP_DPORT;
>   	roce_info = &iwqp->roce_info;
>   	ether_addr_copy(roce_info->mac_addr, iwdev->netdev->dev_addr);
Jason Gunthorpe Dec. 14, 2021, 5:29 p.m. UTC | #2
On Tue, Dec 14, 2021 at 12:27:24PM -0500, Tom Talpey wrote:
> On 12/14/2021 12:42 AM, yanjun.zhu@linux.dev wrote:
> > From: Zhu Yanjun <yanjun.zhu@linux.dev>
> > 
> > Based on the link https://www.spinics.net/lists/linux-rdma/msg73735.html,
> > get the source udp port number for a QP based on the local QPN. This
> > provides a better spread of traffic across NIC RX queues.  The method in
> > the commit d3c04a3a6870 ("IB/rxe: vary the source udp port for receive
> > scaling") is stable. So it is also adopted in this commit.
> > 
> > Signed-off-by: Zhu Yanjun <yanjun.zhu@linux.dev>
> >   drivers/infiniband/hw/irdma/verbs.c | 7 ++++++-
> >   1 file changed, 6 insertions(+), 1 deletion(-)
> > 
> > diff --git a/drivers/infiniband/hw/irdma/verbs.c b/drivers/infiniband/hw/irdma/verbs.c
> > index 102dc9342f2a..2697b40a539e 100644
> > +++ b/drivers/infiniband/hw/irdma/verbs.c
> > @@ -690,6 +690,11 @@ static int irdma_cqp_create_qp_cmd(struct irdma_qp *iwqp)
> >   	return status ? -ENOMEM : 0;
> >   }
> > +static inline u16 irdma_get_src_port(struct irdma_qp *iwqp)
> > +{
> > +	return 0xc000 + (hash_32_generic(iwqp->ibqp.qp_num, 14) & 0x3fff);
> > +}
> 
> How do you ensure the resulting port number is not already in use?

It doesn't matter, it is never used by anything, the receiver captures
all data with the roce dport and ignores the sport

Jason
Tom Talpey Dec. 14, 2021, 6:09 p.m. UTC | #3
On 12/14/2021 12:29 PM, Jason Gunthorpe wrote:
> On Tue, Dec 14, 2021 at 12:27:24PM -0500, Tom Talpey wrote:
>> On 12/14/2021 12:42 AM, yanjun.zhu@linux.dev wrote:
>>> From: Zhu Yanjun <yanjun.zhu@linux.dev>
>>>
>>> Based on the link https://www.spinics.net/lists/linux-rdma/msg73735.html,
>>> get the source udp port number for a QP based on the local QPN. This
>>> provides a better spread of traffic across NIC RX queues.  The method in
>>> the commit d3c04a3a6870 ("IB/rxe: vary the source udp port for receive
>>> scaling") is stable. So it is also adopted in this commit.
>>>
>>> Signed-off-by: Zhu Yanjun <yanjun.zhu@linux.dev>
>>>    drivers/infiniband/hw/irdma/verbs.c | 7 ++++++-
>>>    1 file changed, 6 insertions(+), 1 deletion(-)
>>>
>>> diff --git a/drivers/infiniband/hw/irdma/verbs.c b/drivers/infiniband/hw/irdma/verbs.c
>>> index 102dc9342f2a..2697b40a539e 100644
>>> +++ b/drivers/infiniband/hw/irdma/verbs.c
>>> @@ -690,6 +690,11 @@ static int irdma_cqp_create_qp_cmd(struct irdma_qp *iwqp)
>>>    	return status ? -ENOMEM : 0;
>>>    }
>>> +static inline u16 irdma_get_src_port(struct irdma_qp *iwqp)
>>> +{
>>> +	return 0xc000 + (hash_32_generic(iwqp->ibqp.qp_num, 14) & 0x3fff);
>>> +}
>>
>> How do you ensure the resulting port number is not already in use?
> 
> It doesn't matter, it is never used by anything, the receiver captures
> all data with the roce dport and ignores the sport

It still violates core networking addressing principles, and will
mightily confuse a network capture that's filtering on source ports.
Firewalls, ICMP, and similar fabric behaviors may also interfere.

SoftRoCE is forced to register/reserve the source port, isn't it?

Tom.
Jason Gunthorpe Dec. 14, 2021, 6:50 p.m. UTC | #4
On Tue, Dec 14, 2021 at 01:09:01PM -0500, Tom Talpey wrote:
> On 12/14/2021 12:29 PM, Jason Gunthorpe wrote:
> > On Tue, Dec 14, 2021 at 12:27:24PM -0500, Tom Talpey wrote:
> > > On 12/14/2021 12:42 AM, yanjun.zhu@linux.dev wrote:
> > > > From: Zhu Yanjun <yanjun.zhu@linux.dev>
> > > > 
> > > > Based on the link https://www.spinics.net/lists/linux-rdma/msg73735.html,
> > > > get the source udp port number for a QP based on the local QPN. This
> > > > provides a better spread of traffic across NIC RX queues.  The method in
> > > > the commit d3c04a3a6870 ("IB/rxe: vary the source udp port for receive
> > > > scaling") is stable. So it is also adopted in this commit.
> > > > 
> > > > Signed-off-by: Zhu Yanjun <yanjun.zhu@linux.dev>
> > > >    drivers/infiniband/hw/irdma/verbs.c | 7 ++++++-
> > > >    1 file changed, 6 insertions(+), 1 deletion(-)
> > > > 
> > > > diff --git a/drivers/infiniband/hw/irdma/verbs.c b/drivers/infiniband/hw/irdma/verbs.c
> > > > index 102dc9342f2a..2697b40a539e 100644
> > > > +++ b/drivers/infiniband/hw/irdma/verbs.c
> > > > @@ -690,6 +690,11 @@ static int irdma_cqp_create_qp_cmd(struct irdma_qp *iwqp)
> > > >    	return status ? -ENOMEM : 0;
> > > >    }
> > > > +static inline u16 irdma_get_src_port(struct irdma_qp *iwqp)
> > > > +{
> > > > +	return 0xc000 + (hash_32_generic(iwqp->ibqp.qp_num, 14) & 0x3fff);
> > > > +}
> > > 
> > > How do you ensure the resulting port number is not already in use?
> > 
> > It doesn't matter, it is never used by anything, the receiver captures
> > all data with the roce dport and ignores the sport
> 
> It still violates core networking addressing principles, and will
> mightily confuse a network capture that's filtering on source ports.
> Firewalls, ICMP, and similar fabric behaviors may also interfere.

Maybe, but most of that stuff doesn't work with roce anyhow.

> SoftRoCE is forced to register/reserve the source port, isn't it?

Logically it has to register the dest port, it receives from any
source port.

Due to the way the netstack works softroce can't do this trick either,
IIRC.

Jason
Shiraz Saleem Dec. 17, 2021, 2:02 a.m. UTC | #5
> Subject: [PATCH 1/1] RDMA/irdma: Make the source udp port vary
> 
> From: Zhu Yanjun <yanjun.zhu@linux.dev>
> 
> Based on the link https://www.spinics.net/lists/linux-rdma/msg73735.html,
> get the source udp port number for a QP based on the local QPN. This provides a
> better spread of traffic across NIC RX queues.  The method in the commit
> d3c04a3a6870 ("IB/rxe: vary the source udp port for receive
> scaling") is stable. So it is also adopted in this commit.
> 
> Signed-off-by: Zhu Yanjun <yanjun.zhu@linux.dev>
> ---
>  drivers/infiniband/hw/irdma/verbs.c | 7 ++++++-
>  1 file changed, 6 insertions(+), 1 deletion(-)
> 
> diff --git a/drivers/infiniband/hw/irdma/verbs.c b/drivers/infiniband/hw/irdma/verbs.c
> index 102dc9342f2a..2697b40a539e 100644
> --- a/drivers/infiniband/hw/irdma/verbs.c
> +++ b/drivers/infiniband/hw/irdma/verbs.c
> @@ -690,6 +690,11 @@ static int irdma_cqp_create_qp_cmd(struct irdma_qp *iwqp)
>  	return status ? -ENOMEM : 0;
>  }
> 
> +static inline u16 irdma_get_src_port(struct irdma_qp *iwqp)
> +{
> +	return 0xc000 + (hash_32_generic(iwqp->ibqp.qp_num, 14) & 0x3fff);
> +}
> +

There are core hash helpers, based on grh.flow_label or lqpn/rqpn, that RoCEv2 drivers can use to get the UDP source port:

https://elixir.bootlin.com/linux/v5.16-rc5/source/include/rdma/ib_verbs.h#L4719

Why don't we use them instead to set the udp_info->src_port in irdma_modify_qp_roce when the path address vector is provided?

Shiraz
Zhu Yanjun Dec. 18, 2021, 2:29 a.m. UTC | #6
On 2021/12/17 10:02, Saleem, Shiraz wrote:
>> Subject: [PATCH 1/1] RDMA/irdma: Make the source udp port vary
>>
>> From: Zhu Yanjun <yanjun.zhu@linux.dev>
>>
>> Based on the link https://www.spinics.net/lists/linux-rdma/msg73735.html,
>> get the source udp port number for a QP based on the local QPN. This provides a
>> better spread of traffic across NIC RX queues.  The method in the commit
>> d3c04a3a6870 ("IB/rxe: vary the source udp port for receive
>> scaling") is stable. So it is also adopted in this commit.
>>
>> Signed-off-by: Zhu Yanjun <yanjun.zhu@linux.dev>
>> ---
>>   drivers/infiniband/hw/irdma/verbs.c | 7 ++++++-
>>   1 file changed, 6 insertions(+), 1 deletion(-)
>>
>> diff --git a/drivers/infiniband/hw/irdma/verbs.c b/drivers/infiniband/hw/irdma/verbs.c
>> index 102dc9342f2a..2697b40a539e 100644
>> --- a/drivers/infiniband/hw/irdma/verbs.c
>> +++ b/drivers/infiniband/hw/irdma/verbs.c
>> @@ -690,6 +690,11 @@ static int irdma_cqp_create_qp_cmd(struct irdma_qp *iwqp)
>>   	return status ? -ENOMEM : 0;
>>   }
>>
>> +static inline u16 irdma_get_src_port(struct irdma_qp *iwqp)
>> +{
>> +	return 0xc000 + (hash_32_generic(iwqp->ibqp.qp_num, 14) & 0x3fff);
>> +}
>> +
> 
> There are core hash helpers, based on grh.flow_label or lqpn/rqpn, that RoCEv2 drivers can use to get the UDP source port:
> 
> https://elixir.bootlin.com/linux/v5.16-rc5/source/include/rdma/ib_verbs.h#L4719
> 
> Why don't we use them instead to set the udp_info->src_port in irdma_modify_qp_roce when the path address vector is provided?

Got it. I will send a new patch based on your suggestion.
Thanks.
Zhu Yanjun

> 
> Shiraz
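
For readers following Shiraz's suggestion, here is a rough userspace sketch of what the core helpers in include/rdma/ib_verbs.h appear to compute. It is reconstructed from memory of the v5.16-era header, not copied from it. The `_sketch` names and the two constants are assumptions made for illustration, so check the real rdma_get_udp_sport() and related helpers at the link above before relying on any detail.

```c
#include <stdint.h>

/* Assumed to match the kernel's 20-bit GRH flow label mask. */
#define FLOWLABEL_MASK_SKETCH 0x000fffffu
/* Assumed lower bound of the valid RoCEv2 UDP source port range. */
#define ROCE_SPORT_MIN_SKETCH 0xc000u

/* Fold the product of the local and remote QPNs into a 20-bit flow label. */
static uint32_t calc_flow_label_sketch(uint32_t lqpn, uint32_t rqpn)
{
	uint64_t v = (uint64_t)lqpn * rqpn;

	v ^= v >> 20;
	v ^= v >> 40;

	return (uint32_t)(v & FLOWLABEL_MASK_SKETCH);
}

/* Fold a 20-bit flow label into 14 bits and place it in 0xc000..0xffff. */
static uint16_t flow_label_to_udp_sport_sketch(uint32_t fl)
{
	uint32_t fl_low = fl & 0x03fff;
	uint32_t fl_high = fl & 0xfc000;

	fl_low ^= fl_high >> 14;

	return (uint16_t)(fl_low | ROCE_SPORT_MIN_SKETCH);
}

/* Use the caller's flow label if it is non-zero, else derive one from the QPNs. */
static uint16_t get_udp_sport_sketch(uint32_t fl, uint32_t lqpn, uint32_t rqpn)
{
	if (!fl)
		fl = calc_flow_label_sketch(lqpn, rqpn);

	return flow_label_to_udp_sport_sketch(fl);
}
```

Unlike the QPN-only hash in the patch, this needs the remote QPN (or a flow label), which is only known once the path address vector is provided. That is presumably why the suggestion is to set udp_info->src_port in irdma_modify_qp_roce rather than at QP creation.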

Patch

diff --git a/drivers/infiniband/hw/irdma/verbs.c b/drivers/infiniband/hw/irdma/verbs.c
index 102dc9342f2a..2697b40a539e 100644
--- a/drivers/infiniband/hw/irdma/verbs.c
+++ b/drivers/infiniband/hw/irdma/verbs.c
@@ -690,6 +690,11 @@  static int irdma_cqp_create_qp_cmd(struct irdma_qp *iwqp)
 	return status ? -ENOMEM : 0;
 }
 
+static inline u16 irdma_get_src_port(struct irdma_qp *iwqp)
+{
+	return 0xc000 + (hash_32_generic(iwqp->ibqp.qp_num, 14) & 0x3fff);
+}
+
 static void irdma_roce_fill_and_set_qpctx_info(struct irdma_qp *iwqp,
 					       struct irdma_qp_host_ctx_info *ctx_info)
 {
@@ -703,7 +708,7 @@  static void irdma_roce_fill_and_set_qpctx_info(struct irdma_qp *iwqp,
 	udp_info->cwnd = iwdev->roce_cwnd;
 	udp_info->rexmit_thresh = 2;
 	udp_info->rnr_nak_thresh = 2;
-	udp_info->src_port = 0xc000;
+	udp_info->src_port = irdma_get_src_port(iwqp);
 	udp_info->dst_port = ROCE_V2_UDP_DPORT;
 	roce_info = &iwqp->roce_info;
 	ether_addr_copy(roce_info->mac_addr, iwdev->netdev->dev_addr);
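
To see what the port calculation in this patch does outside the kernel, the following is a minimal userspace sketch. It assumes hash_32_generic() is a multiply by the kernel's GOLDEN_RATIO_32 constant (taken here to be 0x61C88647) followed by a right shift that keeps the top bits. hash_32_sketch() and src_port_from_qpn() are hypothetical stand-ins, not kernel functions.

```c
#include <stdint.h>

/* Assumed value of the kernel's GOLDEN_RATIO_32 constant. */
#define GOLDEN_RATIO_32_SKETCH 0x61C88647u

/* Stand-in for hash_32_generic(): multiplicative hash, keep the top `bits` bits. */
static uint32_t hash_32_sketch(uint32_t val, unsigned int bits)
{
	return (uint32_t)(val * GOLDEN_RATIO_32_SKETCH) >> (32 - bits);
}

/* Mirrors the arithmetic of irdma_get_src_port() from the patch. */
static uint16_t src_port_from_qpn(uint32_t qpn)
{
	/* 14 hash bits added to 0xc000 always land in 0xc000..0xffff. */
	return (uint16_t)(0xc000 + (hash_32_sketch(qpn, 14) & 0x3fff));
}
```

Under these assumptions, consecutive QPNs map to widely separated ports (QPN 1 gives 0xd872, QPN 2 gives 0xf0e4), which is the property that spreads flows across RX queues. Because the hash already returns only 14 bits, the `& 0x3fff` mask is redundant but harmless.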