diff mbox

[PATCHv3,1/1] IB/rxe: remove unnecessary skb_clone

Message ID 1518604600-4425-1-git-send-email-yanjun.zhu@oracle.com (mailing list archive)
State Superseded
Headers show

Commit Message

Zhu Yanjun Feb. 14, 2018, 10:36 a.m. UTC
In send_atomic_ack function, it is not necessary to make a
skb_clone. To gain better performance (high throughput and
low latency), this skb_clone is removed.

The following tests are made.

 server                       client
---------                    ---------
|1.1.1.1|<----rxe-channel--->|1.1.1.2|
---------                    ---------

On server: rping -s -a 1.1.1.1 -v -C 1000 -S 512
On client: rping -c -a 1.1.1.1 -v -C 1000 -S 512

The kernel config CONFIG_DEBUG_KMEMLEAK is enabled on both server
and client.

This test runs for several hours. There is no memory leak and the whole
system can work well.

As the above network, the following tests are made.

Server: ibv_rc_pingpong -d rxe0 -g 1
Client: ibv_rc_pingpong -d rxe0 -g 1 1.1.1.1

The result on Server(10 tests are made).
Before:
Throughput is 137.07 Mbit/sec
Latency is 517.76 usec/iter

After:
Throughput is 148.85 Mbit/sec
Latency is 476.64 usec/iter

The throughput is enhanced and the latency is reduced.

CC: Srinivas Eeda <srinivas.eeda@oracle.com>
CC: Junxiao Bi <junxiao.bi@oracle.com>
Signed-off-by: Zhu Yanjun <yanjun.zhu@oracle.com>

---
V2-->V3: Fix typo errors.
V1-->V2: 10 tests are made. From throughput and latency, the performance is better.
---
 drivers/infiniband/sw/rxe/rxe_resp.c | 18 +++---------------
 1 file changed, 3 insertions(+), 15 deletions(-)

Comments

Doug Ledford Feb. 20, 2018, 5:59 p.m. UTC | #1
On Wed, 2018-02-14 at 05:36 -0500, Zhu Yanjun wrote:
> In send_atomic_ack function, it is not necessary to make a
> skb_clone. To gain better performance (high throughput and
> low latency), this skb_clone is removed.
> 
> The following tests are made.
> 
>  server                       client
> ---------                    ---------
> > 1.1.1.1|<----rxe-channel--->|1.1.1.2|
> 
> ---------                    ---------
> 
> On server: rping -s -a 1.1.1.1 -v -C 1000 -S 512
> On client: rping -c -a 1.1.1.1 -v -C 1000 -S 512
> 
> The kernel config CONFIG_DEBUG_KMEMLEAK is enabled on both server
> and client.
> 
> This test runs for several hours. There is no memory leak and the whole
> system can work well.
> 
> As the above network, the following tests are made.
> 
> Server: ibv_rc_pingpong -d rxe0 -g 1
> Client: ibv_rc_pingpong -d rxe0 -g 1 1.1.1.1
> 
> The result on Server(10 tests are made).
> Before:
> Throughput is 137.07 Mbit/sec
> Latency is 517.76 usec/iter
> 
> After:
> Throughput is 148.85 Mbit/sec
> Latency is 476.64 usec/iter
> 
> The throughput is enhanced and the latency is reduced.
> 
> CC: Srinivas Eeda <srinivas.eeda@oracle.com>
> CC: Junxiao Bi <junxiao.bi@oracle.com>
> Signed-off-by: Zhu Yanjun <yanjun.zhu@oracle.com>
> 
> ---
> V2-->V3: Fix typo errors.
> V1-->V2: 10 tests are made. From throughput and latency, the performance is better.

Hi Jason,

I see that the state of this patch in patchworks is Changes Requested,
but from what I can see, the changes were requested of v2 and
implemented in v3.  Was the state of this patch in patchworks switched
by accident possibly, or is there something you were waiting on for the
v3 patch?
Zhu Yanjun Feb. 21, 2018, 1:50 a.m. UTC | #2
On 2018/2/21 1:59, Doug Ledford wrote:
> On Wed, 2018-02-14 at 05:36 -0500, Zhu Yanjun wrote:
>> In send_atomic_ack function, it is not necessary to make a
>> skb_clone. To gain better performance (high throughput and
>> low latency), this skb_clone is removed.
>>
>> The following tests are made.
>>
>>   server                       client
>> ---------                    ---------
>>> 1.1.1.1|<----rxe-channel--->|1.1.1.2|
>> ---------                    ---------
>>
>> On server: rping -s -a 1.1.1.1 -v -C 1000 -S 512
>> On client: rping -c -a 1.1.1.1 -v -C 1000 -S 512
>>
>> The kernel config CONFIG_DEBUG_KMEMLEAK is enabled on both server
>> and client.
>>
>> This test runs for several hours. There is no memory leak and the whole
>> system can work well.
>>
>> As the above network, the following tests are made.
>>
>> Server: ibv_rc_pingpong -d rxe0 -g 1
>> Client: ibv_rc_pingpong -d rxe0 -g 1 1.1.1.1
>>
>> The result on Server(10 tests are made).
>> Before:
>> Throughput is 137.07 Mbit/sec
>> Latency is 517.76 usec/iter
>>
>> After:
>> Throughput is 148.85 Mbit/sec
>> Latency is 476.64 usec/iter
>>
>> The throughput is enhanced and the latency is reduced.
>>
>> CC: Srinivas Eeda <srinivas.eeda@oracle.com>
>> CC: Junxiao Bi <junxiao.bi@oracle.com>
>> Signed-off-by: Zhu Yanjun <yanjun.zhu@oracle.com>
>>
>> ---
>> V2-->V3: Fix typo errors.
>> V1-->V2: 10 tests are made. From throughput and latency, the performance is better.
> Hi Jason,
>
> I see that the state of this patch in patchworks is Changes Requested,
> but from what I can see, the changes were requested of v2 and
> implemented in v3.  Was the state of this patch in patchworks switched
> by accident possibly, or is there something you were waiting on for the
> v3 patch?
Hi, All

I am making tests with the comments from Yonatan. After the tests are 
over, I will make V4 patch.

Zhu Yanjun
>

--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Jason Gunthorpe Feb. 21, 2018, 3:52 p.m. UTC | #3
On Tue, Feb 20, 2018 at 12:59:00PM -0500, Doug Ledford wrote:

> I see that the state of this patch in patchworks is Changes Requested,
> but from what I can see, the changes were requested of v2 and
> implemented in v3.  Was the state of this patch in patchworks switched
> by accident possibly, or is there something you were waiting on for the
> v3 patch?

Yes, the last conversation I saw on Feb 19 was that the patch would be
reworked for refcounting? I didn't think that had been done
yet.

Basically waiting for an ack from Yuval..

Jason
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Doug Ledford Feb. 21, 2018, 4:26 p.m. UTC | #4
On Wed, 2018-02-21 at 08:52 -0700, Jason Gunthorpe wrote:
> On Tue, Feb 20, 2018 at 12:59:00PM -0500, Doug Ledford wrote:
> 
> > I see that the state of this patch in patchworks is Changes Requested,
> > but from what I can see, the changes were requested of v2 and
> > implemented in v3.  Was the state of this patch in patchworks switched
> > by accident possibly, or is there something you were waiting on for the
> > v3 patch?
> 
> Yes, the last conversation I saw on Feb 19 was that the patch would be
> reworked for refcounting? I didn't think that had been done
> yet.
> 
> Basically waiting for an ack from Yuval..

Got it, thanks :-)
diff mbox

Patch

diff --git a/drivers/infiniband/sw/rxe/rxe_resp.c b/drivers/infiniband/sw/rxe/rxe_resp.c
index d37bb9b..2ebb07c 100644
--- a/drivers/infiniband/sw/rxe/rxe_resp.c
+++ b/drivers/infiniband/sw/rxe/rxe_resp.c
@@ -969,7 +969,6 @@  static int send_atomic_ack(struct rxe_qp *qp, struct rxe_pkt_info *pkt,
 	int rc = 0;
 	struct rxe_pkt_info ack_pkt;
 	struct sk_buff *skb;
-	struct sk_buff *skb_copy;
 	struct rxe_dev *rxe = to_rdev(qp->ibqp.device);
 	struct resp_res *res;
 
@@ -981,15 +980,6 @@  static int send_atomic_ack(struct rxe_qp *qp, struct rxe_pkt_info *pkt,
 		goto out;
 	}
 
-	skb_copy = skb_clone(skb, GFP_ATOMIC);
-	if (skb_copy)
-		rxe_add_ref(qp); /* for the new SKB */
-	else {
-		pr_warn("Could not clone atomic response\n");
-		rc = -ENOMEM;
-		goto out;
-	}
-
 	res = &qp->resp.resources[qp->resp.res_head];
 	free_rd_atomic_resource(qp, res);
 	rxe_advance_resp_resource(qp);
@@ -998,18 +988,16 @@  static int send_atomic_ack(struct rxe_qp *qp, struct rxe_pkt_info *pkt,
 	memset((unsigned char *)SKB_TO_PKT(skb) + sizeof(ack_pkt), 0,
 	       sizeof(skb->cb) - sizeof(ack_pkt));
 
+	refcount_inc(&skb->users);
 	res->type = RXE_ATOMIC_MASK;
 	res->atomic.skb = skb;
 	res->first_psn = ack_pkt.psn;
 	res->last_psn  = ack_pkt.psn;
 	res->cur_psn   = ack_pkt.psn;
 
-	rc = rxe_xmit_packet(rxe, qp, &ack_pkt, skb_copy);
-	if (rc) {
+	rc = rxe_xmit_packet(rxe, qp, &ack_pkt, skb);
+	if (rc)
 		pr_err_ratelimited("Failed sending ack\n");
-		rxe_drop_ref(qp);
-		kfree_skb(skb_copy);
-	}
 
 out:
 	return rc;