@@ -338,6 +338,7 @@ int kiblnd_create_peer(struct lnet_ni *ni, struct kib_peer_ni **peerp,
peer_ni->ibp_queue_depth = ni->ni_net->net_tunables.lct_peer_tx_credits;
peer_ni->ibp_queue_depth_mod = 0; /* try to use the default */
kref_init(&peer_ni->ibp_kref);
+ atomic_set(&peer_ni->ibp_nconns, 0);
INIT_HLIST_NODE(&peer_ni->ibp_list);
INIT_LIST_HEAD(&peer_ni->ibp_conns);
@@ -569,7 +570,7 @@ static int kiblnd_get_completion_vector(struct kib_conn *conn, int cpt)
int vectors;
int off;
int i;
- lnet_nid_t nid = conn->ibc_peer->ibp_nid;
+ lnet_nid_t ibp_nid;
vectors = conn->ibc_cmid->device->num_comp_vectors;
if (vectors <= 1)
@@ -579,8 +580,13 @@ static int kiblnd_get_completion_vector(struct kib_conn *conn, int cpt)
if (!mask)
return 0;
- /* hash NID to CPU id in this partition... */
- off = do_div(nid, cpumask_weight(*mask));
+ /* hash NID to CPU id in this partition... when targeting a single peer
+ * with multiple QPs, salt the value with ibp_nconns so that CQ
+ * processing for that peer is spread across more cores
+ */
+ ibp_nid = conn->ibc_peer->ibp_nid +
+ atomic_read(&conn->ibc_peer->ibp_nconns);
+ off = do_div(ibp_nid, cpumask_weight(*mask));
for_each_cpu(i, *mask) {
if (!off--)
return i % vectors;
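
For illustration only, a minimal user-space sketch of the salted selection above. It assumes the partition's cpumask covers CPUs 0..7 and the device exposes 4 completion vectors (both values, like the example NID, are made up for the sketch, and for_each_cpu() is reduced to simple modular arithmetic): each additional connection to the same peer NID lands on a different CPU in the partition and hence, typically, a different completion vector.

#include <stdio.h>
#include <stdint.h>

/* Assumed values for the sketch only. */
#define CPUMASK_WEIGHT		8	/* cpumask_weight(*mask) */
#define NUM_COMP_VECTORS	4	/* ibc_cmid->device->num_comp_vectors */

/* Mirrors the selection in kiblnd_get_completion_vector(), with the
 * simplification that the partition's cpumask is CPUs 0..WEIGHT-1.
 */
static int pick_comp_vector(uint64_t nid, int nconns)
{
	uint64_t salted = nid + nconns;		/* ibp_nid + ibp_nconns */
	int off = salted % CPUMASK_WEIGHT;	/* remainder of do_div() */

	return off % NUM_COMP_VECTORS;		/* i % vectors */
}

int main(void)
{
	uint64_t nid = 0x20000c0a80a03ULL;	/* arbitrary example NID */
	int n;

	for (n = 0; n < 8; n++)
		printf("conn #%d -> comp_vector %d\n",
		       n, pick_comp_vector(nid, n));
	return 0;
}
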
@@ -889,6 +895,7 @@ struct kib_conn *kiblnd_create_conn(struct kib_peer_ni *peer_ni,
conn->ibc_state = state;
/* 1 more conn */
+ atomic_inc(&peer_ni->ibp_nconns);
atomic_inc(&net->ibn_nconns);
return conn;
@@ -954,6 +961,7 @@ void kiblnd_destroy_conn(struct kib_conn *conn)
+ atomic_dec(&peer_ni->ibp_nconns);
kiblnd_peer_decref(peer_ni);
rdma_destroy_id(cmid);
atomic_dec(&net->ibn_nconns);
}
}
@@ -522,6 +522,8 @@ struct kib_peer_ni {
u16 ibp_queue_depth;
/* reduced value which allows conn to be created if max fails */
u16 ibp_queue_depth_mod;
+ /* Number of conns allocated to this peer; salts comp_vector selection. */
+ atomic_t ibp_nconns;
};
extern struct kib_data kiblnd_data;
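
For context, a toy model (plain C, not LNet code; struct toy_peer and its helpers are stand-ins) of how the counter above tracks live connections, assuming kiblnd_create_conn() picks the completion vector before the increment at the end of the function, as in the hunk above: a connection created while N others are alive is salted with N, and the count drops again when a connection is destroyed.

#include <stdio.h>

struct toy_peer { int nconns; };

static int toy_create_conn(struct toy_peer *p)
{
	int salt = p->nconns;	/* value read by the comp_vector hash */

	p->nconns++;		/* atomic_inc(&peer_ni->ibp_nconns) */
	return salt;
}

static void toy_destroy_conn(struct toy_peer *p)
{
	p->nconns--;		/* atomic_dec(&peer_ni->ibp_nconns) */
}

int main(void)
{
	struct toy_peer peer = { 0 };
	int i;

	for (i = 0; i < 4; i++)	/* e.g. four QPs to one peer */
		printf("conn %d salted with %d\n", i, toy_create_conn(&peer));

	toy_destroy_conn(&peer);	/* one conn goes away... */
	/* ...and its replacement is salted with the current live count */
	printf("replacement salted with %d\n", toy_create_conn(&peer));
	return 0;
}
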