From patchwork Tue Jan 22 18:14:18 2019
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: James Smart <jsmart2021@gmail.com>
X-Patchwork-Id: 10775931
Return-Path: <linux-scsi-owner@kernel.org>
Received: from mail.wl.linuxfoundation.org (pdx-wl-mail.web.codeaurora.org
 [172.30.200.125])
	by pdx-korg-patchwork-2.web.codeaurora.org (Postfix) with ESMTP id A2C8F6C2
	for <patchwork-linux-scsi@patchwork.kernel.org>;
 Tue, 22 Jan 2019 18:14:59 +0000 (UTC)
Received: from mail.wl.linuxfoundation.org (localhost [127.0.0.1])
	by mail.wl.linuxfoundation.org (Postfix) with ESMTP id 92A8A201F5
	for <patchwork-linux-scsi@patchwork.kernel.org>;
 Tue, 22 Jan 2019 18:14:59 +0000 (UTC)
Received: by mail.wl.linuxfoundation.org (Postfix, from userid 486)
	id 8799D2B982; Tue, 22 Jan 2019 18:14:59 +0000 (UTC)
X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on
	pdx-wl-mail.web.codeaurora.org
X-Spam-Level: 
X-Spam-Status: No, score=-8.0 required=2.0 tests=BAYES_00,DKIM_SIGNED,
	DKIM_VALID,DKIM_VALID_AU,FREEMAIL_FROM,MAILING_LIST_MULTI,RCVD_IN_DNSWL_HI
	autolearn=ham version=3.3.1
Received: from vger.kernel.org (vger.kernel.org [209.132.180.67])
	by mail.wl.linuxfoundation.org (Postfix) with ESMTP id F21112B99B
	for <patchwork-linux-scsi@patchwork.kernel.org>;
 Tue, 22 Jan 2019 18:14:58 +0000 (UTC)
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
        id S1726695AbfAVSO5 (ORCPT
        <rfc822;patchwork-linux-scsi@patchwork.kernel.org>);
        Tue, 22 Jan 2019 13:14:57 -0500
Received: from mail-pl1-f195.google.com ([209.85.214.195]:39115 "EHLO
        mail-pl1-f195.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org
        with ESMTP id S1726174AbfAVSOz (ORCPT
        <rfc822;linux-scsi@vger.kernel.org>); Tue, 22 Jan 2019 13:14:55 -0500
Received: by mail-pl1-f195.google.com with SMTP id 101so11855674pld.6
        for <linux-scsi@vger.kernel.org>;
 Tue, 22 Jan 2019 10:14:54 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
        d=gmail.com; s=20161025;
        h=from:to:cc:subject:date:message-id:in-reply-to:references;
        bh=oHpzqoN5bncpWAA4BmKtDUuwG3B5np50h7JfhU/GDOg=;
        b=Oyt3vWxZvwKc7f40fhitDJBEV8jw6Mn4WWZhS804zMubcm3BLNWMPv02lRuqyVx/8e
         L3Siv0NhC46pXcijgUlTgjF0lhCJA67yJFgYzyV+KnHGtE2RQFjtSTcJzpobDQ4GH7Ts
         LJwYo+tKk71Wkb8upB2sZW5SbiSfBzD5eKn0SrSkA5OOeZB6UaaBrQjq53z06PjUhoEa
         8I229YhfS3pRbou2Cu3QuYtrwMY46wPeq2nHUVyXSMcgn2tT+OOsU70ANOikG7cppQA2
         GZ21Eo3m831Ba/q2te8ogNt8ptmAr4dQWIo0el4CqK8YuXDNWABd0ZWyfVG0AgXoJQRM
         KUIQ==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
        d=1e100.net; s=20161025;
        h=x-gm-message-state:from:to:cc:subject:date:message-id:in-reply-to
         :references;
        bh=oHpzqoN5bncpWAA4BmKtDUuwG3B5np50h7JfhU/GDOg=;
        b=FGzCZPlZz6q/6yw1Maj6wK4FNYJ1fXwA5/2eogCSnaHcIC3k+bAYwKitefiXeb21e+
         GW+//8Ias970lDPX9Yp8fQJ1SIBv9JA1cUC43MZwslu+ibZAa5W7OzA2mSbcDHf2w+K6
         CdLHOhNpi7dtwL0TAtjSMd5hxGyKOEiiKbvdfAaaH68QU1z5Sw3Scw/8xH1rHOt1LW60
         YzRHpxhRd0MJa7mxqr1dP4vV64o/BclFlQ1V+qPf7vz2sRcNFfDdMkbLzJnxyQMco8gv
         Ip6cxeyn9RfLY7h1A9HDqIRxBVEGY1mzsRy1wIPdw0ThbmdXNtVbotiPCtj96vXOIAJc
         p+dg==
X-Gm-Message-State: AJcUukcCfgqiWBDF5fToJ95irnblaO/FTJ6j2hGJUVacMEmqCPxJ2/6L
        O8WMoQTXHNQYV48tFc+PKzmKGlIs
X-Google-Smtp-Source: 
 ALg8bN45QgpaCUHXzvIRd15SeHPcyl2UhYIcMNEmGxF6K4DcleTnJ/zv8aytT3UKbAr3sEl+PKkaaA==
X-Received: by 2002:a17:902:ac8f:: with SMTP id
 h15mr34364133plr.245.1548180894360;
        Tue, 22 Jan 2019 10:14:54 -0800 (PST)
Received: from pallmd1.broadcom.com ([192.19.223.250])
        by smtp.gmail.com with ESMTPSA id
 h134sm38848765pfe.27.2019.01.22.10.14.53
        (version=TLS1_2 cipher=ECDHE-RSA-AES128-SHA bits=128/128);
        Tue, 22 Jan 2019 10:14:54 -0800 (PST)
From: James Smart <jsmart2021@gmail.com>
To: linux-scsi@vger.kernel.org
Cc: James Smart <jsmart2021@gmail.com>,
        Dick Kennedy <dick.kennedy@broadcom.com>
Subject: [PATCH v2 13/26] lpfc: Allow override of hardware queue selection
 policies
Date: Tue, 22 Jan 2019 10:14:18 -0800
Message-Id: <20190122181431.8297-14-jsmart2021@gmail.com>
X-Mailer: git-send-email 2.13.7
In-Reply-To: <20190122181431.8297-1-jsmart2021@gmail.com>
References: <20190122181431.8297-1-jsmart2021@gmail.com>
Sender: linux-scsi-owner@vger.kernel.org
Precedence: bulk
List-ID: <linux-scsi.vger.kernel.org>
X-Mailing-List: linux-scsi@vger.kernel.org
X-Virus-Scanned: ClamAV using ClamSMTP

Default behavior is to use the information from the upper io
stacks to select the hardware queue to use for io submission.
which typically has good cpu affinity.

However, the driver, when used on some variants of the upstream
kernel, has found queuing information to be suboptimal for FCP
or io completion locked on particular cpus.

For command submission situations, the lpfc_fcp_io_sched module
parameter can be set to specify a hardware queue selection policy
that overrides the os stack information.

For io completion situations, rather than queing cq processing
based on the cpu servicing the interrupting event, schedule the
cq processing on the cpu associated with the hardware queue's cq.

Signed-off-by: Dick Kennedy <dick.kennedy@broadcom.com>
Signed-off-by: James Smart <jsmart2021@gmail.com>
---
v2:
 Adapt for 5.0 api changes:
   Remove all use of shost_use_scsi_mq().
   Remove references to driver flag to enable scsi_mq
---
 drivers/scsi/lpfc/lpfc_attr.c | 11 ++++++-----
 drivers/scsi/lpfc/lpfc_hw4.h  |  2 +-
 drivers/scsi/lpfc/lpfc_nvme.c | 14 +++++++++++---
 drivers/scsi/lpfc/lpfc_scsi.c |  2 +-
 drivers/scsi/lpfc/lpfc_sli.c  |  2 +-
 5 files changed, 20 insertions(+), 11 deletions(-)

diff --git a/drivers/scsi/lpfc/lpfc_attr.c b/drivers/scsi/lpfc/lpfc_attr.c
index 47aa2af885a4..93a96491899c 100644
--- a/drivers/scsi/lpfc/lpfc_attr.c
+++ b/drivers/scsi/lpfc/lpfc_attr.c
@@ -5275,11 +5275,12 @@ LPFC_ATTR_R(xri_rebalancing, 1, 0, 1, "Enable/Disable XRI rebalancing");
 /*
  * lpfc_io_sched: Determine scheduling algrithmn for issuing FCP cmds
  * range is [0,1]. Default value is 0.
- * For [0], FCP commands are issued to Work Queues ina round robin fashion.
+ * For [0], FCP commands are issued to Work Queues based on upper layer
+ * hardware queue index.
  * For [1], FCP commands are issued to a Work Queue associated with the
  *          current CPU.
  *
- * LPFC_FCP_SCHED_ROUND_ROBIN == 0
+ * LPFC_FCP_SCHED_BY_HDWQ == 0
  * LPFC_FCP_SCHED_BY_CPU == 1
  *
  * The driver dynamically sets this to 1 (BY_CPU) if it's able to set up cpu
@@ -5287,11 +5288,11 @@ LPFC_ATTR_R(xri_rebalancing, 1, 0, 1, "Enable/Disable XRI rebalancing");
  * CPU. Otherwise, the default 0 (Round Robin) scheduling of FCP/NVME I/Os
  * through WQs will be used.
  */
-LPFC_ATTR_RW(fcp_io_sched, LPFC_FCP_SCHED_ROUND_ROBIN,
-	     LPFC_FCP_SCHED_ROUND_ROBIN,
+LPFC_ATTR_RW(fcp_io_sched, LPFC_FCP_SCHED_BY_HDWQ,
+	     LPFC_FCP_SCHED_BY_HDWQ,
 	     LPFC_FCP_SCHED_BY_CPU,
 	     "Determine scheduling algorithm for "
-	     "issuing commands [0] - Round Robin, [1] - Current CPU");
+	     "issuing commands [0] - Hardware Queue, [1] - Current CPU");
 
 /*
  * lpfc_ns_query: Determine algrithmn for NameServer queries after RSCN
diff --git a/drivers/scsi/lpfc/lpfc_hw4.h b/drivers/scsi/lpfc/lpfc_hw4.h
index c15b9b6fb840..cd39845c909f 100644
--- a/drivers/scsi/lpfc/lpfc_hw4.h
+++ b/drivers/scsi/lpfc/lpfc_hw4.h
@@ -194,7 +194,7 @@ struct lpfc_sli_intf {
 #define LPFC_ACT_INTR_CNT	4
 
 /* Algrithmns for scheduling FCP commands to WQs */
-#define	LPFC_FCP_SCHED_ROUND_ROBIN	0
+#define	LPFC_FCP_SCHED_BY_HDWQ		0
 #define	LPFC_FCP_SCHED_BY_CPU		1
 
 /* Algrithmns for NameServer Query after RSCN */
diff --git a/drivers/scsi/lpfc/lpfc_nvme.c b/drivers/scsi/lpfc/lpfc_nvme.c
index 0c6c91d39e2f..c9aacd56a449 100644
--- a/drivers/scsi/lpfc/lpfc_nvme.c
+++ b/drivers/scsi/lpfc/lpfc_nvme.c
@@ -1546,8 +1546,17 @@ lpfc_nvme_fcp_io_submit(struct nvme_fc_local_port *pnvme_lport,
 		}
 	}
 
-	lpfc_ncmd = lpfc_get_nvme_buf(phba, ndlp,
-				      lpfc_queue_info->index, expedite);
+	if (phba->cfg_fcp_io_sched == LPFC_FCP_SCHED_BY_HDWQ) {
+		idx = lpfc_queue_info->index;
+	} else {
+		cpu = smp_processor_id();
+		if (cpu < phba->cfg_hdw_queue)
+			idx = cpu;
+		else
+			idx = cpu % phba->cfg_hdw_queue;
+	}
+
+	lpfc_ncmd = lpfc_get_nvme_buf(phba, ndlp, idx, expedite);
 	if (lpfc_ncmd == NULL) {
 		atomic_inc(&lport->xmt_fcp_noxri);
 		lpfc_printf_vlog(vport, KERN_INFO, LOG_NVME_IOERR,
@@ -1585,7 +1594,6 @@ lpfc_nvme_fcp_io_submit(struct nvme_fc_local_port *pnvme_lport,
 	 * index to use and that they have affinitized a CPU to this hardware
 	 * queue. A hardware queue maps to a driver MSI-X vector/EQ/CQ/WQ.
 	 */
-	idx = lpfc_queue_info->index;
 	lpfc_ncmd->cur_iocbq.hba_wqidx = idx;
 	cstat = &phba->sli4_hba.hdwq[idx].nvme_cstat;
 
diff --git a/drivers/scsi/lpfc/lpfc_scsi.c b/drivers/scsi/lpfc/lpfc_scsi.c
index c824ed3be4f9..7b22cc995d7f 100644
--- a/drivers/scsi/lpfc/lpfc_scsi.c
+++ b/drivers/scsi/lpfc/lpfc_scsi.c
@@ -688,7 +688,7 @@ lpfc_get_scsi_buf_s4(struct lpfc_hba *phba, struct lpfc_nodelist *ndlp,
 	int tag;
 
 	cpu = smp_processor_id();
-	if (cmnd) {
+	if (cmnd && phba->cfg_fcp_io_sched == LPFC_FCP_SCHED_BY_HDWQ) {
 		tag = blk_mq_unique_tag(cmnd->request);
 		idx = blk_mq_unique_tag_to_hwq(tag);
 	} else {
diff --git a/drivers/scsi/lpfc/lpfc_sli.c b/drivers/scsi/lpfc/lpfc_sli.c
index 55512300aa8e..e2f4a89a27ca 100644
--- a/drivers/scsi/lpfc/lpfc_sli.c
+++ b/drivers/scsi/lpfc/lpfc_sli.c
@@ -14106,7 +14106,7 @@ lpfc_sli4_hba_handle_eqe(struct lpfc_hba *phba, struct lpfc_eqe *eqe,
 	/* Save EQ associated with this CQ */
 	cq->assoc_qp = phba->sli4_hba.hdwq[qidx].hba_eq;
 
-	if (!queue_work(phba->wq, &cq->irqwork))
+	if (!queue_work_on(cq->chann, phba->wq, &cq->irqwork))
 		lpfc_printf_log(phba, KERN_ERR, LOG_SLI,
 				"0363 Cannot schedule soft IRQ "
 				"for CQ eqcqid=%d, cqid=%d on CPU %d\n",