From patchwork Wed Dec 26 23:33:23 2018
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: James Smart <jsmart2021@gmail.com>
X-Patchwork-Id: 10743363
Return-Path: <linux-scsi-owner@kernel.org>
Received: from mail.wl.linuxfoundation.org (pdx-wl-mail.web.codeaurora.org
 [172.30.200.125])
	by pdx-korg-patchwork-2.web.codeaurora.org (Postfix) with ESMTP id 6D58813AD
	for <patchwork-linux-scsi@patchwork.kernel.org>;
 Wed, 26 Dec 2018 23:34:17 +0000 (UTC)
Received: from mail.wl.linuxfoundation.org (localhost [127.0.0.1])
	by mail.wl.linuxfoundation.org (Postfix) with ESMTP id 5EC1928710
	for <patchwork-linux-scsi@patchwork.kernel.org>;
 Wed, 26 Dec 2018 23:34:17 +0000 (UTC)
Received: by mail.wl.linuxfoundation.org (Postfix, from userid 486)
	id 531A028748; Wed, 26 Dec 2018 23:34:17 +0000 (UTC)
X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on
	pdx-wl-mail.web.codeaurora.org
X-Spam-Level: 
X-Spam-Status: No, score=-8.0 required=2.0 tests=BAYES_00,DKIM_SIGNED,
	DKIM_VALID,DKIM_VALID_AU,FREEMAIL_FROM,MAILING_LIST_MULTI,RCVD_IN_DNSWL_HI
	autolearn=ham version=3.3.1
Received: from vger.kernel.org (vger.kernel.org [209.132.180.67])
	by mail.wl.linuxfoundation.org (Postfix) with ESMTP id 3072128710
	for <patchwork-linux-scsi@patchwork.kernel.org>;
 Wed, 26 Dec 2018 23:34:11 +0000 (UTC)
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
        id S1727408AbeLZXeJ (ORCPT
        <rfc822;patchwork-linux-scsi@patchwork.kernel.org>);
        Wed, 26 Dec 2018 18:34:09 -0500
Received: from mail-yb1-f194.google.com ([209.85.219.194]:34174 "EHLO
        mail-yb1-f194.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org
        with ESMTP id S1727202AbeLZXeI (ORCPT
        <rfc822;linux-scsi@vger.kernel.org>); Wed, 26 Dec 2018 18:34:08 -0500
Received: by mail-yb1-f194.google.com with SMTP id w6so2326996ybq.1
        for <linux-scsi@vger.kernel.org>;
 Wed, 26 Dec 2018 15:34:07 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
        d=gmail.com; s=20161025;
        h=from:to:cc:subject:date:message-id:in-reply-to:references;
        bh=e9gyGj45FXs7zwKMny8q3FQ4UF0GfDcBZYb7GUTwQxY=;
        b=WeqJCULC6JtrwQrHxlqA7Pfm5N498yTw1JEMOKQxOFQNBna8Ez7ItEWiyLZnQdzaG+
         FyDLi8eFleMKRIK5oTNpm2iSCri29TVRj9L/u+ES2uydnKVYVJctfUmBQ2K9QOVynZFZ
         6MCMEMgOLNcga0fN/RNVARzZ8aXUPVuIGChE6UqNgnSb1K/RoKDNBymUp3DVpBeOrj+8
         1tw4rIT5rGDhdzUTVMadq9Fkskqom4d0SkA4IufaP/AwlbncCjpMw9IJDk5LsMwPclVP
         +CaS6kU6ubsBzSs6y5JWvNtDtC4gsPGO48e3NjvStMibeooqTH7LQLPYt7wbW1M9U3Wq
         x4qg==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
        d=1e100.net; s=20161025;
        h=x-gm-message-state:from:to:cc:subject:date:message-id:in-reply-to
         :references;
        bh=e9gyGj45FXs7zwKMny8q3FQ4UF0GfDcBZYb7GUTwQxY=;
        b=pRf119KXAuxWLlbr8+GAPPAFw2R4CNoHVtBoEu4PvB+UtKelzt+C2EYyup5b97hdSS
         58j/dYh8ReGoHfMCG7Lk8SE9qNv1k2f92tLneqNEkDrGsuEjEd7j6Kzd90ncea2q0P5e
         T2VWD73pSOPTa8Gbkn+a1yQem6e+y4rGaoEUj6vepuRaSd70rmsStz08ZvuW2g2o16MZ
         Xe+15KcekvkjtodosFlvMcpDjdGIoqRzNIrD6WSy/pUNVgsBOeReF11L0S5+mA9erJYR
         lJfasl8CC68dHvI6Ilh42H/eIizGeoqr8yjpb17YnxsEaV++quiGSF1LNKP5lbi2m2wR
         YQYA==
X-Gm-Message-State: AJcUukcd3fJwwhRNJN8s0rYAmvaOhIskSZwmwHfrKTUxC/0hu7wA4LkE
        Bat8dLPE7OTjtEQ7t8zUEZXF0y7G
X-Google-Smtp-Source: 
 AFSGD/WoUnO8iKR1ZacJC2HYDunCzUdLfA5cIPq8F80IFueAZGUF2mvyZfE/BYeCJR8HctTcdyXPXg==
X-Received: by 2002:a25:f44d:: with SMTP id
 p13mr21511726ybe.453.1545867246794;
        Wed, 26 Dec 2018 15:34:06 -0800 (PST)
Received: from os42.localdomain ([192.19.228.250])
        by smtp.gmail.com with ESMTPSA id
 h145sm13616483ywc.72.2018.12.26.15.34.05
        (version=TLS1_2 cipher=ECDHE-RSA-AES128-SHA bits=128/128);
        Wed, 26 Dec 2018 15:34:06 -0800 (PST)
From: James Smart <jsmart2021@gmail.com>
To: linux-scsi@vger.kernel.org
Cc: James Smart <jsmart2021@gmail.com>,
        Dick Kennedy <dick.kennedy@broadcom.com>
Subject: [PATCH 14/25] lpfc: Allow override of hardware queue selection
 policies
Date: Wed, 26 Dec 2018 15:33:23 -0800
Message-Id: <20181226233334.27518-15-jsmart2021@gmail.com>
X-Mailer: git-send-email 2.13.7
In-Reply-To: <20181226233334.27518-1-jsmart2021@gmail.com>
References: <20181226233334.27518-1-jsmart2021@gmail.com>
Sender: linux-scsi-owner@vger.kernel.org
Precedence: bulk
List-ID: <linux-scsi.vger.kernel.org>
X-Mailing-List: linux-scsi@vger.kernel.org
X-Virus-Scanned: ClamAV using ClamSMTP

Default behavior is to use the information from the upper io
stacks to select the hardware queue to use for io submission.
which typically has good cpu affinity.

However, the driver, when used on some variants of the upstream
kernel, has found queuing information to be suboptimal for FCP
or io completion locked on particular cpus.

For command submission situations, the lpfc_fcp_io_sched module
parameter can be set to specify a hardware queue selection policy
that overrides the os stack information.

For io completion situations, rather than queing cq processing
based on the cpu servicing the interrupting event, schedule the
cq processing on the cpu associated with the hardware queue's cq.

Signed-off-by: Dick Kennedy <dick.kennedy@broadcom.com>
Signed-off-by: James Smart <jsmart2021@gmail.com>
Reviewed-by: Hannes Reinecke <hare@suse.com>
---
 drivers/scsi/lpfc/lpfc_attr.c | 11 ++++++-----
 drivers/scsi/lpfc/lpfc_hw4.h  |  2 +-
 drivers/scsi/lpfc/lpfc_nvme.c | 14 +++++++++++---
 drivers/scsi/lpfc/lpfc_scsi.c |  3 ++-
 drivers/scsi/lpfc/lpfc_sli.c  |  2 +-
 5 files changed, 21 insertions(+), 11 deletions(-)

diff --git a/drivers/scsi/lpfc/lpfc_attr.c b/drivers/scsi/lpfc/lpfc_attr.c
index 0781d65f79ed..b00373215aa0 100644
--- a/drivers/scsi/lpfc/lpfc_attr.c
+++ b/drivers/scsi/lpfc/lpfc_attr.c
@@ -5275,11 +5275,12 @@ LPFC_ATTR_R(xri_rebalancing, 1, 0, 1, "Enable/Disable XRI rebalancing");
 /*
  * lpfc_io_sched: Determine scheduling algrithmn for issuing FCP cmds
  * range is [0,1]. Default value is 0.
- * For [0], FCP commands are issued to Work Queues ina round robin fashion.
+ * For [0], FCP commands are issued to Work Queues based on upper layer
+ * hardware queue index.
  * For [1], FCP commands are issued to a Work Queue associated with the
  *          current CPU.
  *
- * LPFC_FCP_SCHED_ROUND_ROBIN == 0
+ * LPFC_FCP_SCHED_BY_HDWQ == 0
  * LPFC_FCP_SCHED_BY_CPU == 1
  *
  * The driver dynamically sets this to 1 (BY_CPU) if it's able to set up cpu
@@ -5287,11 +5288,11 @@ LPFC_ATTR_R(xri_rebalancing, 1, 0, 1, "Enable/Disable XRI rebalancing");
  * CPU. Otherwise, the default 0 (Round Robin) scheduling of FCP/NVME I/Os
  * through WQs will be used.
  */
-LPFC_ATTR_RW(fcp_io_sched, LPFC_FCP_SCHED_ROUND_ROBIN,
-	     LPFC_FCP_SCHED_ROUND_ROBIN,
+LPFC_ATTR_RW(fcp_io_sched, LPFC_FCP_SCHED_BY_HDWQ,
+	     LPFC_FCP_SCHED_BY_HDWQ,
 	     LPFC_FCP_SCHED_BY_CPU,
 	     "Determine scheduling algorithm for "
-	     "issuing commands [0] - Round Robin, [1] - Current CPU");
+	     "issuing commands [0] - Hardware Queue, [1] - Current CPU");
 
 /*
  * lpfc_ns_query: Determine algrithmn for NameServer queries after RSCN
diff --git a/drivers/scsi/lpfc/lpfc_hw4.h b/drivers/scsi/lpfc/lpfc_hw4.h
index c15b9b6fb840..cd39845c909f 100644
--- a/drivers/scsi/lpfc/lpfc_hw4.h
+++ b/drivers/scsi/lpfc/lpfc_hw4.h
@@ -194,7 +194,7 @@ struct lpfc_sli_intf {
 #define LPFC_ACT_INTR_CNT	4
 
 /* Algrithmns for scheduling FCP commands to WQs */
-#define	LPFC_FCP_SCHED_ROUND_ROBIN	0
+#define	LPFC_FCP_SCHED_BY_HDWQ		0
 #define	LPFC_FCP_SCHED_BY_CPU		1
 
 /* Algrithmns for NameServer Query after RSCN */
diff --git a/drivers/scsi/lpfc/lpfc_nvme.c b/drivers/scsi/lpfc/lpfc_nvme.c
index 0c6c91d39e2f..c9aacd56a449 100644
--- a/drivers/scsi/lpfc/lpfc_nvme.c
+++ b/drivers/scsi/lpfc/lpfc_nvme.c
@@ -1546,8 +1546,17 @@ lpfc_nvme_fcp_io_submit(struct nvme_fc_local_port *pnvme_lport,
 		}
 	}
 
-	lpfc_ncmd = lpfc_get_nvme_buf(phba, ndlp,
-				      lpfc_queue_info->index, expedite);
+	if (phba->cfg_fcp_io_sched == LPFC_FCP_SCHED_BY_HDWQ) {
+		idx = lpfc_queue_info->index;
+	} else {
+		cpu = smp_processor_id();
+		if (cpu < phba->cfg_hdw_queue)
+			idx = cpu;
+		else
+			idx = cpu % phba->cfg_hdw_queue;
+	}
+
+	lpfc_ncmd = lpfc_get_nvme_buf(phba, ndlp, idx, expedite);
 	if (lpfc_ncmd == NULL) {
 		atomic_inc(&lport->xmt_fcp_noxri);
 		lpfc_printf_vlog(vport, KERN_INFO, LOG_NVME_IOERR,
@@ -1585,7 +1594,6 @@ lpfc_nvme_fcp_io_submit(struct nvme_fc_local_port *pnvme_lport,
 	 * index to use and that they have affinitized a CPU to this hardware
 	 * queue. A hardware queue maps to a driver MSI-X vector/EQ/CQ/WQ.
 	 */
-	idx = lpfc_queue_info->index;
 	lpfc_ncmd->cur_iocbq.hba_wqidx = idx;
 	cstat = &phba->sli4_hba.hdwq[idx].nvme_cstat;
 
diff --git a/drivers/scsi/lpfc/lpfc_scsi.c b/drivers/scsi/lpfc/lpfc_scsi.c
index 484ac9493fd0..80d19b0d3306 100644
--- a/drivers/scsi/lpfc/lpfc_scsi.c
+++ b/drivers/scsi/lpfc/lpfc_scsi.c
@@ -689,7 +689,8 @@ lpfc_get_scsi_buf_s4(struct lpfc_hba *phba, struct lpfc_nodelist *ndlp,
 
 	cpu = smp_processor_id();
 	if (cmnd && shost_use_blk_mq(cmnd->device->host) &&
-	    phba->cfg_enable_scsi_mq) {
+	    phba->cfg_enable_scsi_mq &&
+	    (phba->cfg_fcp_io_sched == LPFC_FCP_SCHED_BY_HDWQ)) {
 		tag = blk_mq_unique_tag(cmnd->request);
 		idx = blk_mq_unique_tag_to_hwq(tag);
 	} else {
diff --git a/drivers/scsi/lpfc/lpfc_sli.c b/drivers/scsi/lpfc/lpfc_sli.c
index 280a1c7d162c..b0113a56efdc 100644
--- a/drivers/scsi/lpfc/lpfc_sli.c
+++ b/drivers/scsi/lpfc/lpfc_sli.c
@@ -14107,7 +14107,7 @@ lpfc_sli4_hba_handle_eqe(struct lpfc_hba *phba, struct lpfc_eqe *eqe,
 	/* Save EQ associated with this CQ */
 	cq->assoc_qp = phba->sli4_hba.hdwq[qidx].hba_eq;
 
-	if (!queue_work(phba->wq, &cq->irqwork))
+	if (!queue_work_on(cq->chann, phba->wq, &cq->irqwork))
 		lpfc_printf_log(phba, KERN_ERR, LOG_SLI,
 				"0363 Cannot schedule soft IRQ "
 				"for CQ eqcqid=%d, cqid=%d on CPU %d\n",