[07/17] target/core: Use system workqueues for TMF

Message ID	20180917213554.987-8-bvanassche@acm.org (mailing list archive)
State	New, archived
Headers	show Return-Path: <target-devel-owner@kernel.org> From: Bart Van Assche <bvanassche@acm.org> To: "Martin K . Petersen" <martin.petersen@oracle.com> Cc: Christoph Hellwig <hch@lst.de>, target-devel@vger.kernel.org, Bart Van Assche <bvanassche@acm.org>, Nicholas Bellinger <nab@linux-iscsi.org>, Mike Christie <mchristi@redhat.com>, Hannes Reinecke <hare@suse.de> Subject: [PATCH 07/17] target/core: Use system workqueues for TMF Date: Mon, 17 Sep 2018 14:35:44 -0700 Message-Id: <20180917213554.987-8-bvanassche@acm.org> In-Reply-To: <20180917213554.987-1-bvanassche@acm.org> References: <20180917213554.987-1-bvanassche@acm.org> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Sender: target-devel-owner@vger.kernel.org Precedence: bulk
Series	Make ABORT and LUN RESET handling synchronous \| expand [00/17] Make ABORT and LUN RESET handling synchronous [01/17] target/core: Fix spelling in two source code comments [02/17] target/core: Use the SECTOR_SHIFT constant [03/17] target/core: Remove an unused data member from struct xcopy_pt_cmd [04/17] target/core: Change the return type of transport_init_session from void to int [05/17] target/core: Make sure that target_wait_for_sess_cmds() waits long enough [06/17] target/core: Use sg_alloc_table() instead of open-coding it [07/17] target/core: Use system workqueues for TMF [08/17] target/core: Make compare_and_write_callback() accept NULL as third argument [09/17] target/core: Always call transport_complete_callback() upon failure [10/17] target/core: Make it possible to wait from more than one context for command completion [11/17] target/core: Make ABORT and LUN RESET handling synchronous [12/17] target/core: Reduce the amount of code executed with a spinlock held [13/17] target/core: Avoid that LUN reset sporadically triggers data corruption [14/17] target/qla2xxx: Simplify the code for handling aborted commands [15/17] target/core: Remove the write_pending_status() callback function [16/17] target/core: Remove several state tests from the TMF code [17/17] target/core: Simplify the LUN RESET implementation

Message ID

20180917213554.987-8-bvanassche@acm.org (mailing list archive)

State

New, archived

Headers

From: Bart Van Assche <bvanassche@acm.org>
To: "Martin K . Petersen" <martin.petersen@oracle.com>
Cc: Christoph Hellwig <hch@lst.de>, target-devel@vger.kernel.org,
        Bart Van Assche <bvanassche@acm.org>,
        Nicholas Bellinger <nab@linux-iscsi.org>,
        Mike Christie <mchristi@redhat.com>,
        Hannes Reinecke <hare@suse.de>
Subject: [PATCH 07/17] target/core: Use system workqueues for TMF
Date: Mon, 17 Sep 2018 14:35:44 -0700
Message-Id: <20180917213554.987-8-bvanassche@acm.org>
In-Reply-To: <20180917213554.987-1-bvanassche@acm.org>
References: <20180917213554.987-1-bvanassche@acm.org>
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
Sender: target-devel-owner@vger.kernel.org
Precedence: bulk

Series

Make ABORT and LUN RESET handling synchronous | expand

Commit Message

Bart Van Assche Sept. 17, 2018, 9:35 p.m. UTC

A quote from SAM-5: "The order in which task management requests
are processed is not specified by the SCSI architecture model.
The SCSI architecture model does not require in-order delivery of
such task management requests or processing by the task manager
in the order received. To guarantee the processing order of task
management requests referencing sent to a specific logical unit,
an application client should not have more than one such task
management request pending to that logical unit." This means that
it is safe to use the system workqueues instead of tmr_wq for
processing TMFs. An intended side effect of this patch is that it
enables concurrent processing of TMFs.

Signed-off-by: Bart Van Assche <bvanassche@acm.org>
Cc: Nicholas Bellinger <nab@linux-iscsi.org>
Cc: Mike Christie <mchristi@redhat.com>
Cc: Christoph Hellwig <hch@lst.de>
Cc: Hannes Reinecke <hare@suse.de>
---
 drivers/target/target_core_device.c    | 16 ----------------
 drivers/target/target_core_transport.c |  2 +-
 include/target/target_core_base.h      |  1 -
 3 files changed, 1 insertion(+), 18 deletions(-)

Comments

Christoph Hellwig Oct. 6, 2018, 12:06 p.m. UTC | #1

Looks good,

Reviewed-by: Christoph Hellwig <hch@lst.de>

Nicholas A. Bellinger Oct. 8, 2018, 4:31 a.m. UTC | #2

On Mon, 2018-09-17 at 14:35 -0700, Bart Van Assche wrote:
> A quote from SAM-5: "The order in which task management requests
> are processed is not specified by the SCSI architecture model.
> The SCSI architecture model does not require in-order delivery of
> such task management requests or processing by the task manager
> in the order received. To guarantee the processing order of task
> management requests referencing sent to a specific logical unit,
> an application client should not have more than one such task
> management request pending to that logical unit." 

The important part is:

"The order in which task management requests are processed is not
specified by the SCSI architecture model."

And the specific bit:

".. the application client should not have more than one such task
management request pending to that logical unit."

This wording from SAM is alone not justification changing default target
behavior.  

> This means that
> it is safe to use the system workqueues instead of tmr_wq for
> processing TMFs. An intended side effect of this patch is that it
> enables concurrent processing of TMFs.
> 

This patch enables N number of TMRs to be processed in-flight to a
single se_device, using the global bounded system_wq with unlimited
max_active.

For users with ~1K /sys/kernel/config/target/core/$HBA/$DEV/
configurations, I don't see how this can possibly be beneficial.

There has never been a host environment that requires execution of N
TMRs in parallel, because as you pointed out above, the application
client should not have more than one TMR pending.

Also, I'm not aware of any hosts that have outstanding issues with the
long-standing default behavior in mainline of using a per se_device
workqueue with WQ_UNBOUNDED + max_active=1.

That said, allowing N TMRs to be executed for every device, with NO way
to control it's usage dynamically in the field for users is not
acceptable.

So what is the practical use-case..?

Bart Van Assche Oct. 8, 2018, 4:46 p.m. UTC | #3

On Sun, 2018-10-07 at 21:31 -0700, Nicholas A. Bellinger wrote:
> There has never been a host environment that requires execution of N
> TMRs in parallel.

That's wrong. Some time ago Hannes modified the SCSI initiator such that
it can have multiple SCSI abort requests outstanding. That change was made
to speed up SCSI error handling. See also commit e494f6a72839 ("[SCSI]
improved eh timeout handler") and more in particular the
SCSI_EH_ABORT_SCHEDULED flag that was introduced by that patch.

> That said, allowing N TMRs to be executed for every device, with NO way
> to control it's usage dynamically in the field for users is not
> acceptable.

I think that's also wrong. All target drivers I'm familiar with allocate a
fixed-size pool for SCSI commands and task management functions when a
session is created. In other words, the maximum number of TMFs that are
being processed at any time is limited to the size of that pool. See also
the "tag_num" argument of target_setup_session().

Bart.

diff --git a/drivers/target/target_core_device.c b/drivers/target/target_core_device.c
index 47b5ef153135..2340dab5ac2f 100644
--- a/drivers/target/target_core_device.c
+++ b/drivers/target/target_core_device.c
@@ -986,18 +986,6 @@  int target_configure_device(struct se_device *dev)
 	if (ret)
 		goto out_destroy_device;
 
-	/*
-	 * Startup the struct se_device processing thread
-	 */
-	dev->tmr_wq = alloc_workqueue("tmr-%s", WQ_MEM_RECLAIM | WQ_UNBOUND, 1,
-				      dev->transport->name);
-	if (!dev->tmr_wq) {
-		pr_err("Unable to create tmr workqueue for %s\n",
-			dev->transport->name);
-		ret = -ENOMEM;
-		goto out_free_alua;
-	}
-
 	/*
 	 * Setup work_queue for QUEUE_FULL
 	 */
@@ -1026,8 +1014,6 @@  int target_configure_device(struct se_device *dev)
 
 	return 0;
 
-out_free_alua:
-	core_alua_free_lu_gp_mem(dev);
 out_destroy_device:
 	dev->transport->destroy_device(dev);
 out_free_index:
@@ -1046,8 +1032,6 @@  void target_free_device(struct se_device *dev)
 	WARN_ON(!list_empty(&dev->dev_sep_list));
 
 	if (target_dev_configured(dev)) {
-		destroy_workqueue(dev->tmr_wq);
-
 		dev->transport->destroy_device(dev);
 
 		mutex_lock(&device_mutex);
diff --git a/drivers/target/target_core_transport.c b/drivers/target/target_core_transport.c
index 54ccd8f56a57..ec3cb16b9e0e 100644
--- a/drivers/target/target_core_transport.c
+++ b/drivers/target/target_core_transport.c
@@ -3445,7 +3445,7 @@  int transport_generic_handle_tmr(
 	}
 
 	INIT_WORK(&cmd->work, target_tmr_work);
-	queue_work(cmd->se_dev->tmr_wq, &cmd->work);
+	schedule_work(&cmd->work);
 	return 0;
 }
 EXPORT_SYMBOL(transport_generic_handle_tmr);
diff --git a/include/target/target_core_base.h b/include/target/target_core_base.h
index 2cfd3b4573b0..d084479fbe18 100644
--- a/include/target/target_core_base.h
+++ b/include/target/target_core_base.h
@@ -796,7 +796,6 @@  struct se_device {
 	struct t10_pr_registration *dev_pr_res_holder;
 	struct list_head	dev_sep_list;
 	struct list_head	dev_tmr_list;
-	struct workqueue_struct *tmr_wq;
 	struct work_struct	qf_work_queue;
 	struct list_head	delayed_cmd_list;
 	struct list_head	state_list;

[07/17] target/core: Use system workqueues for TMF

Commit Message

Comments

Patch