diff mbox series

[1/2] scsi: Avoid that .queuecommand() gets called for a quiesced SCSI device

Message ID 20190326204331.54352-2-bvanassche@acm.org (mailing list archive)
State Superseded
Headers show
Series Avoid that .queuecommand() gets called for a quiesced SCSI device | expand

Commit Message

Bart Van Assche March 26, 2019, 8:43 p.m. UTC
Several SCSI transport and LLD drivers surround code that does not
tolerate concurrent calls of .queuecommand() with scsi_target_block() /
scsi_target_unblock(). These last two functions use
blk_mq_quiesce_queue() / blk_mq_unquiesce_queue() for scsi-mq request
queues to prevent concurrent .queuecommand() calls. However, that is
not sufficient to prevent .queuecommand() calls from scsi_send_eh_cmnd().
Hence surround the .queuecommand() call from the SCSI error handler with
code that avoids that .queuecommand() gets called in the quiesced state.

Note: converting the .queuecommand() call in scsi_send_eh_cmnd() into
code that calls blk_get_request() + blk_execute_rq() is not an option
since scsi_send_eh_cmnd() must be able to make forward progress even
if all requests have been allocated.

Cc: Christoph Hellwig <hch@lst.de>
Cc: Ming Lei <ming.lei@redhat.com>
Cc: Hannes Reinecke <hare@suse.de>
Cc: Johannes Thumshirn <jthumshirn@suse.de>
Signed-off-by: Bart Van Assche <bvanassche@acm.org>
---
 drivers/scsi/scsi_error.c | 26 ++++++++++++++++++++++++--
 1 file changed, 24 insertions(+), 2 deletions(-)

Comments

Christoph Hellwig March 27, 2019, 8:13 a.m. UTC | #1
Looks good,

Reviewed-by: Christoph Hellwig <hch@lst.de>
Ming Lei March 27, 2019, 9:26 a.m. UTC | #2
On Wed, Mar 27, 2019 at 4:44 AM Bart Van Assche <bvanassche@acm.org> wrote:
>
> Several SCSI transport and LLD drivers surround code that does not
> tolerate concurrent calls of .queuecommand() with scsi_target_block() /
> scsi_target_unblock(). These last two functions use
> blk_mq_quiesce_queue() / blk_mq_unquiesce_queue() for scsi-mq request
> queues to prevent concurrent .queuecommand() calls. However, that is
> not sufficient to prevent .queuecommand() calls from scsi_send_eh_cmnd().
> Hence surround the .queuecommand() call from the SCSI error handler with
> code that avoids that .queuecommand() gets called in the quiesced state.
>
> Note: converting the .queuecommand() call in scsi_send_eh_cmnd() into
> code that calls blk_get_request() + blk_execute_rq() is not an option
> since scsi_send_eh_cmnd() must be able to make forward progress even
> if all requests have been allocated.
>
> Cc: Christoph Hellwig <hch@lst.de>
> Cc: Ming Lei <ming.lei@redhat.com>
> Cc: Hannes Reinecke <hare@suse.de>
> Cc: Johannes Thumshirn <jthumshirn@suse.de>
> Signed-off-by: Bart Van Assche <bvanassche@acm.org>
> ---
>  drivers/scsi/scsi_error.c | 26 ++++++++++++++++++++++++--
>  1 file changed, 24 insertions(+), 2 deletions(-)
>
> diff --git a/drivers/scsi/scsi_error.c b/drivers/scsi/scsi_error.c
> index 8e9680572b9f..5c9b30251abd 100644
> --- a/drivers/scsi/scsi_error.c
> +++ b/drivers/scsi/scsi_error.c
> @@ -1054,7 +1054,7 @@ static int scsi_send_eh_cmnd(struct scsi_cmnd *scmd, unsigned char *cmnd,
>         struct scsi_device *sdev = scmd->device;
>         struct Scsi_Host *shost = sdev->host;
>         DECLARE_COMPLETION_ONSTACK(done);
> -       unsigned long timeleft = timeout;
> +       unsigned long timeleft = timeout, delay;
>         struct scsi_eh_save ses;
>         const unsigned long stall_for = msecs_to_jiffies(100);
>         int rtn;
> @@ -1065,7 +1065,29 @@ static int scsi_send_eh_cmnd(struct scsi_cmnd *scmd, unsigned char *cmnd,
>
>         scsi_log_send(scmd);
>         scmd->scsi_done = scsi_eh_done;
> -       rtn = shost->hostt->queuecommand(shost, scmd);
> +
> +       /*
> +        * Lock sdev->state_mutex to avoid that scsi_device_quiesce() can
> +        * change the SCSI device state after we have examined it and before
> +        * .queuecommand() is called.
> +        */
> +       mutex_lock(&sdev->state_mutex);
> +       while (sdev->sdev_state == SDEV_QUIESCE && timeleft > 0) {
> +               mutex_unlock(&sdev->state_mutex);
> +               SCSI_LOG_ERROR_RECOVERY(5, sdev_printk(KERN_DEBUG, sdev,
> +                       "%s: state %d <> %d\n", __func__, sdev->sdev_state,
> +                       SDEV_QUIESCE));
> +               delay = min(timeleft, stall_for);
> +               timeleft -= delay;
> +               msleep(jiffies_to_msecs(delay));
> +               mutex_lock(&sdev->state_mutex);
> +       }
> +       if (sdev->sdev_state != SDEV_QUIESCE)
> +               rtn = shost->hostt->queuecommand(shost, scmd);
> +       else
> +               rtn = SCSI_MLQUEUE_DEVICE_BUSY;
> +       mutex_unlock(&sdev->state_mutex);
> +
>         if (rtn) {
>                 if (timeleft > stall_for) {
>                         scsi_eh_restore_cmnd(scmd, &ses);

Some of SCSI commands are allowed when the state is SDEV_QUIESCE,
what if these commands need EH to make progress?

Thanks,
Ming Lei
Martin K. Petersen March 29, 2019, 1:59 p.m. UTC | #3
> Some of SCSI commands are allowed when the state is SDEV_QUIESCE,
> what if these commands need EH to make progress?

Bart: Please clarify!
Bart Van Assche March 29, 2019, 3:16 p.m. UTC | #4
On Wed, 2019-03-27 at 17:26 +0800, Ming Lei wrote:
> Some of SCSI commands are allowed when the state is SDEV_QUIESCE,
> what if these commands need EH to make progress?

Since scsi_target_block() changes the device state into SDEV_BLOCK I think
this patch should check for the SDEV_BLOCK state instead of SDEV_QUIESCE. I
will repost this patch series.

Bart.
diff mbox series

Patch

diff --git a/drivers/scsi/scsi_error.c b/drivers/scsi/scsi_error.c
index 8e9680572b9f..5c9b30251abd 100644
--- a/drivers/scsi/scsi_error.c
+++ b/drivers/scsi/scsi_error.c
@@ -1054,7 +1054,7 @@  static int scsi_send_eh_cmnd(struct scsi_cmnd *scmd, unsigned char *cmnd,
 	struct scsi_device *sdev = scmd->device;
 	struct Scsi_Host *shost = sdev->host;
 	DECLARE_COMPLETION_ONSTACK(done);
-	unsigned long timeleft = timeout;
+	unsigned long timeleft = timeout, delay;
 	struct scsi_eh_save ses;
 	const unsigned long stall_for = msecs_to_jiffies(100);
 	int rtn;
@@ -1065,7 +1065,29 @@  static int scsi_send_eh_cmnd(struct scsi_cmnd *scmd, unsigned char *cmnd,
 
 	scsi_log_send(scmd);
 	scmd->scsi_done = scsi_eh_done;
-	rtn = shost->hostt->queuecommand(shost, scmd);
+
+	/*
+	 * Lock sdev->state_mutex to avoid that scsi_device_quiesce() can
+	 * change the SCSI device state after we have examined it and before
+	 * .queuecommand() is called.
+	 */
+	mutex_lock(&sdev->state_mutex);
+	while (sdev->sdev_state == SDEV_QUIESCE && timeleft > 0) {
+		mutex_unlock(&sdev->state_mutex);
+		SCSI_LOG_ERROR_RECOVERY(5, sdev_printk(KERN_DEBUG, sdev,
+			"%s: state %d <> %d\n", __func__, sdev->sdev_state,
+			SDEV_QUIESCE));
+		delay = min(timeleft, stall_for);
+		timeleft -= delay;
+		msleep(jiffies_to_msecs(delay));
+		mutex_lock(&sdev->state_mutex);
+	}
+	if (sdev->sdev_state != SDEV_QUIESCE)
+		rtn = shost->hostt->queuecommand(shost, scmd);
+	else
+		rtn = SCSI_MLQUEUE_DEVICE_BUSY;
+	mutex_unlock(&sdev->state_mutex);
+
 	if (rtn) {
 		if (timeleft > stall_for) {
 			scsi_eh_restore_cmnd(scmd, &ses);