[v4] iscsi: Perform connection failure entirely in kernel space

From: Bharath Ravi <rbharath@google.com>

Khazhismel Kumykov <khazhy@google.com> writes:

>> >> +       if (!list_empty(&conn->conn_list_err))
>> > Does this check need to be under connlock?
>>
>> My understanding is that it is not necessary: the check is serialized
>> against the conn removal itself through the rx_mutex, so it seemed safe
>> to do the verification locklessly.
>>
>> It can only race with the insertion, in which case the conn will be
>> safely removed from the dispatch list here, under rx_mutex, and the
>> worker will detect and skip it.
>
> My worry is the splice, which is done under connlock only, not rx_mutex.
> That might lead to UB if we're checking list_empty while the list_head is
> being modified?

Hi,

Please consider the v4 below with the lock added.
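
For readers following the thread, the locking concern can be sketched in
user-space C. This is only an illustration under stated assumptions, not
the code from the patch: the list helpers are minimal stand-ins for the
kernel's list_head API, pthread mutexes stand in for connlock and
rx_mutex, and conn_report_error(), conn_cancel_error() and
error_worker_run() are hypothetical names. The point is that the
emptiness check and the splice touch the same list pointers, so both are
done under the same lock.

```c
#include <assert.h>
#include <pthread.h>
#include <stdbool.h>
#include <stddef.h>

/* Minimal stand-in for the kernel's circular, doubly linked list_head. */
struct list_node { struct list_node *next, *prev; };

static void list_init(struct list_node *n) { n->next = n; n->prev = n; }
static bool list_is_empty(const struct list_node *n) { return n->next == n; }

static void list_add_tail_node(struct list_node *n, struct list_node *head)
{
    n->prev = head->prev;
    n->next = head;
    head->prev->next = n;
    head->prev = n;
}

static void list_del_init_node(struct list_node *n)
{
    n->prev->next = n->next;
    n->next->prev = n->prev;
    list_init(n);
}

/* Move all entries of 'from' to the front of 'to', leaving 'from' empty. */
static void list_splice_init_nodes(struct list_node *from, struct list_node *to)
{
    struct list_node *first = from->next, *last = from->prev;

    if (list_is_empty(from))
        return;
    first->prev = to;
    last->next = to->next;
    to->next->prev = last;
    to->next = first;
    list_init(from);
}

/* Assumption: mutexes stand in for the kernel's connlock and rx_mutex. */
static pthread_mutex_t connlock = PTHREAD_MUTEX_INITIALIZER;
static pthread_mutex_t rx_mutex = PTHREAD_MUTEX_INITIALIZER;
static struct list_node connlist_err = { &connlist_err, &connlist_err };

struct conn {
    struct list_node conn_list_err; /* empty when the conn is not queued */
    int handled;
};

void conn_init(struct conn *c) { list_init(&c->conn_list_err); c->handled = 0; }

/* Error path: queue the conn for the worker, at most once. */
void conn_report_error(struct conn *c)
{
    pthread_mutex_lock(&connlock);
    if (list_is_empty(&c->conn_list_err))
        list_add_tail_node(&c->conn_list_err, &connlist_err);
    pthread_mutex_unlock(&connlock);
}

/* Teardown path: the check and the removal both happen under connlock. */
bool conn_cancel_error(struct conn *c)
{
    bool was_queued;

    pthread_mutex_lock(&rx_mutex);
    pthread_mutex_lock(&connlock);
    was_queued = !list_is_empty(&c->conn_list_err);
    if (was_queued)
        list_del_init_node(&c->conn_list_err);
    pthread_mutex_unlock(&connlock);
    pthread_mutex_unlock(&rx_mutex);
    return was_queued;
}

/* Worker: splice under connlock, then handle one entry per rx_mutex hold. */
int error_worker_run(void)
{
    struct list_node local;
    int handled = 0;

    list_init(&local);
    pthread_mutex_lock(&connlock);
    list_splice_init_nodes(&connlist_err, &local);
    pthread_mutex_unlock(&connlock);

    for (;;) {
        struct conn *c = NULL;

        pthread_mutex_lock(&rx_mutex);
        pthread_mutex_lock(&connlock);
        if (!list_is_empty(&local)) {
            struct list_node *node = local.next;

            c = (struct conn *)((char *)node - offsetof(struct conn, conn_list_err));
            list_del_init_node(node);
        }
        pthread_mutex_unlock(&connlock);
        if (c) {
            c->handled = 1; /* placeholder for the real stop/recovery work */
            handled++;
        }
        pthread_mutex_unlock(&rx_mutex);
        if (!c)
            break;
    }
    return handled;
}
```

In this sketch the worker drops rx_mutex between entries, so a teardown
calling conn_cancel_error() never waits for the whole backlog, which is
in the spirit of the v2 change listed below.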

-- >8 --
From: Bharath Ravi <rbharath@google.com>

Connection failure processing depends on a daemon being present to (at
least) stop the connection and start recovery.  This is a problem in a
multipath scenario: if the daemon has failed for whatever reason, the
SCSI path is never marked as down, multipath won't perform the
failover, and IO to the device will wait forever for that connection
to come back.

This patch performs the connection failure entirely inside the kernel.
This way, the failover can happen and pending IO can continue even if
the daemon is dead. Once the daemon comes alive again, it can execute
recovery procedures if applicable.

Changes since v3:
  - Protect list_empty with connlock on session destroy

Changes since v2:
  - Don't hold rx_mutex for too long at once

Changes since v1:
  - Remove module parameter.
  - Always do kernel-side stop work.
  - Block recovery timeout handler if system is dying.
  - Send a CONN_TERM stop if the system is dying.

Cc: Mike Christie <mchristi@redhat.com>
Cc: Lee Duncan <LDuncan@suse.com>
Cc: Bart Van Assche <bvanassche@acm.org>
Reviewed-by: Lee Duncan <lduncan@suse.com>
Co-developed-by: Dave Clausen <dclausen@google.com>
Signed-off-by: Dave Clausen <dclausen@google.com>
Co-developed-by: Nick Black <nlb@google.com>
Signed-off-by: Nick Black <nlb@google.com>
Co-developed-by: Vaibhav Nagarnaik <vnagarnaik@google.com>
Signed-off-by: Vaibhav Nagarnaik <vnagarnaik@google.com>
Co-developed-by: Anatol Pomazau <anatol@google.com>
Signed-off-by: Anatol Pomazau <anatol@google.com>
Co-developed-by: Tahsin Erdogan <tahsin@google.com>
Signed-off-by: Tahsin Erdogan <tahsin@google.com>
Co-developed-by: Frank Mayhar <fmayhar@google.com>
Signed-off-by: Frank Mayhar <fmayhar@google.com>
Co-developed-by: Junho Ryu <jayr@google.com>
Signed-off-by: Junho Ryu <jayr@google.com>
Co-developed-by: Khazhismel Kumykov <khazhy@google.com>
Signed-off-by: Khazhismel Kumykov <khazhy@google.com>
Signed-off-by: Bharath Ravi <rbharath@google.com>
Co-developed-by: Gabriel Krisman Bertazi <krisman@collabora.com>
Signed-off-by: Gabriel Krisman Bertazi <krisman@collabora.com>
---
 drivers/scsi/scsi_transport_iscsi.c | 68 +++++++++++++++++++++++++++++
 include/scsi/scsi_transport_iscsi.h |  1 +
 2 files changed, 69 insertions(+)

Message ID	85r20g2vfw.fsf_-_@collabora.com (mailing list archive)
State	Superseded
Headers	From: Gabriel Krisman Bertazi <krisman@collabora.com>
	To: Khazhismel Kumykov <khazhy@google.com>
	Cc: lduncan@suse.com, Chris Leech <cleech@redhat.com>, jejb@linux.ibm.com, "Martin K. Petersen" <martin.petersen@oracle.com>, "'Khazhismel Kumykov' via open-iscsi" <open-iscsi@googlegroups.com>, linux-scsi@vger.kernel.org, Bharath Ravi <rbharath@google.com>, kernel@collabora.com, Mike Christie <mchristi@redhat.com>, Bart Van Assche <bvanassche@acm.org>, Dave Clausen <dclausen@google.com>, Nick Black <nlb@google.com>, Vaibhav Nagarnaik <vnagarnaik@google.com>, Anatol Pomazau <anatol@google.com>, Tahsin Erdogan <tahsin@google.com>, Frank Mayhar <fmayhar@google.com>, Junho Ryu <jayr@google.com>
	Subject: [PATCH v4] iscsi: Perform connection failure entirely in kernel space
	Organization: Collabora
	References: <20191226204746.2197233-1-krisman@collabora.com> <CACGdZYJ3hasgRV4MKpizX3rSQ1Tq4R+wDREcYXFUgx720ac5sg@mail.gmail.com> <85ftgx7mlr.fsf@collabora.com> <CACGdZYJKF85SgOt0-yHiROsqhP0K+x+XAg7CRJv_0oKt60VtvA@mail.gmail.com>
	Date: Fri, 03 Jan 2020 14:26:11 -0500
	In-Reply-To: <CACGdZYJKF85SgOt0-yHiROsqhP0K+x+XAg7CRJv_0oKt60VtvA@mail.gmail.com> (Khazhismel Kumykov's message of "Thu, 2 Jan 2020 13:24:39 -0500")
	Message-ID: <85r20g2vfw.fsf_-_@collabora.com>
	List-ID: <linux-scsi.vger.kernel.org>
Series	[v4] iscsi: Perform connection failure entirely in kernel space
