[v6,19/33] target: Avoid that LUN reset sporadically triggers data corruption

On Tue, 2017-02-21 at 21:42 +0000, Bart Van Assche wrote:
> On 02/20/2017 03:52 PM, Nicholas A. Bellinger wrote:
> > As-is just ignoring CMD_T_COMPLETE in core_tmr_drain_state_list() is not
> > enough to address this bug and not cause other issues, because once a
> > se_cmd descriptor is handed back to the fabric driver after
> > transport_cmd_check_stop_to_fabric() is called,
> > __target_check_io_state() must not attempt to abort the descriptor.
> 
> Please note that my patch is not ignoring CMD_T_COMPLETE - what it does
> is to postpone sending back the LUN RESET response to the initiator
> until the responses for all commands affected by the LUN RESET have been
> sent.

Not exactly.  Here are the relevant parts of the patch again for
reference:

@@ -127,7 +127,7 @@ static bool __target_check_io_state(struct se_cmd *se_cmd,
         * long as se_cmd->cmd_kref is still active unless zero.
         */
        spin_lock(&se_cmd->t_state_lock);
-       if (se_cmd->transport_state & (CMD_T_COMPLETE | CMD_T_FABRIC_STOP)) {
+       if (se_cmd->transport_state & (skip_flags | CMD_T_FABRIC_STOP)) {
                pr_debug("Attempted to abort io tag: %llu already complete or"
                        " fabric stop, skipping\n", se_cmd->tag);
                spin_unlock(&se_cmd->t_state_lock);
@@ -354,7 +355,7 @@ static void core_tmr_drain_state_list(
                        continue;
 
                spin_lock(&sess->sess_cmd_lock);
-               rc = __target_check_io_state(cmd, tmr_sess, tas);
+               rc = __target_check_io_state(cmd, 0, tmr_sess, tas);
                spin_unlock(&sess->sess_cmd_lock);
                if (!rc)
                        continue;

Your patch ignores CMD_T_COMPLETE in __target_check_io_state() for
se_cmd descriptors in se_device->state_list, in order to ensure
target_complete_ok_work() -> TFO->queue_status() gets called and
transport_cmd_check_stop_to_fabric() catches CMD_T_STOP to quiesce the
se_cmd, before the final LUN_RESET response is sent.

My concern was not that this didn't patch address the original bug, but
that it introduced other regressions as a consequence.

To my original concern if ignoring CMD_T_COMPLETE in
__target_check_io_state() after the hand-off to fabric code via
transport_cmd_check_stop_to_fabric() is a problem, this should not be an
issue considering target_remove_from_state_list() is called to remove
se_cmd from se_device->state_list before checking CMD_T_STOP, and
clearing CMD_T_ACTIVE in transport_cmd_check_stop_to_fabric().

However, there is still a race between when core_tmr_drain_state_list()
calls __target_check_io_state() to set CMD_T_ABORTED and invokes
transport_wait_for_tasks() to set CMD_T_STOP, and when
transport_cmd_check_stop_to_fabric() does the final CMD_T_STOP check
after the se_cmd hand-off back to fabric driver code.

Which means a se_cmd w/ CMD_T_ABORTED could be handed back to fabric
driver code and miss the last CMD_T_STOP check within
transport_cmd_check_stop_to_fabric() because transport_wait_for_tasks()
had not been called yet, which would leave transport_wait_for_tasks()
blocked on cmd->t_transport_stop_comp waiting for the last CMD_T_STOP
check to complete after fabric hand-off, that never comes.

The other issue with this patch is that it still sends the actual status
for the original se_cmd, even after the se_cmd has been CMD_T_ABORTED.

> I can move this patch after the patch that makes TMF handling
> synchronous because that patch makes it safe to set the CMD_T_ABORTED
> flag at any time.

To reiterate again.  Any patch intended to address a bug in existing
upstream code needs to be tested as a stand-alone patch, and not depend
upon other non bug-fix related changes.

> 
> > That said, here is how I'd like to address this particular bug.
> > 
> > 1) Allow CMD_T_COMPLETE to occur, but still ignore se_cmds that have
> > already called transport_cmd_check_stop_to_fabric().  Eg: CMD_T_ACTIVE
> > is not set, but CMD_T_SENT is set.
> 
> My understanding of your patch is that it will cause the LUN RESET
> implementation to ignore those commands for which CMD_T_FABRIC_STOP has
> been set and that commands for which CMD_T_ACTIVE has been set but
> CMD_T_SENT not will also be ignored.

No.  As per your original problem statement, the issue is when a se_cmd
has been marked as CMD_T_COMPLETE before TFO->queue_status() has
been called, which gets skipped by __target_check_io_state() and
CMD_T_ABORTED, causing LUN_RESET response to get sent before
target_complete_ok_work() completes.

Like your patch, it waits for outstanding se_cmds to complete from
backend driver core before sending the LUN_RESET response, by also
ignoring CMD_T_COMPLETE state check in __target_check_io_state() for
se_cmds specifically within core_tmr_drain_state_list().

It was different because it also checked to make ensure the hand-off
back to fabric driver code didn't already happen, which in retrospect is
unnecessary given se_cmd->active_list is already getting removed within
transport_cmd_check_stop_to_fabric().

The other thing it did was avoid invoking TFO->queue_status(), et al. in
target_complete_ok_work(), once a se_cmd had been CMD_T_ABORTED.

>  Sorry but I don't think this
> approach is sufficient to fix the data corruption issue I observed.
> 

Sure it does.  Alas, I was hoping that you'd actually prove it by
testing it, but that hasn't happened yet.  :)

Anyways, here is the updated version based on your original that I'd
like you to verify, before pushing to mainline.

It contains:

1) Closes the race between CMD_T_ABORTED + CMD_T_STOP assignment
mentioned above, by adding a CMD_T_ABORTED check in
transport_cmd_check_stop_to_fabric() to match what's already in
target_complete_cmd().

Note given your previous patch in target-pending/for-next to re-factor
transport_cmd_check_stop_to_fabric(), this will require manual patching
for stable.

2) Avoids sending the TFO->queue_status(), et al. once a se_cmd has been
CMD_T_ABORTED within target_complete_ok_work().

Why don't you give it a spin with ib_srpt as a stand-alone patch..?


[v6,19/33] target: Avoid that LUN reset sporadically triggers data corruption

Commit Message

Patch