btrfs list corruption and soft lockups while testing writeback error handling

On Fri, 2017-05-12 at 08:12 -0400, Jeff Layton wrote:
> On Thu, 2017-05-11 at 15:56 -0400, Chris Mason wrote:
> > On 05/11/2017 03:52 PM, Jeff Layton wrote:
> > > On Thu, 2017-05-11 at 07:13 -0400, Jeff Layton wrote:
> > > > I finally got my writeback error handling test to work on btrfs (thanks,
> > > > Chris!), by making the filesystem stripe the data and mirror the
> > > > metadata across two devices. The test passes now, but on one run, I got
> > > > the following list corruption warning and then a soft lockup (which is
> > > > probably fallout from the list corruption).
> > > > 
> > > > I ran the test several times before and since then without this failure,
> > > > so I don't have a clear reproducer. The kernel in this instance is
> > > > basically a v4.11 kernel with my pile of writeback error handling
> > > > patches on top:
> > > > 
> > > >     https://git.samba.org/?p=jlayton/linux.git;a=shortlog;h=refs/heads/wberr
> > > > 
> > > > It may be that they are a contributing factor, but this smells more like
> > > > a bug down in btrfs. Let me know if you need other info:
> > 
> > [ btrfs inode logging ]
> > 
> > > (cc'ing Liu Bo since we were discussing this earlier this week)
> > > 
> > > I can't reproduce this on stock v4.11, so I think this is a bug in my
> > > series.
> > > 
> > > I think this is due to the differences in how errors are being reported
> > > from filemap_fdatawait_range now causing some transactions to end up
> > > being freed while they're still on the log_ctxs list. I'm working on
> > > hunting down the problem now.
> > > 
> > > Sorry for the noise!
> > > 
> > 
> > There's a list in the inode logging code that we consistently seem to 
> > find list debugging assertions with.  We've fixed up all the known 
> > issues, but I wouldn't be surprised if we've got a goto fail in there.
> > 
> > I'll take a look ;)
> > 
> 
> Thanks. I'm running test 999 here in a loop to reproduce it on a kernel
> with my patch series applied:
> 
> https://git.samba.org/?p=jlayton/xfstests.git;a=shortlog;h=refs/heads/wberr
> 
> The patch below seems to prevent it from crashing, but I'm not at all
> sure that this is a correct fix. Still, I think that the way errors are
> tracked within btrfs might need some rework around errseq_t's. In
> principle, it could make things even simpler now that we don't need to
> worry about resetting errors that have been cleared, etc...
> 

This patch instead rolls up all of the btrfs changes in the earlier
patches so it may make a bit more sense. I also tried to clean up the
changelog. I think this is probably along the lines of what we want, but
I'd want someone with more btrfs chops to scrutinize it closely:

-----------------------8<----------------------

[PATCH] SQUASH: btrfs: convert over to errseq_t based writeback error tracking

Writeback in btrfs is somewhat complicated and it tries to use
filemap_* functions to drive it, while tracking errors with flags,
alternate error fields, etc.

With the change to errseq_t based error reporting in the kernel, we
can simplify this somewhat and ensure that errors are caught properly
even when there are parallel operations on the same inode.

The btrfs_log_ctx has an io_err field in it that gets an error stored in
it when there is an I/O error. Instead, sample the mapping's errseq_t
when the context is initialized and use that to tell whether there has
been a writeback error since that point.
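
As a rough illustration of the sample-then-check idea described above,
here is a minimal userspace sketch. It is not the kernel's errseq_t
implementation and not the code in this patch: every toy_* name is
hypothetical, and the "sequence" is just a plain counter that gets
bumped whenever an error is recorded.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Toy stand-in for an errseq_t-like value (assumption: plain counter). */
struct toy_errseq {
	int err;	/* most recent error, negative errno style; 0 if none */
	uint32_t seq;	/* bumped each time an error is recorded */
};

/* Hypothetical stand-in for an address_space carrying a writeback error. */
struct toy_mapping {
	struct toy_errseq wb_err;
};

/* Hypothetical log context: remembers the sequence seen at init time. */
struct toy_log_ctx {
	struct toy_mapping *mapping;	/* may be NULL, as in the sync-log case */
	uint32_t since;			/* sequence sampled at initialization */
};

/* Record a new error and advance the sequence. */
static void toy_errseq_set(struct toy_errseq *e, int err)
{
	e->err = err;
	e->seq++;
}

/* Take a snapshot of the current sequence. */
static uint32_t toy_errseq_sample(const struct toy_errseq *e)
{
	return e->seq;
}

/* Return the stored error if anything was recorded after "since". */
static int toy_errseq_check(const struct toy_errseq *e, uint32_t since)
{
	return e->seq != since ? e->err : 0;
}

/* Sample the mapping's error state when the context is set up. */
static void toy_init_log_ctx(struct toy_log_ctx *ctx, struct toy_mapping *mapping)
{
	ctx->mapping = mapping;
	ctx->since = mapping ? toy_errseq_sample(&mapping->wb_err) : 0;
}

/* Report an error only if one was recorded after this context's sample. */
static int toy_log_ctx_error(const struct toy_log_ctx *ctx)
{
	if (!ctx->mapping)
		return 0;
	return toy_errseq_check(&ctx->mapping->wb_err, ctx->since);
}
```

In this sketch, a context initialized before an error is recorded sees
that error on its next check, while a context initialized afterwards
does not. Checking never clears or resets anything on the mapping, so
parallel contexts on the same inode do not hide errors from each other.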

Note that btrfs_sync_log passes in NULL for the inode when initializing
the context, but that codepath doesn't seem to use the io_err field
anyway.

Signed-off-by: Jeff Layton <jlayton@redhat.com>
---
 fs/btrfs/file.c     | 17 ++++++-----------
 fs/btrfs/tree-log.c | 24 ------------------------
 fs/btrfs/tree-log.h |  7 +++++--
 3 files changed, 11 insertions(+), 37 deletions(-)
