[v2,3/3] generic/470: add syncfs test

Message ID 1512613327-218016-3-git-send-email-cgxu519@icloud.com (mailing list archive)
State New, archived

Commit Message

Chengguang Xu Dec. 7, 2017, 2:22 a.m. UTC
Inspired by a syncfs bug in overlayfs, which does not sync dirty inodes in
the underlying filesystem.
Run syncfs, then shut down the filesystem (or the underlying filesystem of
overlayfs) to check the syncfs result.

Signed-off-by: Chengguang Xu <cgxu519@icloud.com>
---

Changes since v1:
Use fs shutdown and fssum to check syncfs result instead of
checking delalloc state of extents.

 tests/generic/470     | 88 +++++++++++++++++++++++++++++++++++++++++++++++++++
 tests/generic/470.out |  2 ++
 tests/generic/group   |  1 +
 3 files changed, 91 insertions(+)
 create mode 100755 tests/generic/470
 create mode 100644 tests/generic/470.out

Comments

Amir Goldstein Dec. 7, 2017, 3:04 a.m. UTC | #1
On Thu, Dec 7, 2017 at 4:22 AM, Chengguang Xu <cgxu519@icloud.com> wrote:
> Inspired by syncfs bug of overlayfs which does not sync dirtyinodes in
> underlying filesystem.
> Run syncfs and shutdown filesystem(or underlying filesystem of overlayfs)
> to check syncfs result.
>
> Signed-off-by: Chengguang Xu <cgxu519@icloud.com>
> ---
>
> Changes since v1:
> Use fs shutdown and fssum to check syncfs result instead of
> checking delalloc state of extents.
>
>  tests/generic/470     | 88 +++++++++++++++++++++++++++++++++++++++++++++++++++
>  tests/generic/470.out |  2 ++
>  tests/generic/group   |  1 +
>  3 files changed, 91 insertions(+)
>  create mode 100755 tests/generic/470
>  create mode 100644 tests/generic/470.out
>
> diff --git a/tests/generic/470 b/tests/generic/470
> new file mode 100755
> index 0000000..b488747
> --- /dev/null
> +++ b/tests/generic/470
> @@ -0,0 +1,88 @@
> +#! /bin/bash
> +# FS QA Test 470
> +#
> +# Inspired by syncfs bug of overlayfs which does not sync dirty inodes in
> +# underlying filesystem.
> +#
> +# Run syncfs and shutdown filesystem(or underlying filesystem of overlayfs)
> +# to check syncfs result.
> +#
> +# Test will be skipped if filesystem(or underlying filesystem of overlayfs)
> +# does not support shutdown.
> +#
> +#-----------------------------------------------------------------------
> +# Copyright (c) 2017 Chengguang Xu <cgxu519@icloud.com>
> +# All Rights Reserved.
> +#
> +# This program is free software; you can redistribute it and/or
> +# modify it under the terms of the GNU General Public License as
> +# published by the Free Software Foundation.
> +#
> +# This program is distributed in the hope that it would be useful,
> +# but WITHOUT ANY WARRANTY; without even the implied warranty of
> +# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
> +# GNU General Public License for more details.
> +#
> +# You should have received a copy of the GNU General Public License
> +# along with this program; if not, write the Free Software Foundation,
> +# Inc.,  51 Franklin St, Fifth Floor, Boston, MA  02110-1301  USA
> +#-----------------------------------------------------------------------
> +#
> +
> +seq=`basename $0`
> +seqres=$RESULT_DIR/$seq
> +echo "QA output created by $seq"
> +
> +here=`pwd`
> +tmp=/tmp/$$
> +status=1
> +trap "_cleanup; exit \$status" 0 1 2 3 15
> +
> +_cleanup()
> +{
> +       cd /
> +       rm -f $tmp.*
> +}
> +
> +# get standard environment, filters and checks
> +. ./common/rc
> +. ./common/filter
> +
> +# remove previous $seqres.full before test
> +rm -f $seqres.full
> +
> +# real QA test starts here
> +
> +_supported_fs generic
> +_supported_os Linux
> +_require_test
> +_require_fssum
> +_require_scratch
> +_require_scratch_shutdown
> +_require_xfs_io_command "syncfs"
> +
> +
> +FCNT=1000
> +
> +_scratch_mkfs >/dev/null 2>&1
> +_scratch_mount
> +
> +# In order to mitigate interference of write-back,
> +# create many files for test.
> +for i in `seq 1 $FCNT`; do
> +       $XFS_IO_PROG -f -c "pwrite 1K 1M" \
> +               $SCRATCH_MNT/$i >/dev/null 2>&1
> +done
> +
> +$FSSUM_PROG -ugomAcdES -f -w $TEST_DIR/before_test $SCRATCH_MNT
> +$XFS_IO_PROG -c "syncfs" $SCRATCH_MNT/${FCNT} >/dev/null 2>&1
> +_scratch_shutdown
> +_scratch_cycle_mount
> +$FSSUM_PROG -ugomAcdES -f -w $TEST_DIR/after_test $SCRATCH_MNT
> +diff $TEST_DIR/before_test $TEST_DIR/after_test

Just FYI, md5sum can take multiple files so this can be written as:

md5sum $SCRATCH_MNT/$i > $TEST_DIR/before_test
...
md5sum -c $TEST_DIR/before_test > /dev/null 2>&1

Which reads a lot simpler to me.  At least I don't know what 'ugomAcdES'
stands for and for this simple use case I probably don't need to know.

Cheers,
Amir.
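
A minimal sketch of the md5sum approach suggested above, run against a throwaway directory rather than $SCRATCH_MNT. The directory, file names and file count are illustrative assumptions, not part of the patch. It records checksums for several files with one md5sum call and verifies them later with -c.

```shell
#!/bin/bash
# Hypothetical stand-in for $SCRATCH_MNT; all names here are illustrative.
workdir=$(mktemp -d)
sums=$(mktemp)

# Create a few small files to checksum.
for i in 1 2 3; do
	echo "data $i" > "$workdir/$i"
done

# md5sum accepts multiple files and writes one "hash  path" line per file.
md5sum "$workdir"/* > "$sums"

# -c re-reads every listed path and exits non-zero on any mismatch.
md5sum -c "$sums" > /dev/null 2>&1
verify_ok=$?

# Change one file; verification should now fail.
echo "changed" > "$workdir/2"
md5sum -c "$sums" > /dev/null 2>&1
verify_changed=$?

rm -rf "$workdir" "$sums"
```

Note that md5sum covers file contents only; ownership, mode and timestamps are not compared.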
--
To unsubscribe from this list: send the line "unsubscribe fstests" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Chengguang Xu Dec. 7, 2017, 3:31 a.m. UTC | #2
> 
> On Dec 7, 2017, at 11:04 AM, Amir Goldstein <amir73il@gmail.com> wrote:
> 
> On Thu, Dec 7, 2017 at 4:22 AM, Chengguang Xu <cgxu519@icloud.com> wrote:
>> Inspired by syncfs bug of overlayfs which does not sync dirtyinodes in
>> underlying filesystem.
>> Run syncfs and shutdown filesystem(or underlying filesystem of overlayfs)
>> to check syncfs result.
>> 
>> Signed-off-by: Chengguang Xu <cgxu519@icloud.com>
>> ---
>> 
>> Changes since v1:
>> Use fs shutdown and fssum to check syncfs result instead of
>> checking delalloc state of extents.
>> 
>> tests/generic/470     | 88 +++++++++++++++++++++++++++++++++++++++++++++++++++
>> tests/generic/470.out |  2 ++
>> tests/generic/group   |  1 +
>> 3 files changed, 91 insertions(+)
>> create mode 100755 tests/generic/470
>> create mode 100644 tests/generic/470.out
>> 
>> diff --git a/tests/generic/470 b/tests/generic/470
>> new file mode 100755
>> index 0000000..b488747
>> --- /dev/null
>> +++ b/tests/generic/470
>> @@ -0,0 +1,88 @@
>> +#! /bin/bash
>> +# FS QA Test 470
>> +#
>> +# Inspired by syncfs bug of overlayfs which does not sync dirty inodes in
>> +# underlying filesystem.
>> +#
>> +# Run syncfs and shutdown filesystem(or underlying filesystem of overlayfs)
>> +# to check syncfs result.
>> +#
>> +# Test will be skipped if filesystem(or underlying filesystem of overlayfs)
>> +# does not support shutdown.
>> +#
>> +#-----------------------------------------------------------------------
>> +# Copyright (c) 2017 Chengguang Xu <cgxu519@icloud.com>
>> +# All Rights Reserved.
>> +#
>> +# This program is free software; you can redistribute it and/or
>> +# modify it under the terms of the GNU General Public License as
>> +# published by the Free Software Foundation.
>> +#
>> +# This program is distributed in the hope that it would be useful,
>> +# but WITHOUT ANY WARRANTY; without even the implied warranty of
>> +# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
>> +# GNU General Public License for more details.
>> +#
>> +# You should have received a copy of the GNU General Public License
>> +# along with this program; if not, write the Free Software Foundation,
>> +# Inc.,  51 Franklin St, Fifth Floor, Boston, MA  02110-1301  USA
>> +#-----------------------------------------------------------------------
>> +#
>> +
>> +seq=`basename $0`
>> +seqres=$RESULT_DIR/$seq
>> +echo "QA output created by $seq"
>> +
>> +here=`pwd`
>> +tmp=/tmp/$$
>> +status=1
>> +trap "_cleanup; exit \$status" 0 1 2 3 15
>> +
>> +_cleanup()
>> +{
>> +       cd /
>> +       rm -f $tmp.*
>> +}
>> +
>> +# get standard environment, filters and checks
>> +. ./common/rc
>> +. ./common/filter
>> +
>> +# remove previous $seqres.full before test
>> +rm -f $seqres.full
>> +
>> +# real QA test starts here
>> +
>> +_supported_fs generic
>> +_supported_os Linux
>> +_require_test
>> +_require_fssum
>> +_require_scratch
>> +_require_scratch_shutdown
>> +_require_xfs_io_command "syncfs"
>> +
>> +
>> +FCNT=1000
>> +
>> +_scratch_mkfs >/dev/null 2>&1
>> +_scratch_mount
>> +
>> +# In order to mitigate interference of write-back,
>> +# create many files for test.
>> +for i in `seq 1 $FCNT`; do
>> +       $XFS_IO_PROG -f -c "pwrite 1K 1M" \
>> +               $SCRATCH_MNT/$i >/dev/null 2>&1
>> +done
>> +
>> +$FSSUM_PROG -ugomAcdES -f -w $TEST_DIR/before_test $SCRATCH_MNT
>> +$XFS_IO_PROG -c "syncfs" $SCRATCH_MNT/${FCNT} >/dev/null 2>&1
>> +_scratch_shutdown
>> +_scratch_cycle_mount
>> +$FSSUM_PROG -ugomAcdES -f -w $TEST_DIR/after_test $SCRATCH_MNT
>> +diff $TEST_DIR/before_test $TEST_DIR/after_test
> 
> Just FYI, md5sum can take multiple files so this can be written as:
> 
> md5sum $SCRATCH_MNT/$i > $TEST_DIR/before_test
> ...
> md5sum -c $TEST_DIR/before_test > /dev/null 2>&1
> 
> Which reads a lot simpler to me.  At least I don't know what 'ugomAcdES'
> stands for and for this simple use case I probably don't need to know.
> 

Hmm, fssum seems better than md5sum for checking metadata, as its usage below shows.
If a short comment explaining the fssum options is acceptable, I would prefer to keep it. What do you think?


$ ./fssum  -h
usage: fssum <options> <path>
  options:
    -f          : write out a full manifest file
    -w <file>   : send output to file
    -v          : verbose mode (debugging only)
    -r <file>   : read checksum or manifest from file
    -[ugoamcde] : specify which fields to include in checksum calculation.
         u      : include uid
         g      : include gid
         o      : include mode
         m      : include mtime
         a      : include atime
         c      : include ctime
         d      : include file data
         e      : include open errors (aborts otherwise)
         s      : include block structure (holes)
    -[UGOAMCDES]: exclude respective field from calculation
    -n          : reset all flags
    -N          : set all flags
    -x path     : exclude path when building checksum (multiple ok)
    -h          : this help

The default field mask is ugoamCdES. If the checksum/manifest is read from a
file, the mask is taken from there and the values given on the command line
are ignored.
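
As a reading aid for the 'ugomAcdES' string used in the test, here is a small sketch that decodes a field string according to the usage text above (lowercase includes a field, uppercase excludes it). The helper name describe_fssum_fields is hypothetical and not part of fssum.

```shell
#!/bin/bash
# Decode an fssum-style field string, one line per letter.
# describe_fssum_fields is a hypothetical helper for illustration only.
describe_fssum_fields()
{
	local flags=$1 i ch verb name
	for ((i = 0; i < ${#flags}; i++)); do
		ch=${flags:i:1}
		# Lowercase letters include a field, uppercase letters exclude it.
		case $ch in
		[ugomacdes]) verb="include" ;;
		[UGOMACDES]) verb="exclude" ;;
		*) verb="unknown" ;;
		esac
		# Field names as listed in the fssum usage text.
		case $ch in
		u|U) name="uid" ;;
		g|G) name="gid" ;;
		o|O) name="mode" ;;
		m|M) name="mtime" ;;
		a|A) name="atime" ;;
		c|C) name="ctime" ;;
		d|D) name="file data" ;;
		e|E) name="open errors" ;;
		s|S) name="block structure" ;;
		*) name="field '$ch'" ;;
		esac
		echo "$verb $name"
	done
}

describe_fssum_fields ugomAcdES
```

Read this way, the test's mask includes uid, gid, mode, mtime, ctime and file data, and excludes atime, open errors and block structure.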


Thanks,
-cgxu
Amir Goldstein Dec. 7, 2017, 3:43 a.m. UTC | #3
On Thu, Dec 7, 2017 at 5:31 AM, Chengguang Xu <cgxu519@icloud.com> wrote:
>>
>> On Dec 7, 2017, at 11:04 AM, Amir Goldstein <amir73il@gmail.com> wrote:
>>
>> On Thu, Dec 7, 2017 at 4:22 AM, Chengguang Xu <cgxu519@icloud.com> wrote:
>>> Inspired by syncfs bug of overlayfs which does not sync dirtyinodes in
>>> underlying filesystem.
>>> Run syncfs and shutdown filesystem(or underlying filesystem of overlayfs)
>>> to check syncfs result.
>>>
>>> Signed-off-by: Chengguang Xu <cgxu519@icloud.com>
>>> ---
>>>
>>> Changes since v1:
>>> Use fs shutdown and fssum to check syncfs result instead of
>>> checking delalloc state of extents.
>>>
>>> tests/generic/470     | 88 +++++++++++++++++++++++++++++++++++++++++++++++++++
>>> tests/generic/470.out |  2 ++
>>> tests/generic/group   |  1 +
>>> 3 files changed, 91 insertions(+)
>>> create mode 100755 tests/generic/470
>>> create mode 100644 tests/generic/470.out
>>>
>>> diff --git a/tests/generic/470 b/tests/generic/470
>>> new file mode 100755
>>> index 0000000..b488747
>>> --- /dev/null
>>> +++ b/tests/generic/470
>>> @@ -0,0 +1,88 @@
>>> +#! /bin/bash
>>> +# FS QA Test 470
>>> +#
>>> +# Inspired by syncfs bug of overlayfs which does not sync dirty inodes in
>>> +# underlying filesystem.
>>> +#
>>> +# Run syncfs and shutdown filesystem(or underlying filesystem of overlayfs)
>>> +# to check syncfs result.
>>> +#
>>> +# Test will be skipped if filesystem(or underlying filesystem of overlayfs)
>>> +# does not support shutdown.
>>> +#
>>> +#-----------------------------------------------------------------------
>>> +# Copyright (c) 2017 Chengguang Xu <cgxu519@icloud.com>
>>> +# All Rights Reserved.
>>> +#
>>> +# This program is free software; you can redistribute it and/or
>>> +# modify it under the terms of the GNU General Public License as
>>> +# published by the Free Software Foundation.
>>> +#
>>> +# This program is distributed in the hope that it would be useful,
>>> +# but WITHOUT ANY WARRANTY; without even the implied warranty of
>>> +# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
>>> +# GNU General Public License for more details.
>>> +#
>>> +# You should have received a copy of the GNU General Public License
>>> +# along with this program; if not, write the Free Software Foundation,
>>> +# Inc.,  51 Franklin St, Fifth Floor, Boston, MA  02110-1301  USA
>>> +#-----------------------------------------------------------------------
>>> +#
>>> +
>>> +seq=`basename $0`
>>> +seqres=$RESULT_DIR/$seq
>>> +echo "QA output created by $seq"
>>> +
>>> +here=`pwd`
>>> +tmp=/tmp/$$
>>> +status=1
>>> +trap "_cleanup; exit \$status" 0 1 2 3 15
>>> +
>>> +_cleanup()
>>> +{
>>> +       cd /
>>> +       rm -f $tmp.*
>>> +}
>>> +
>>> +# get standard environment, filters and checks
>>> +. ./common/rc
>>> +. ./common/filter
>>> +
>>> +# remove previous $seqres.full before test
>>> +rm -f $seqres.full
>>> +
>>> +# real QA test starts here
>>> +
>>> +_supported_fs generic
>>> +_supported_os Linux
>>> +_require_test
>>> +_require_fssum
>>> +_require_scratch
>>> +_require_scratch_shutdown
>>> +_require_xfs_io_command "syncfs"
>>> +
>>> +
>>> +FCNT=1000
>>> +
>>> +_scratch_mkfs >/dev/null 2>&1
>>> +_scratch_mount
>>> +
>>> +# In order to mitigate interference of write-back,
>>> +# create many files for test.
>>> +for i in `seq 1 $FCNT`; do
>>> +       $XFS_IO_PROG -f -c "pwrite 1K 1M" \
>>> +               $SCRATCH_MNT/$i >/dev/null 2>&1
>>> +done
>>> +
>>> +$FSSUM_PROG -ugomAcdES -f -w $TEST_DIR/before_test $SCRATCH_MNT
>>> +$XFS_IO_PROG -c "syncfs" $SCRATCH_MNT/${FCNT} >/dev/null 2>&1
>>> +_scratch_shutdown
>>> +_scratch_cycle_mount
>>> +$FSSUM_PROG -ugomAcdES -f -w $TEST_DIR/after_test $SCRATCH_MNT
>>> +diff $TEST_DIR/before_test $TEST_DIR/after_test
>>
>> Just FYI, md5sum can take multiple files so this can be written as:
>>
>> md5sum $SCRATCH_MNT/$i > $TEST_DIR/before_test
>> ...
>> md5sum -c $TEST_DIR/before_test > /dev/null 2>&1
>>
>> Which reads a lot simpler to me.  At least I don't know what 'ugomAcdES'
>> stands for and for this simple use case I probably don't need to know.
>>
>
> Hmm, fssum seems better than md5sum for metadata check, as it’s usage shows below.
> If short comment of fssum option is acceptable, I prefer to keep it. How do you think?
>

Short comment would be great.
Thanks,
Amir.
Eryu Guan Dec. 7, 2017, 5:44 a.m. UTC | #4
On Thu, Dec 07, 2017 at 10:22:07AM +0800, Chengguang Xu wrote:
> Inspired by syncfs bug of overlayfs which does not sync dirtyinodes in
> underlying filesystem.
> Run syncfs and shutdown filesystem(or underlying filesystem of overlayfs)
> to check syncfs result.
> 
> Signed-off-by: Chengguang Xu <cgxu519@icloud.com>
> ---
> 
> Changes since v1:
> Use fs shutdown and fssum to check syncfs result instead of
> checking delalloc state of extents.
> 
>  tests/generic/470     | 88 +++++++++++++++++++++++++++++++++++++++++++++++++++
>  tests/generic/470.out |  2 ++
>  tests/generic/group   |  1 +
>  3 files changed, 91 insertions(+)
>  create mode 100755 tests/generic/470
>  create mode 100644 tests/generic/470.out
> 
> diff --git a/tests/generic/470 b/tests/generic/470
> new file mode 100755
> index 0000000..b488747
> --- /dev/null
> +++ b/tests/generic/470
> @@ -0,0 +1,88 @@
> +#! /bin/bash
> +# FS QA Test 470
> +#
> +# Inspired by syncfs bug of overlayfs which does not sync dirty inodes in
> +# underlying filesystem. 

Trailing whitespace in above line.

> +#
> +# Run syncfs and shutdown filesystem(or underlying filesystem of overlayfs)
> +# to check syncfs result.
> +#
> +# Test will be skipped if filesystem(or underlying filesystem of overlayfs)
> +# does not support shutdown.
> +#
> +#-----------------------------------------------------------------------
> +# Copyright (c) 2017 Chengguang Xu <cgxu519@icloud.com>
> +# All Rights Reserved.
> +#
> +# This program is free software; you can redistribute it and/or
> +# modify it under the terms of the GNU General Public License as
> +# published by the Free Software Foundation.
> +#
> +# This program is distributed in the hope that it would be useful,
> +# but WITHOUT ANY WARRANTY; without even the implied warranty of
> +# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
> +# GNU General Public License for more details.
> +#
> +# You should have received a copy of the GNU General Public License
> +# along with this program; if not, write the Free Software Foundation,
> +# Inc.,  51 Franklin St, Fifth Floor, Boston, MA  02110-1301  USA
> +#-----------------------------------------------------------------------
> +#
> +
> +seq=`basename $0`
> +seqres=$RESULT_DIR/$seq
> +echo "QA output created by $seq"
> +
> +here=`pwd`
> +tmp=/tmp/$$
> +status=1
> +trap "_cleanup; exit \$status" 0 1 2 3 15
> +
> +_cleanup()
> +{
> +	cd /
> +	rm -f $tmp.*
> +}
> +
> +# get standard environment, filters and checks
> +. ./common/rc
> +. ./common/filter
> +
> +# remove previous $seqres.full before test
> +rm -f $seqres.full
> +
> +# real QA test starts here
> +
> +_supported_fs generic
> +_supported_os Linux
> +_require_test
> +_require_fssum
> +_require_scratch
> +_require_scratch_shutdown
> +_require_xfs_io_command "syncfs"
> +
> +
> +FCNT=1000
> +
> +_scratch_mkfs >/dev/null 2>&1
> +_scratch_mount
> +
> +# In order to mitigate interference of write-back,
> +# create many files for test.

Sorry, I still don't understand from this comment how writeback could
interfere with this test. What happens if we don't create such files? And
why write the files starting from offset 1k?

> +for i in `seq 1 $FCNT`; do
> +	$XFS_IO_PROG -f -c "pwrite 1K 1M" \
> +		$SCRATCH_MNT/$i >/dev/null 2>&1
> +done
> +
> +$FSSUM_PROG -ugomAcdES -f -w $TEST_DIR/before_test $SCRATCH_MNT

$tmp.fssum looks like a better choice for the fssum record file; then you
don't need _require_test anymore.

> +$XFS_IO_PROG -c "syncfs" $SCRATCH_MNT/${FCNT} >/dev/null 2>&1
> +_scratch_shutdown
> +_scratch_cycle_mount
> +$FSSUM_PROG -ugomAcdES -f -w $TEST_DIR/after_test $SCRATCH_MNT
> +diff $TEST_DIR/before_test $TEST_DIR/after_test
> +if [ $? -eq 0 ]; then
> +	status=0
> +fi

No need to check the return value of diff here. Just set status=0
unconditionally; the test will break the golden image matching anyway if
diff finds something different.

But why not use "-r" option of fssum to read and compare the checksums?
e.g.

$FSSUM_PROG -r $tmp.fssum $SCRATCH_MNT
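
To illustrate the point about golden image matching, the following self-contained sketch mimics the comparison with plain diff. The function test_body and the temporary files are hypothetical stand-ins; the real comparison is done by the fstests harness, not by this code.

```shell
#!/bin/bash
golden=$(mktemp)
before=$(mktemp)
after=$(mktemp)
printf 'Silence is golden\n' > "$golden"

# Hypothetical test body: any diff output lands on stdout ahead of the
# expected line, so no explicit check of diff's return value is needed.
test_body()
{
	diff "$before" "$after"
	echo "Silence is golden"
}

# Identical inputs: stdout matches the golden file.
echo "same" > "$before"
echo "same" > "$after"
test_body | diff -q "$golden" - > /dev/null
match_rc=$?

# Differing inputs: the extra diff output breaks the match.
echo "different" > "$after"
test_body | diff -q "$golden" - > /dev/null
mismatch_rc=$?

rm -f "$golden" "$before" "$after"
```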

Thanks,
Eryu

> +
> +echo "Silence is golden"
> +exit
> diff --git a/tests/generic/470.out b/tests/generic/470.out
> new file mode 100644
> index 0000000..79fb532
> --- /dev/null
> +++ b/tests/generic/470.out
> @@ -0,0 +1,2 @@
> +QA output created by 470
> +Silence is golden
> diff --git a/tests/generic/group b/tests/generic/group
> index 6c3bb03..493a44c 100644
> --- a/tests/generic/group
> +++ b/tests/generic/group
> @@ -472,3 +472,4 @@
>  467 auto quick exportfs
>  468 shutdown auto quick metadata
>  469 auto quick
> +470 auto quick shutdown sync
> -- 
> 1.8.3.1
> 
Chengguang Xu Dec. 7, 2017, 6:20 a.m. UTC | #5
> 
> On Dec 7, 2017, at 1:44 PM, Eryu Guan <eguan@redhat.com> wrote:
> 
> On Thu, Dec 07, 2017 at 10:22:07AM +0800, Chengguang Xu wrote:
>> Inspired by syncfs bug of overlayfs which does not sync dirtyinodes in
>> underlying filesystem.
>> Run syncfs and shutdown filesystem(or underlying filesystem of overlayfs)
>> to check syncfs result.
>> 
>> Signed-off-by: Chengguang Xu <cgxu519@icloud.com>
>> ---
>> 
>> Changes since v1:
>> Use fs shutdown and fssum to check syncfs result instead of
>> checking delalloc state of extents.
>> 
>> tests/generic/470     | 88 +++++++++++++++++++++++++++++++++++++++++++++++++++
>> tests/generic/470.out |  2 ++
>> tests/generic/group   |  1 +
>> 3 files changed, 91 insertions(+)
>> create mode 100755 tests/generic/470
>> create mode 100644 tests/generic/470.out
>> 
>> diff --git a/tests/generic/470 b/tests/generic/470
>> new file mode 100755
>> index 0000000..b488747
>> --- /dev/null
>> +++ b/tests/generic/470
>> @@ -0,0 +1,88 @@
>> +#! /bin/bash
>> +# FS QA Test 470
>> +#
>> +# Inspired by syncfs bug of overlayfs which does not sync dirty inodes in
>> +# underlying filesystem. 
> 
> Trailing whitespace in above line.
> 
>> +#
>> +# Run syncfs and shutdown filesystem(or underlying filesystem of overlayfs)
>> +# to check syncfs result.
>> +#
>> +# Test will be skipped if filesystem(or underlying filesystem of overlayfs)
>> +# does not support shutdown.
>> +#
>> +#-----------------------------------------------------------------------
>> +# Copyright (c) 2017 Chengguang Xu <cgxu519@icloud.com>
>> +# All Rights Reserved.
>> +#
>> +# This program is free software; you can redistribute it and/or
>> +# modify it under the terms of the GNU General Public License as
>> +# published by the Free Software Foundation.
>> +#
>> +# This program is distributed in the hope that it would be useful,
>> +# but WITHOUT ANY WARRANTY; without even the implied warranty of
>> +# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
>> +# GNU General Public License for more details.
>> +#
>> +# You should have received a copy of the GNU General Public License
>> +# along with this program; if not, write the Free Software Foundation,
>> +# Inc.,  51 Franklin St, Fifth Floor, Boston, MA  02110-1301  USA
>> +#-----------------------------------------------------------------------
>> +#
>> +
>> +seq=`basename $0`
>> +seqres=$RESULT_DIR/$seq
>> +echo "QA output created by $seq"
>> +
>> +here=`pwd`
>> +tmp=/tmp/$$
>> +status=1
>> +trap "_cleanup; exit \$status" 0 1 2 3 15
>> +
>> +_cleanup()
>> +{
>> +	cd /
>> +	rm -f $tmp.*
>> +}
>> +
>> +# get standard environment, filters and checks
>> +. ./common/rc
>> +. ./common/filter
>> +
>> +# remove previous $seqres.full before test
>> +rm -f $seqres.full
>> +
>> +# real QA test starts here
>> +
>> +_supported_fs generic
>> +_supported_os Linux
>> +_require_test
>> +_require_fssum
>> +_require_scratch
>> +_require_scratch_shutdown
>> +_require_xfs_io_command "syncfs"
>> +
>> +
>> +FCNT=1000
>> +
>> +_scratch_mkfs >/dev/null 2>&1
>> +_scratch_mount
>> +
>> +# In order to mitigate interference of write-back,
>> +# create many files for test.
> 
> Sorry, I still don't understand how writeback could interfere this test
> from this comment, what happens if we don't create such files? Why
> writing files starting from offset 1k?

I have no explicit explanation of how writeback interferes with this case;
there are many triggers that can make writeback start syncing.
I just want to raise the hit rate of the failure by creating as many test
files as possible, within the limits of time and other resources.

The reason for the 1k offset is the same: compared with testing a normal file,
I think a file with a hole can sometimes increase the failure rate.
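
For readers unfamiliar with the layout produced by "pwrite 1K 1M", here is a rough equivalent using dd on a temporary file (an assumption for illustration; the test itself uses xfs_io). Data starts at byte offset 1024, so the first 1 KiB is never written.

```shell
#!/bin/bash
f=$(mktemp)

# Skip the first 1024 bytes of the output, then write 1 MiB of zeros.
dd if=/dev/zero of="$f" bs=1024 seek=1 count=1024 2>/dev/null

# Resulting size is the 1 KiB gap plus 1 MiB of data.
size=$(( $(wc -c < "$f") ))
echo "$size"

rm -f "$f"
```

Whether the leading gap is stored as an unallocated hole depends on the filesystem block size.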

> 
>> +for i in `seq 1 $FCNT`; do
>> +	$XFS_IO_PROG -f -c "pwrite 1K 1M" \
>> +		$SCRATCH_MNT/$i >/dev/null 2>&1
>> +done
>> +
>> +$FSSUM_PROG -ugomAcdES -f -w $TEST_DIR/before_test $SCRATCH_MNT
> 
> $tmp.fssum looks a better choice of the fssum record file, then you
> don't need _require_test anymore.
> 
>> +$XFS_IO_PROG -c "syncfs" $SCRATCH_MNT/${FCNT} >/dev/null 2>&1
>> +_scratch_shutdown
>> +_scratch_cycle_mount
>> +$FSSUM_PROG -ugomAcdES -f -w $TEST_DIR/after_test $SCRATCH_MNT
>> +diff $TEST_DIR/before_test $TEST_DIR/after_test
>> +if [ $? -eq 0 ]; then
>> +	status=0
>> +fi
> 
> No need to check return value of diff here, just set status=0
> unconditionally, test will break the golden image matching anyway if
> diff finds something different.
> 
> But why not use "-r" option of fssum to read and compare the checksums?
> e.g.
> 
> $FSSUM_PROG -r $tmp.fssum $SCRATCH_MNT
> 
> Thanks,
> Eryu
> 
>> +
>> +echo "Silence is golden"
>> +exit
>> diff --git a/tests/generic/470.out b/tests/generic/470.out
>> new file mode 100644
>> index 0000000..79fb532
>> --- /dev/null
>> +++ b/tests/generic/470.out
>> @@ -0,0 +1,2 @@
>> +QA output created by 470
>> +Silence is golden
>> diff --git a/tests/generic/group b/tests/generic/group
>> index 6c3bb03..493a44c 100644
>> --- a/tests/generic/group
>> +++ b/tests/generic/group
>> @@ -472,3 +472,4 @@
>> 467 auto quick exportfs
>> 468 shutdown auto quick metadata
>> 469 auto quick
>> +470 auto quick shutdown sync
>> -- 
>> 1.8.3.1
>> 

Eryu Guan Dec. 7, 2017, 7:13 a.m. UTC | #6
On Thu, Dec 07, 2017 at 02:20:26PM +0800, Chengguang Xu wrote:
> > 
> > On Dec 7, 2017, at 1:44 PM, Eryu Guan <eguan@redhat.com> wrote:
> > 
> > On Thu, Dec 07, 2017 at 10:22:07AM +0800, Chengguang Xu wrote:
> >> Inspired by syncfs bug of overlayfs which does not sync dirtyinodes in
> >> underlying filesystem.
> >> Run syncfs and shutdown filesystem(or underlying filesystem of overlayfs)
> >> to check syncfs result.
> >> 
> >> Signed-off-by: Chengguang Xu <cgxu519@icloud.com>
> >> ---
> >> 
> >> Changes since v1:
> >> Use fs shutdown and fssum to check syncfs result instead of
> >> checking delalloc state of extents.
> >> 
> >> tests/generic/470     | 88 +++++++++++++++++++++++++++++++++++++++++++++++++++
> >> tests/generic/470.out |  2 ++
> >> tests/generic/group   |  1 +
> >> 3 files changed, 91 insertions(+)
> >> create mode 100755 tests/generic/470
> >> create mode 100644 tests/generic/470.out
> >> 
> >> diff --git a/tests/generic/470 b/tests/generic/470
> >> new file mode 100755
> >> index 0000000..b488747
> >> --- /dev/null
> >> +++ b/tests/generic/470
> >> @@ -0,0 +1,88 @@
> >> +#! /bin/bash
> >> +# FS QA Test 470
> >> +#
> >> +# Inspired by syncfs bug of overlayfs which does not sync dirty inodes in
> >> +# underlying filesystem. 
> > 
> > Trailing whitespace in above line.
> > 
> >> +#
> >> +# Run syncfs and shutdown filesystem(or underlying filesystem of overlayfs)
> >> +# to check syncfs result.
> >> +#
> >> +# Test will be skipped if filesystem(or underlying filesystem of overlayfs)
> >> +# does not support shutdown.
> >> +#
> >> +#-----------------------------------------------------------------------
> >> +# Copyright (c) 2017 Chengguang Xu <cgxu519@icloud.com>
> >> +# All Rights Reserved.
> >> +#
> >> +# This program is free software; you can redistribute it and/or
> >> +# modify it under the terms of the GNU General Public License as
> >> +# published by the Free Software Foundation.
> >> +#
> >> +# This program is distributed in the hope that it would be useful,
> >> +# but WITHOUT ANY WARRANTY; without even the implied warranty of
> >> +# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
> >> +# GNU General Public License for more details.
> >> +#
> >> +# You should have received a copy of the GNU General Public License
> >> +# along with this program; if not, write the Free Software Foundation,
> >> +# Inc.,  51 Franklin St, Fifth Floor, Boston, MA  02110-1301  USA
> >> +#-----------------------------------------------------------------------
> >> +#
> >> +
> >> +seq=`basename $0`
> >> +seqres=$RESULT_DIR/$seq
> >> +echo "QA output created by $seq"
> >> +
> >> +here=`pwd`
> >> +tmp=/tmp/$$
> >> +status=1
> >> +trap "_cleanup; exit \$status" 0 1 2 3 15
> >> +
> >> +_cleanup()
> >> +{
> >> +	cd /
> >> +	rm -f $tmp.*
> >> +}
> >> +
> >> +# get standard environment, filters and checks
> >> +. ./common/rc
> >> +. ./common/filter
> >> +
> >> +# remove previous $seqres.full before test
> >> +rm -f $seqres.full
> >> +
> >> +# real QA test starts here
> >> +
> >> +_supported_fs generic
> >> +_supported_os Linux
> >> +_require_test
> >> +_require_fssum
> >> +_require_scratch
> >> +_require_scratch_shutdown
> >> +_require_xfs_io_command "syncfs"
> >> +
> >> +
> >> +FCNT=1000
> >> +
> >> +_scratch_mkfs >/dev/null 2>&1
> >> +_scratch_mount
> >> +
> >> +# In order to mitigate interference of write-back,
> >> +# create many files for test.
> > 
> > Sorry, I still don't understand how writeback could interfere this test
> > from this comment, what happens if we don't create such files? Why
> > writing files starting from offset 1k?
> 
> There is no explicit explanation how writeback interferes this case,
> also there are many triggers make writeback starts syncing work.
> I just want to increase hit ratio of failure by make many test files,
> as many as possible, but it’s also limited by time and other resource.
> 
> The reason of offset 1k is same as above, compare to test a normal file, 
> I think file with hole can increase failure ratio sometimes.

Yeah, increasing the reproducibility would be a good reason too. Did you
happen to tune the number of files to see if 1000 is a good fit? E.g.
with 100 files the test reproduced the overlay bug 20% of the time, with
1000 files the reproducibility increased to 80%, etc. And the hole at the
beginning too: what's its actual impact on the reproducibility?

And you're right about the test time. Usually we want to balance test
time against reproducibility, so we need to tune and measure numbers
like test file counts, loop counts etc.

I think these would all make good comments for the test :)

Thanks,
Eryu
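
One way to measure the reproducibility numbers asked about above is to run the test repeatedly and count failures. The sketch below is an assumption-laden illustration: count_failures is a hypothetical helper, and it assumes the command under test exits non-zero on failure.

```shell
#!/bin/bash
# count_failures RUNS CMD [ARGS...]
# Hypothetical helper: run CMD the given number of times and print how
# many runs exited non-zero.
count_failures()
{
	local runs=$1 fails=0 i
	shift
	for ((i = 0; i < runs; i++)); do
		"$@" > /dev/null 2>&1 || fails=$((fails + 1))
	done
	echo "$fails"
}

# Stand-in commands for demonstration; in an fstests checkout the command
# might be "./check generic/470" with FCNT adjusted between measurements.
count_failures 5 false
count_failures 5 true
```

Dividing the failure count by the number of runs for each FCNT value gives the kind of percentage discussed above.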
Chengguang Xu Dec. 7, 2017, 7:42 a.m. UTC | #7
On Dec 7, 2017, at 3:13 PM, Eryu Guan <eguan@redhat.com> wrote:
> 
> On Thu, Dec 07, 2017 at 02:20:26PM +0800, Chengguang Xu wrote:
>>> 
>>> On Dec 7, 2017, at 1:44 PM, Eryu Guan <eguan@redhat.com> wrote:
>>> 
>>> On Thu, Dec 07, 2017 at 10:22:07AM +0800, Chengguang Xu wrote:
>>>> Inspired by syncfs bug of overlayfs which does not sync dirtyinodes in
>>>> underlying filesystem.
>>>> Run syncfs and shutdown filesystem(or underlying filesystem of overlayfs)
>>>> to check syncfs result.
>>>> 
>>>> Signed-off-by: Chengguang Xu <cgxu519@icloud.com>
>>>> ---
>>>> 
>>>> Changes since v1:
>>>> Use fs shutdown and fssum to check syncfs result instead of
>>>> checking delalloc state of extents.
>>>> 
>>>> tests/generic/470     | 88 +++++++++++++++++++++++++++++++++++++++++++++++++++
>>>> tests/generic/470.out |  2 ++
>>>> tests/generic/group   |  1 +
>>>> 3 files changed, 91 insertions(+)
>>>> create mode 100755 tests/generic/470
>>>> create mode 100644 tests/generic/470.out
>>>> 
>>>> diff --git a/tests/generic/470 b/tests/generic/470
>>>> new file mode 100755
>>>> index 0000000..b488747
>>>> --- /dev/null
>>>> +++ b/tests/generic/470
>>>> @@ -0,0 +1,88 @@
>>>> +#! /bin/bash
>>>> +# FS QA Test 470
>>>> +#
>>>> +# Inspired by syncfs bug of overlayfs which does not sync dirty inodes in
>>>> +# underlying filesystem. 
>>> 
>>> Trailing whitespace in above line.
>>> 
>>>> +#
>>>> +# Run syncfs and shutdown filesystem(or underlying filesystem of overlayfs)
>>>> +# to check syncfs result.
>>>> +#
>>>> +# Test will be skipped if filesystem(or underlying filesystem of overlayfs)
>>>> +# does not support shutdown.
>>>> +#
>>>> +#-----------------------------------------------------------------------
>>>> +# Copyright (c) 2017 Chengguang Xu <cgxu519@icloud.com>
>>>> +# All Rights Reserved.
>>>> +#
>>>> +# This program is free software; you can redistribute it and/or
>>>> +# modify it under the terms of the GNU General Public License as
>>>> +# published by the Free Software Foundation.
>>>> +#
>>>> +# This program is distributed in the hope that it would be useful,
>>>> +# but WITHOUT ANY WARRANTY; without even the implied warranty of
>>>> +# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
>>>> +# GNU General Public License for more details.
>>>> +#
>>>> +# You should have received a copy of the GNU General Public License
>>>> +# along with this program; if not, write the Free Software Foundation,
>>>> +# Inc.,  51 Franklin St, Fifth Floor, Boston, MA  02110-1301  USA
>>>> +#-----------------------------------------------------------------------
>>>> +#
>>>> +
>>>> +seq=`basename $0`
>>>> +seqres=$RESULT_DIR/$seq
>>>> +echo "QA output created by $seq"
>>>> +
>>>> +here=`pwd`
>>>> +tmp=/tmp/$$
>>>> +status=1
>>>> +trap "_cleanup; exit \$status" 0 1 2 3 15
>>>> +
>>>> +_cleanup()
>>>> +{
>>>> +	cd /
>>>> +	rm -f $tmp.*
>>>> +}
>>>> +
>>>> +# get standard environment, filters and checks
>>>> +. ./common/rc
>>>> +. ./common/filter
>>>> +
>>>> +# remove previous $seqres.full before test
>>>> +rm -f $seqres.full
>>>> +
>>>> +# real QA test starts here
>>>> +
>>>> +_supported_fs generic
>>>> +_supported_os Linux
>>>> +_require_test
>>>> +_require_fssum
>>>> +_require_scratch
>>>> +_require_scratch_shutdown
>>>> +_require_xfs_io_command "syncfs"
>>>> +
>>>> +
>>>> +FCNT=1000
>>>> +
>>>> +_scratch_mkfs >/dev/null 2>&1
>>>> +_scratch_mount
>>>> +
>>>> +# In order to mitigate interference of write-back,
>>>> +# create many files for test.
>>> 
>>> Sorry, I still don't understand how writeback could interfere this test
>>> from this comment, what happens if we don't create such files? Why
>>> writing files starting from offset 1k?
>> 
>> There is no explicit explanation how writeback interferes this case,
>> also there are many triggers make writeback starts syncing work.
>> I just want to increase hit ratio of failure by make many test files,
>> as many as possible, but it’s also limited by time and other resource.
>> 
>> The reason of offset 1k is same as above, compare to test a normal file, 
>> I think file with hole can increase failure ratio sometimes.
> 
> Yeah, increasing the reproducibility would be a good reason too. Do you
> happen to tune the number of files to see if 1000 is a good fit? e.g.
> with 100 files test reproduced the overlay bug 20% of times, with 1000
> files the reproducibility increased to 80%, etc. And the hole in the
> beginning too, what's the actual impact on the reproducibility?
> 
> And you're right about the test time, usually we want to balance between
> test time and reproducibility too, so we need to tune and measure the
> numbers like test files, loop counts etc.
> 
> I think these are all good comments for test :)

I didn’t do any careful testing of reproducibility, and I don’t plan to.
Actually, in my test environment the bug is always reproducible, even with
only a few test files. 1000 is just my idea of "quite a lot of files", and
it’s really hard, and not very meaningful, to guess which number best suits
the variety of test environments. So if you think 1000 is not a suitable
number, I can change it to whatever number you think is right. The same
goes for the hole.

> 
> Thanks,
> Eryu
> --
> To unsubscribe from this list: send the line "unsubscribe linux-unionfs" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

--
To unsubscribe from this list: send the line "unsubscribe fstests" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Amir Goldstein Dec. 7, 2017, 8:17 a.m. UTC | #8
On Thu, Dec 7, 2017 at 9:42 AM, Chengguang Xu <cgxu519@icloud.com> wrote:
> On Dec 7, 2017, at 3:13 PM, Eryu Guan <eguan@redhat.com> wrote:
>>
>> On Thu, Dec 07, 2017 at 02:20:26PM +0800, Chengguang Xu wrote:
>>>>
>>>> On Dec 7, 2017, at 1:44 PM, Eryu Guan <eguan@redhat.com> wrote:
>>>>
>>>> On Thu, Dec 07, 2017 at 10:22:07AM +0800, Chengguang Xu wrote:
>>>>> [...]
>>>>> +# In order to mitigate interference of write-back,
>>>>> +# create many files for test.
>>>>
>>>> Sorry, I still don't understand how writeback could interfere this test
>>>> from this comment, what happens if we don't create such files? Why
>>>> writing files starting from offset 1k?
>>>
>>> There is no explicit explanation how writeback interferes this case,
>>> also there are many triggers make writeback starts syncing work.
>>> I just want to increase hit ratio of failure by make many test files,
>>> as many as possible, but it’s also limited by time and other resource.
>>>
>>> The reason of offset 1k is same as above, compare to test a normal file,
>>> I think file with hole can increase failure ratio sometimes.
>>
>> Yeah, increasing the reproducibility would be a good reason too. Do you
>> happen to tune the number of files to see if 1000 is a good fit? e.g.
>> with 100 files test reproduced the overlay bug 20% of times, with 1000
>> files the reproducibility increased to 80%, etc. And the hole in the
>> beginning too, what's the actual impact on the reproducibility?
>>
>> And you're right about the test time, usually we want to balance between
>> test time and reproducibility too, so we need to tune and measure the
>> numbers like test files, loop counts etc.
>>
>> I think these are all good comments for test :)
>
> I didn’t do much accurate testing about reproducibility, and also don’t have
> a plan to do that. Actually in my testing ENV, the BUG is always reproducible
> even for only a few of test files.

What is the underlying fs you are testing with?

> 1000 is just my definition for
> quite many files, and it’s really hard and meaningless to guess what number is
> best suit for variety of testing ENVs. So if you think the number 1000 is
> improper number, then I can modify it to right number that you think. Hole is
> the same.
>

I agree with Eryu that you should not invent numbers, unless the test completes
in a few seconds and reproduces reliably - then you can invent numbers...

But this got me thinking about the details of the overlayfs syncfs bug.
Overlayfs syncfs *will* actually call the underlying fs syncfs (I fixed that)
but *will not* flush dirty inode data. What that means depends on
the underlying fs. For ext4 with the default data=ordered, syncfs will
commit uncommitted journal transactions to disk, and that will force data
writeback for all inodes whose *metadata* is modified in those transactions.

Ted, please correct me if I am wrong.

For xfs, the situation is a bit different: create operations are also delayed,
so your test could fail on overlayfs over xfs more easily.

I think that means that if you create the files and write them in the same
transaction, syncfs *will* actually sync the inodes' data, so in any case it is
only the very last files you write that won't be flushed, no matter how
many files you write.
What you should try in order to increase the chances of hitting the bug on
more filesystems:
- create new files and truncate them to final size but leaving them sparse
- sync
- buffered write to all files
- syncfs
- shutdown
- cycle_mount
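
The first four steps above could be sketched roughly as follows. This is only
an illustration that runs unprivileged in a temporary directory, not the
actual test: the file count, file size and names are hypothetical, `sync -f`
from coreutils stands in for `xfs_io -c syncfs`, and the shutdown and mount
cycle steps are left out because they need root and a scratch device.

```shell
#!/bin/sh
# Sketch only: in a real fstests case the directory would be $SCRATCH_MNT.
set -e

testdir=$(mktemp -d)
trap 'rm -rf "$testdir"' EXIT

nfiles=4        # hypothetical, deliberately small
fsize=65536     # hypothetical final file size in bytes

# 1. create new files and truncate them to final size, leaving them sparse
i=0
while [ "$i" -lt "$nfiles" ]; do
	truncate -s "$fsize" "$testdir/file.$i"
	i=$((i + 1))
done

# 2. sync, so that no inode is left dirty before the real writes
sync

# 3. buffered write into every file, without changing its size
i=0
while [ "$i" -lt "$nfiles" ]; do
	printf 'data-%s' "$i" | dd of="$testdir/file.$i" conv=notrunc 2>/dev/null
	i=$((i + 1))
done

# 4. syncfs on the filesystem holding the files (fall back to a plain
#    sync if this sync implementation has no -f option)
sync -f "$testdir" 2>/dev/null || sync

# 5. shutdown and 6. cycle mount are intentionally omitted here.
```

Because the files already have their final size before the second step, the
buffered writes change data but not file size, which is the point of the
suggestion.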

The rules for choosing the right amount of files/data should be:
- After the first sync, no inode in the system is dirty
- During the time it takes to write all the files, the flusher thread may kick
  in (default 30 seconds), flush the data of your test files and interfere with
  the test. Let's rule out another user doing sync, because most test
  machines are VMs that just run the tests
- If one loop iteration of the test above takes less than X seconds for a
  valid fs on a slow disk, then if the flusher thread does interfere, its
  interference will be over in less than X seconds
  (because we are the only ones dirtying data on the system).
- If X < 10 seconds, then running 2 or 3 loops of the iteration should
  be enough to guarantee that we run at least one iteration without
  interference.
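
A rough way to check the timing assumption in the rules above is to time each
loop iteration. In this sketch `run_iteration` is a hypothetical placeholder
for one write/syncfs/shutdown/verify cycle, and the loop count and bound are
simply the numbers from the rules, not measured values.

```shell
#!/bin/sh
# Placeholder for one test iteration; a real test would write the files,
# call syncfs, shut the filesystem down and verify the data here.
run_iteration() {
	:
}

loops=3         # "2 or 3 loops" from the rules above
max_secs=10     # the "X < 10 seconds" bound from the rules above
slow=0          # number of iterations that exceeded the bound

n=1
while [ "$n" -le "$loops" ]; do
	start=$(date +%s)
	run_iteration
	elapsed=$(( $(date +%s) - start ))
	echo "iteration $n took ${elapsed}s"
	# an iteration slower than the bound weakens the argument above
	[ "$elapsed" -lt "$max_secs" ] || slow=$((slow + 1))
	n=$((n + 1))
done
echo "slow iterations: $slow"
```

If `slow` ends up non-zero on a slow disk, that suggests the amount of data
per iteration should be reduced rather than the loop count increased.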

So I think you should actually write as few files and as little data as
possible, but loop several times.
IMO, tune the parameters so that an iteration takes ~1 second on a slow disk
and run 3 iterations.
I have a test setup with a spinning disk; I can run your test on xfs/ext4 if
you would like more samples than your own test environment provides.

And please write comments about those heuristics...

Thanks,
Amir.
Chengguang Xu Dec. 11, 2017, 10:03 a.m. UTC | #9
> 
> On Dec 7, 2017, at 4:17 PM, Amir Goldstein <amir73il@gmail.com> wrote:
> 
> On Thu, Dec 7, 2017 at 9:42 AM, Chengguang Xu <cgxu519@icloud.com> wrote:
>> On Dec 7, 2017, at 3:13 PM, Eryu Guan <eguan@redhat.com> wrote:
>>> 
>>> On Thu, Dec 07, 2017 at 02:20:26PM +0800, Chengguang Xu wrote:
>>>>> 
>>>>> On Dec 7, 2017, at 1:44 PM, Eryu Guan <eguan@redhat.com> wrote:
>>>>> 
>>>>> On Thu, Dec 07, 2017 at 10:22:07AM +0800, Chengguang Xu wrote:
>>>>>> [...]
>>>>>> +# In order to mitigate interference of write-back,
>>>>>> +# create many files for test.
>>>>> 
>>>>> Sorry, I still don't understand how writeback could interfere this test
>>>>> from this comment, what happens if we don't create such files? Why
>>>>> writing files starting from offset 1k?
>>>> 
>>>> There is no explicit explanation how writeback interferes this case,
>>>> also there are many triggers make writeback starts syncing work.
>>>> I just want to increase hit ratio of failure by make many test files,
>>>> as many as possible, but it’s also limited by time and other resource.
>>>> 
>>>> The reason of offset 1k is same as above, compare to test a normal file,
>>>> I think file with hole can increase failure ratio sometimes.
>>> 
>>> Yeah, increasing the reproducibility would be a good reason too. Do you
>>> happen to tune the number of files to see if 1000 is a good fit? e.g.
>>> with 100 files test reproduced the overlay bug 20% of times, with 1000
>>> files the reproducibility increased to 80%, etc. And the hole in the
>>> beginning too, what's the actual impact on the reproducibility?
>>> 
>>> And you're right about the test time, usually we want to balance between
>>> test time and reproducibility too, so we need to tune and measure the
>>> numbers like test files, loop counts etc.
>>> 
>>> I think these are all good comments for test :)
>> 
>> I didn’t do much accurate testing about reproducibility, and also don’t have
>> a plan to do that. Actually in my testing ENV, the BUG is always reproducible
>> even for only a few of test files.
> 
> What is the underlying fs you are testing with?

I usually use xfs as the underlying fs.

> 
>> 1000 is just my definition for
>> quite many files, and it’s really hard and meaningless to guess what number is
>> best suit for variety of testing ENVs. So if you think the number 1000 is
>> improper number, then I can modify it to right number that you think. Hole is
>> the same.
>> 
> 
> I agree with Eryu that you should not invent numbers, unless test completes
> in a few seconds and reproduces reliably - then you can invent numbers...
> 
> But this got me thinking about the details of overlayfs syncfs bug.
> Overlayfs syncfs *will* actually call underlying fs syncfs (I fixed that)
> but *will not* flush dirty inode data. What that means depends on
> the underlying fs. For ext4 with default jounal=ordered, syncfs will
> commit uncommitted journal transactions to disk, that will force data writeback
> for all inodes, whose *metadata* is modified in uncommitted transactions.
> 
> Ted, please correct me if I am wrong.
> 
> For xfs, situation is a bit different, create operations are also delayed,
> so your test could fail on overlayfs over xfs more easily.
> 
> I think that means that if you create the files and write them in the same
> transaction, syncfs *will* actually sync on inodes data, so it is anyway
> only the very last files that you write that won't be flushed no matter how
> many files you will write.
> What you should try to do to increase the changes of the bug on more fs:
> - create new files and truncate them to final size but leaving them sparse
> - sync
> - buffered write to all files
> - syncfs
> - shutdown
> - cycle_mount
> 
> The rules for choosing the right amount of files/data should be:
> - After first sync, all inodes in the system is not dirty
> - During the time it takes to write all files, flusher thread may kick in
>  (default 30 seconds), flush data of your tests files and interfere with
>  the test. Lets rule out another user doing sync, because most test
>  machines are VMs that just run the tests
> - If one loop iteration of the test above takes less than X second for a
>  valid fs on a slow disk, then if flusher thread does interfere, its
>  interference will be over in less then X second
>  (because we are the only ones dirtying data on the system).
> - If X < 10 seconds, then running 2 or 3 loops of the iteration should
>  be enough to guaranty that we run at least one iteration without
>  interference.

I did more detailed tests with the three different data modes of ext4 and found
that the overlayfs syncfs bug is reproducible with data=ordered and
data=writeback, but with data=journal the data is flushed and correct. I wrote
only a few words to a single file, and the bug is always reproducible in my
test environment.

As for writeback interference, AFAIK it comes from the dirty ratio and the
periodic flush. If we drop all dirty caches and sync before the test, I think
we can avoid interference from it.

So if nothing else can interfere with the test result,
I would just like to change the test to write a single small file as the test target.

Am I missing anything?

> 
> So I think if you actually write as little files and data as possible,
> but loop several times.
> IMO, tune the parameters, so an iteration takes ~1 second on slow disk
> and run 3 iterations.
> I have a test setup with spinning disk I can run your test on xfs/ext4 if
> you like more samples then your own test environment.
> 
> And please write comments about those heuristics...
> 
> Thanks,
> Amir.

Amir Goldstein Dec. 11, 2017, 10:46 a.m. UTC | #10
On Mon, Dec 11, 2017 at 12:03 PM, Chengguang Xu <cgxu519@icloud.com> wrote:
>>
>> On Dec 7, 2017, at 4:17 PM, Amir Goldstein <amir73il@gmail.com> wrote:
[...]
>
> I did more detail tests for three different data modes of ext4 and found
> the overlayfs syncfs bug is reproducible on data=ordered and data=writeback,
> but on data=journal mode, data is flushed and correct.

That is expected, because overlayfs does call upper's sync_fs() method and
for data=journal that will flush all dirty pages as well.

> I only wrote only a few words
> to a single file and the bug is always reproducible on my test environment.
>
> For writeback interferences, AFAIK, from dirty ratio and period.
> If we drop all dirty caches & sync before the test, I think we can
> avoid interference from it.
>

Why? Does either drop_caches or sync() reset the flusher thread's
periodic flush of dirty pages?

> So if we don’t have anything else to interference test result,
> I just want to modify to write a small single file as test target.
>
> Am I missing anything?

I think the chance of the flusher thread interfering with the test and
flushing the dirty page you wrote before _scratch_shutdown exists,
but it is small enough that we can neglect it and keep the test as simple
as possible.

Amir.
Chengguang Xu Dec. 11, 2017, 12:33 p.m. UTC | #11
> 
> On Dec 11, 2017, at 6:46 PM, Amir Goldstein <amir73il@gmail.com> wrote:
> 
> On Mon, Dec 11, 2017 at 12:03 PM, Chengguang Xu <cgxu519@icloud.com> wrote:
>>> 
>>> On Dec 7, 2017, at 4:17 PM, Amir Goldstein <amir73il@gmail.com> wrote:
> [...]
>> 
>> I did more detail tests for three different data modes of ext4 and found
>> the overlayfs syncfs bug is reproducible on data=ordered and data=writeback,
>> but on data=journal mode, data is flushed and correct.
> 
> That is expected, because overlayfs does call upper's sync_fs() method and
> for journal=data that will flush all dirty pages as well.
> 
>> I only wrote only a few words
>> to a single file and the bug is always reproducible on my test environment.
>> 
>> For writeback interferences, AFAIK, from dirty ratio and period.
>> If we drop all dirty caches & sync before the test, I think we can
>> avoid interference from it.
>> 
> 
> Why? does either drop_caches or sync() reset the flusher thread
> periodic flush dirty pages?

Sorry, we have to finish the test within 30 seconds after we write the test
file; otherwise it may be affected by the background flusher.


> 
>> So if we don’t have anything else to interference test result,
>> I just want to modify to write a small single file as test target.
>> 
>> Am I missing anything?
> 
> I think the chance of flusher thread interfering the test and
> flushing the dirty page you wrote before _scratch_shutdown exists,
> but is small enough so we can neglect it and keep the test as simple
> as possible.

As far as I know, in the normal case the flusher thread checks for expired
dirty inodes every 5 seconds (default) and flushes a dirty inode once it has
been dirty for 30 seconds (default). If we can finish the test within 30s of
starting it, it would be OK.
What do you think?


> 
> Amir.

Amir Goldstein Dec. 11, 2017, 12:44 p.m. UTC | #12
On Mon, Dec 11, 2017 at 2:33 PM, Chengguang Xu <cgxu519@icloud.com> wrote:
>>
>> On Dec 11, 2017, at 6:46 PM, Amir Goldstein <amir73il@gmail.com> wrote:
>>
>> On Mon, Dec 11, 2017 at 12:03 PM, Chengguang Xu <cgxu519@icloud.com> wrote:
>>>>
>>>> On Dec 7, 2017, at 4:17 PM, Amir Goldstein <amir73il@gmail.com> wrote:
>> [...]
>>>
>>> I did more detail tests for three different data modes of ext4 and found
>>> the overlayfs syncfs bug is reproducible on data=ordered and data=writeback,
>>> but on data=journal mode, data is flushed and correct.
>>
>> That is expected, because overlayfs does call upper's sync_fs() method and
>> for journal=data that will flush all dirty pages as well.
>>
>>> I only wrote only a few words
>>> to a single file and the bug is always reproducible on my test environment.
>>>
>>> For writeback interferences, AFAIK, from dirty ratio and period.
>>> If we drop all dirty caches & sync before the test, I think we can
>>> avoid interference from it.
>>>
>>
>> Why? does either drop_caches or sync() reset the flusher thread
>> periodic flush dirty pages?
>
> Sorry,We have to finish test in 30 seconds after we write test file,
> otherwise may be affected by background flusher.
>
>
>>
>>> So if we don’t have anything else to interference test result,
>>> I just want to modify to write a small single file as test target.
>>>
>>> Am I missing anything?
>>
>> I think the chance of flusher thread interfering the test and
>> flushing the dirty page you wrote before _scratch_shutdown exists,
>> but is small enough so we can neglect it and keep the test as simple
>> as possible.
>
> As I know, in normal case flusher thread check dirty inode expiring every 5 seconds(default) and flush dirty
> inode when expires 30 seconds(default). If we can finish test in 30s after running test, it would be OK.
> What do you think?
>

5 seconds is the ext4 default journal commit interval; this is when dirty
metadata will be flushed.
30 seconds is the generic flusher thread interval.
What if the test started 29 seconds after the last flush?

I am saying that the chances of interference from the flusher thread are quite
small if the test is very quick.

If you create files and sync at the start of the test, overlayfs syncfs will
call ext4 sync_fs and that will have nothing to do, because no metadata is
dirty, so the test will be quick and we can neglect that chance of interference.

*If* you wish to reduce that chance of interference, loop the test twice, but
I don't think that's a must.

Amir.
Chengguang Xu Dec. 11, 2017, 1:20 p.m. UTC | #13
> 
> On Dec 11, 2017, at 8:44 PM, Amir Goldstein <amir73il@gmail.com> wrote:
> 
> On Mon, Dec 11, 2017 at 2:33 PM, Chengguang Xu <cgxu519@icloud.com> wrote:
>>> 
>>> On Dec 11, 2017, at 6:46 PM, Amir Goldstein <amir73il@gmail.com> wrote:
>>> 
>>> On Mon, Dec 11, 2017 at 12:03 PM, Chengguang Xu <cgxu519@icloud.com> wrote:
>>>>> 
>>>>> On Dec 7, 2017, at 4:17 PM, Amir Goldstein <amir73il@gmail.com> wrote:
>>> [...]
>>>> 
>>>> I did more detail tests for three different data modes of ext4 and found
>>>> the overlayfs syncfs bug is reproducible on data=ordered and data=writeback,
>>>> but on data=journal mode, data is flushed and correct.
>>> 
>>> That is expected, because overlayfs does call upper's sync_fs() method and
>>> for journal=data that will flush all dirty pages as well.
>>> 
>>>> I only wrote only a few words
>>>> to a single file and the bug is always reproducible on my test environment.
>>>> 
>>>> For writeback interferences, AFAIK, from dirty ratio and period.
>>>> If we drop all dirty caches & sync before the test, I think we can
>>>> avoid interference from it.
>>>> 
>>> 
>>> Why? does either drop_caches or sync() reset the flusher thread
>>> periodic flush dirty pages?
>> 
>> Sorry,We have to finish test in 30 seconds after we write test file,
>> otherwise may be affected by background flusher.
>> 
>> 
>>> 
>>>> So if we don’t have anything else to interference test result,
>>>> I just want to modify to write a small single file as test target.
>>>> 
>>>> Am I missing anything?
>>> 
>>> I think the chance of flusher thread interfering the test and
>>> flushing the dirty page you wrote before _scratch_shutdown exists,
>>> but is small enough so we can neglect it and keep the test as simple
>>> as possible.
>> 
>> As I know, in normal case flusher thread check dirty inode expiring every 5 seconds(default) and flush dirty
>> inode when expires 30 seconds(default). If we can finish test in 30s after running test, it would be OK.
>> What do you think?
>> 
> 
> 5 seconds is ext4 default journal commit interval. this is when dirty
> metadata will be flushed.
> 30 is generic flusher thread interval.
> What if test started 29 seconds after last flush?

Maybe we have a misunderstanding about the flushing intervals.
I am talking about the intervals under /proc/sys/vm that control
writeback behavior; they do not seem to be related to any specific filesystem
such as ext4.

dirty_expire_interval
default:3000(millisecond)

dirty_writeback_interval
default:500(millisecond)

Anyway, let me modify the test case based on our discussion.


> 
> I am saying that the chances of interference from flusher thread are quite
> if the test is very quick.
> 
> If you create files and sync at the start of the test, overlayfs
> syncfs will call
> ext4 sync_fs and that will have nothing to do, because no metadata is dirty,
> so test will be quick and we can neglect that change of interference.
> 
> *If* you wish to reduce that chance for interference loop the test twice, but
> I don't think that's a must.
> 
> Amir.

Chengguang Xu Dec. 11, 2017, 2:31 p.m. UTC | #14
> On Dec 11, 2017, at 9:20 PM, Chengguang Xu <cgxu519@icloud.com> wrote:
> 
>> 
>> On Dec 11, 2017, at 8:44 PM, Amir Goldstein <amir73il@gmail.com> wrote:
>> 
>> On Mon, Dec 11, 2017 at 2:33 PM, Chengguang Xu <cgxu519@icloud.com> wrote:
>>>> 
>>>> On Dec 11, 2017, at 6:46 PM, Amir Goldstein <amir73il@gmail.com> wrote:
>>>> 
>>>> On Mon, Dec 11, 2017 at 12:03 PM, Chengguang Xu <cgxu519@icloud.com> wrote:
>>>>>> 
>>>>>> On Dec 7, 2017, at 4:17 PM, Amir Goldstein <amir73il@gmail.com> wrote:
>>>> [...]
>>>>> 
>>>>> I did more detail tests for three different data modes of ext4 and found
>>>>> the overlayfs syncfs bug is reproducible on data=ordered and data=writeback,
>>>>> but on data=journal mode, data is flushed and correct.
>>>> 
>>>> That is expected, because overlayfs does call upper's sync_fs() method and
>>>> for journal=data that will flush all dirty pages as well.
>>>> 
>>>>> I only wrote only a few words
>>>>> to a single file and the bug is always reproducible on my test environment.
>>>>> 
>>>>> For writeback interferences, AFAIK, from dirty ratio and period.
>>>>> If we drop all dirty caches & sync before the test, I think we can
>>>>> avoid interference from it.
>>>>> 
>>>> 
>>>> Why? does either drop_caches or sync() reset the flusher thread
>>>> periodic flush dirty pages?
>>> 
>>> Sorry,We have to finish test in 30 seconds after we write test file,
>>> otherwise may be affected by background flusher.
>>> 
>>> 
>>>> 
>>>>> So if we don’t have anything else to interference test result,
>>>>> I just want to modify to write a small single file as test target.
>>>>> 
>>>>> Am I missing anything?
>>>> 
>>>> I think the chance of flusher thread interfering the test and
>>>> flushing the dirty page you wrote before _scratch_shutdown exists,
>>>> but is small enough so we can neglect it and keep the test as simple
>>>> as possible.
>>> 
>>> As I know, in normal case flusher thread check dirty inode expiring every 5 seconds(default) and flush dirty
>>> inode when expires 30 seconds(default). If we can finish test in 30s after running test, it would be OK.
>>> What do you think?
>>> 
>> 
>> 5 seconds is ext4 default journal commit interval. this is when dirty
>> metadata will be flushed.
>> 30 is generic flusher thread interval.
>> What if test started 29 seconds after last flush?
> 
> Maybe we have misunderstanding about the flushing intervals.
> I’m saying the intervals which are under /proc/sys/vm to control 
> writeback behaviors, it seems not related to any specific filesystem 
> like ext4 or others.
> 
> dirty_expire_interval
> default:3000(millisecond)
> 
> dirty_writeback_interval
> default:500(millisecond)

Correction:

dirty_expire_interval
default:3000(centiseconds)

dirty_writeback_interval
default:500(centiseconds)
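
For reference, a minimal sketch of how these two defaults could be checked on a test machine. It assumes the tunables are exposed as /proc/sys/vm/dirty_expire_centisecs and /proc/sys/vm/dirty_writeback_centisecs; the helper names centisecs_to_secs and show_tunable are made up for illustration.

```shell
#!/bin/bash
# Convert a centisecond value (the unit of the vm writeback tunables)
# to whole seconds. Hypothetical helper, not part of xfstests.
centisecs_to_secs() {
	echo $(( $1 / 100 ))
}

# Print one writeback tunable in centiseconds and seconds,
# if this kernel exposes it under /proc/sys/vm.
show_tunable() {
	local f=/proc/sys/vm/$1
	if [ -r "$f" ]; then
		local v
		v=$(cat "$f")
		echo "$1: ${v} centisecs = $(centisecs_to_secs "$v")s"
	else
		echo "$1: not available"
	fi
}

show_tunable dirty_expire_centisecs
show_tunable dirty_writeback_centisecs
```

With the defaults quoted above, 3000 centiseconds is 30 seconds and 500 centiseconds is 5 seconds, which matches the 30s/5s figures used earlier in the thread.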


> 
> Anyway, let me modify test case base on our discussion.
> 
> 
>> 
>> I am saying that the chances of interference from flusher thread are quite
>> small if the test is very quick.
>> 
>> If you create files and sync at the start of the test, overlayfs
>> syncfs will call
>> ext4 sync_fs and that will have nothing to do, because no metadata is dirty,
>> so test will be quick and we can neglect that chance of interference.
>> 
>> *If* you wish to reduce that chance for interference loop the test twice, but
>> I don't think that's a must.
>> 
>> Amir.
> 
Amir Goldstein Dec. 11, 2017, 2:47 p.m. UTC | #15
On Mon, Dec 11, 2017 at 4:31 PM, Chengguang Xu <cgxu519@icloud.com> wrote:
>
>> On Dec 11, 2017, at 9:20 PM, Chengguang Xu <cgxu519@icloud.com> wrote:
>>
>>>
>>> On Dec 11, 2017, at 8:44 PM, Amir Goldstein <amir73il@gmail.com> wrote:
>>>
>>> On Mon, Dec 11, 2017 at 2:33 PM, Chengguang Xu <cgxu519@icloud.com> wrote:
>>>>>
>>>>> On Dec 11, 2017, at 6:46 PM, Amir Goldstein <amir73il@gmail.com> wrote:
>>>>>
>>>>> On Mon, Dec 11, 2017 at 12:03 PM, Chengguang Xu <cgxu519@icloud.com> wrote:
>>>>>>>
>>>>>>> On Dec 7, 2017, at 4:17 PM, Amir Goldstein <amir73il@gmail.com> wrote:
>>>>> [...]
>>>>>>
>>>>>> I did more detail tests for three different data modes of ext4 and found
>>>>>> the overlayfs syncfs bug is reproducible on data=ordered and data=writeback,
>>>>>> but on data=journal mode, data is flushed and correct.
>>>>>
>>>>> That is expected, because overlayfs does call upper's sync_fs() method and
>>>>> for data=journal that will flush all dirty pages as well.
>>>>>
>>>>>> I only wrote only a few words
>>>>>> to a single file and the bug is always reproducible on my test environment.
>>>>>>
>>>>>> For writeback interferences, AFAIK, from dirty ratio and period.
>>>>>> If we drop all dirty caches & sync before the test, I think we can
>>>>>> avoid interference from it.
>>>>>>
>>>>>
>>>>> Why? does either drop_caches or sync() reset the flusher thread
>>>>> periodic flush dirty pages?
>>>>
>>>> Sorry,We have to finish test in 30 seconds after we write test file,
>>>> otherwise may be affected by background flusher.
>>>>
>>>>
>>>>>
>>>>>> So if we don’t have anything else to interference test result,
>>>>>> I just want to modify to write a small single file as test target.
>>>>>>
>>>>>> Am I missing anything?
>>>>>
>>>>> I think the chance of flusher thread interfering the test and
>>>>> flushing the dirty page you wrote before _scratch_shutdown exists,
>>>>> but is small enough so we can neglect it and keep the test as simple
>>>>> as possible.
>>>>
>>>> As I know, in normal case flusher thread check dirty inode expiring every 5 seconds(default) and flush dirty
>>>> inode when expires 30 seconds(default). If we can finish test in 30s after running test, it would be OK.
>>>> What do you think?
>>>>
>>>
>>> 5 seconds is ext4 default journal commit interval. this is when dirty
>>> metadata will be flushed.
>>> 30 is generic flusher thread interval.
>>> What if test started 29 seconds after last flush?
>>
>> Maybe we have misunderstanding about the flushing intervals.
>> I’m saying the intervals which are under /proc/sys/vm to control
>> writeback behaviors, it seems not related to any specific filesystem
>> like ext4 or others.
>>
>> dirty_expire_interval
>> default:3000(millisecond)
>>
>> dirty_writeback_interval
>> default:500(millisecond)
>
> Correction:
>
> dirty_expire_interval
> default:3000(centiseconds)
>
> dirty_writeback_interval
> default:500(centiseconds)
>
>

Right. I was confusing the 2 different tunables.
I agree your test should be fine with simple small write after sync
in an isolated test environment.

Amir.
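
A rough sketch of the "simple small write after sync" shape agreed on above, for readers who want to see the order of operations. This is not the revised test. It assumes a coreutils `sync` that supports `-f` (syncfs on the containing filesystem), and the function name small_write_check and the file name testfile are hypothetical. The shutdown and remount steps need the xfstests helpers and are only marked by a comment, so run standalone the comparison trivially passes.

```shell
#!/bin/bash
# Sketch only: shows the ordering sync -> small write -> syncfs -> compare.
# small_write_check and "testfile" are hypothetical names.
small_write_check() {
	local dir=$1
	local f=$dir/testfile
	local before after

	sync                          # flush everything so nothing else is dirty
	echo "a few words" > "$f"     # small write, dirties a single inode
	before=$(md5sum < "$f")

	sync -f "$f"                  # syncfs on the filesystem containing $f
	# In the real test, _scratch_shutdown and _scratch_cycle_mount
	# would go here, so unsynced data would be lost before the check.

	after=$(md5sum < "$f")
	[ "$before" = "$after" ]
}
```

It could be tried with `small_write_check "$(mktemp -d)"`; inside xfstests the directory would be $SCRATCH_MNT.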
Dave Chinner Dec. 12, 2017, 12:18 a.m. UTC | #16
On Mon, Dec 11, 2017 at 02:44:50PM +0200, Amir Goldstein wrote:
> On Mon, Dec 11, 2017 at 2:33 PM, Chengguang Xu <cgxu519@icloud.com> wrote:
> >>
> >> On Dec 11, 2017, at 6:46 PM, Amir Goldstein <amir73il@gmail.com> wrote:
> >>
> >> On Mon, Dec 11, 2017 at 12:03 PM, Chengguang Xu <cgxu519@icloud.com> wrote:
> >>>>
> >>>> On Dec 7, 2017, at 4:17 PM, Amir Goldstein <amir73il@gmail.com> wrote:
> >> [...]
> >>>
> >>> I did more detail tests for three different data modes of ext4 and found
> >>> the overlayfs syncfs bug is reproducible on data=ordered and data=writeback,
> >>> but on data=journal mode, data is flushed and correct.
> >>
> >> That is expected, because overlayfs does call upper's sync_fs() method and
> >> for data=journal that will flush all dirty pages as well.
> >>
> >>> I only wrote only a few words
> >>> to a single file and the bug is always reproducible on my test environment.
> >>>
> >>> For writeback interferences, AFAIK, from dirty ratio and period.
> >>> If we drop all dirty caches & sync before the test, I think we can
> >>> avoid interference from it.
> >>>
> >>
> >> Why? does either drop_caches or sync() reset the flusher thread
> >> periodic flush dirty pages?
> >
> > Sorry,We have to finish test in 30 seconds after we write test file,
> > otherwise may be affected by background flusher.
> >
> >
> >>
> >>> So if we don’t have anything else to interference test result,
> >>> I just want to modify to write a small single file as test target.
> >>>
> >>> Am I missing anything?
> >>
> >> I think the chance of flusher thread interfering the test and
> >> flushing the dirty page you wrote before _scratch_shutdown exists,
> >> but is small enough so we can neglect it and keep the test as simple
> >> as possible.
> >
> > As I know, in normal case flusher thread check dirty inode expiring every 5 seconds(default) and flush dirty
> > inode when expires 30 seconds(default). If we can finish test in 30s after running test, it would be OK.
> > What do you think?
> >
> 
> 5 seconds is ext4 default journal commit interval. this is when dirty
> metadata will be flushed.
> 30 is generic flusher thread interval.
> What if test started 29 seconds after last flush?

Then the inode is only 1 second "old" when the flush occurs and so
it gets ignored. It'll get flushed only once it's been dirty for
longer than a flusher thread interval.

IIRC there's quite a few tests where we make the assumption we've
got at least 30s before the dirty cached data will be written by the
kernel. e.g. for shutting down the filesystem before data hits the
disk to test metadata vs data recovery state.

Cheers,

Dave.
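
The age rule described in the reply above can be illustrated with a tiny model. This is a simplification under stated assumptions, not the kernel's logic: it treats an inode as eligible for periodic writeback only when it has been dirty for longer than the expire interval (30s by default), and the function name inode_expired and its arguments are hypothetical.

```shell
#!/bin/bash
# Simplified model (assumption): a periodic flusher pass only writes an
# inode whose dirty age exceeds the expire interval, 30s by default.
# inode_expired is a hypothetical helper for illustration.
inode_expired() {
	local dirty_age=$1
	local expire=${2:-30}
	[ "$dirty_age" -gt "$expire" ]
}

# An inode dirtied 1s or 29s before a flusher pass is skipped;
# one dirtied 31s before is picked up.
for age in 1 29 31; do
	if inode_expired "$age"; then
		echo "dirty for ${age}s: eligible for periodic writeback"
	else
		echo "dirty for ${age}s: skipped by this flusher pass"
	fi
done
```

Under this model, a test that writes, calls syncfs and shuts the filesystem down within a few seconds is not affected by where the flusher happens to be in its cycle.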

Patch

diff --git a/tests/generic/470 b/tests/generic/470
new file mode 100755
index 0000000..b488747
--- /dev/null
+++ b/tests/generic/470
@@ -0,0 +1,88 @@ 
+#! /bin/bash
+# FS QA Test 470
+#
+# Inspired by syncfs bug of overlayfs which does not sync dirty inodes in
+# underlying filesystem. 
+#
+# Run syncfs and shutdown filesystem(or underlying filesystem of overlayfs)
+# to check syncfs result.
+#
+# Test will be skipped if filesystem(or underlying filesystem of overlayfs)
+# does not support shutdown.
+#
+#-----------------------------------------------------------------------
+# Copyright (c) 2017 Chengguang Xu <cgxu519@icloud.com>
+# All Rights Reserved.
+#
+# This program is free software; you can redistribute it and/or
+# modify it under the terms of the GNU General Public License as
+# published by the Free Software Foundation.
+#
+# This program is distributed in the hope that it would be useful,
+# but WITHOUT ANY WARRANTY; without even the implied warranty of
+# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+# GNU General Public License for more details.
+#
+# You should have received a copy of the GNU General Public License
+# along with this program; if not, write the Free Software Foundation,
+# Inc.,  51 Franklin St, Fifth Floor, Boston, MA  02110-1301  USA
+#-----------------------------------------------------------------------
+#
+
+seq=`basename $0`
+seqres=$RESULT_DIR/$seq
+echo "QA output created by $seq"
+
+here=`pwd`
+tmp=/tmp/$$
+status=1
+trap "_cleanup; exit \$status" 0 1 2 3 15
+
+_cleanup()
+{
+	cd /
+	rm -f $tmp.*
+}
+
+# get standard environment, filters and checks
+. ./common/rc
+. ./common/filter
+
+# remove previous $seqres.full before test
+rm -f $seqres.full
+
+# real QA test starts here
+
+_supported_fs generic
+_supported_os Linux
+_require_test
+_require_fssum
+_require_scratch
+_require_scratch_shutdown
+_require_xfs_io_command "syncfs"
+
+
+FCNT=1000
+
+_scratch_mkfs >/dev/null 2>&1
+_scratch_mount
+
+# In order to mitigate interference of write-back,
+# create many files for test.
+for i in `seq 1 $FCNT`; do
+	$XFS_IO_PROG -f -c "pwrite 1K 1M" \
+		$SCRATCH_MNT/$i >/dev/null 2>&1
+done
+
+$FSSUM_PROG -ugomAcdES -f -w $TEST_DIR/before_test $SCRATCH_MNT
+$XFS_IO_PROG -c "syncfs" $SCRATCH_MNT/${FCNT} >/dev/null 2>&1
+_scratch_shutdown
+_scratch_cycle_mount
+$FSSUM_PROG -ugomAcdES -f -w $TEST_DIR/after_test $SCRATCH_MNT
+diff $TEST_DIR/before_test $TEST_DIR/after_test
+if [ $? -eq 0 ]; then
+	status=0
+fi
+
+echo "Silence is golden"
+exit
diff --git a/tests/generic/470.out b/tests/generic/470.out
new file mode 100644
index 0000000..79fb532
--- /dev/null
+++ b/tests/generic/470.out
@@ -0,0 +1,2 @@ 
+QA output created by 470
+Silence is golden
diff --git a/tests/generic/group b/tests/generic/group
index 6c3bb03..493a44c 100644
--- a/tests/generic/group
+++ b/tests/generic/group
@@ -472,3 +472,4 @@ 
 467 auto quick exportfs
 468 shutdown auto quick metadata
 469 auto quick
+470 auto quick shutdown sync