Message ID | 20190829071731.11521-1-wqu@suse.com (mailing list archive)
---|---
State | New, archived
Series | btrfs: volumes: Allow missing devices to be writeable
On Thu, Aug 29, 2019 at 03:17:31PM +0800, Qu Wenruo wrote:
> [BUG]
> There is a long-standing bug that a degraded-mounted btrfs can allocate new
> SINGLE/DUP chunks on a RAID1 fs:
>
> #!/bin/bash
>
> dev1=/dev/test/scratch1
> dev2=/dev/test/scratch2
> mnt=/mnt/btrfs
>
> umount $mnt &> /dev/null
> umount $dev1 &> /dev/null
> umount $dev2 &> /dev/null
>
> dmesg -C
> mkfs.btrfs -f -m raid1 -d raid1 $dev1 $dev2
>
> wipefs -fa $dev2
>
> mount -o degraded $dev1 $mnt
> btrfs balance start --full $mnt
> umount $mnt
> echo "=== chunk after degraded mount ==="
> btrfs ins dump-tree -t chunk $dev1 | grep stripe_len.*type
>
> The resulting fs will have chunks with SINGLE and DUP only:
>
> === chunk after degraded mount ===
> length 33554432 owner 2 stripe_len 65536 type SYSTEM
> length 1073741824 owner 2 stripe_len 65536 type DATA
> length 1073741824 owner 2 stripe_len 65536 type DATA|DUP
> length 219676672 owner 2 stripe_len 65536 type METADATA|DUP
> length 33554432 owner 2 stripe_len 65536 type SYSTEM|DUP
>
> This behavior greatly breaks the RAID1 tolerance.
>
> Even with the missing device replaced, if a device with DUP/SINGLE chunks
> on it goes missing, the whole fs can't be mounted RW any more.
> And we already have reports that users can't even mount the fs, as some
> essential tree blocks got written to those DUP chunks.
>
> [CAUSE]
> The cause is pretty simple: we treat missing devices as non-writable.
> Thus when we need to allocate chunks, we can only fall back to single
> device profiles (SINGLE and DUP).
>
> [FIX]
> Just consider the missing devices as WRITABLE, so we allocate new chunks
> on them to maintain the old profiles.

I'm not sure this is the best way to fix it; it makes the meaning of
rw_devices ambiguous. A missing device is by definition neither readable
nor writable.

This should be tracked separately, i.e. counting real devices that can
be written to, and devices that can be considered for allocation (with a
documented meaning that even missing devices are included).
On 2019/9/12 at 1:17 AM, David Sterba wrote:
> On Thu, Aug 29, 2019 at 03:17:31PM +0800, Qu Wenruo wrote:
>> [BUG]
>> There is a long-standing bug that a degraded-mounted btrfs can allocate new
>> SINGLE/DUP chunks on a RAID1 fs:
[...]
>> [CAUSE]
>> The cause is pretty simple: we treat missing devices as non-writable.
>> Thus when we need to allocate chunks, we can only fall back to single
>> device profiles (SINGLE and DUP).
>>
>> [FIX]
>> Just consider the missing devices as WRITABLE, so we allocate new chunks
>> on them to maintain the old profiles.
>
> I'm not sure this is the best way to fix it; it makes the meaning of
> rw_devices ambiguous. A missing device is by definition neither readable
> nor writable.
>
> This should be tracked separately, i.e. counting real devices that can
> be written to, and devices that can be considered for allocation (with a
> documented meaning that even missing devices are included).

Indeed this sounds much better. I'd go in that direction.

Thanks,
Qu
There is previous work [1].

[1] https://lore.kernel.org/linux-btrfs/1461812780-538-1-git-send-email-anand.jain@oracle.com/

I guess it was on purpose that a missing device is not part of chunk
allocation, so as to have lesser impact from the write-hole bug. My
target is to fix the write hole first, and then this and other bugs.

> [FIX]
> Just consider the missing devices as WRITABLE, so we allocate new chunks
> on them to maintain the old profiles.

IMO, in a 3-disk RAID1, when one of the disks fails we still need the
_new writes_ not to be degraded: just use the two available disks. This
fix breaks that idea, which is what is being followed now.

Thanks, Anand
On 2019/9/12 at 6:27 PM, Anand Jain wrote:
> There is previous work [1].
>
> [1] https://lore.kernel.org/linux-btrfs/1461812780-538-1-git-send-email-anand.jain@oracle.com/
>
> I guess it was on purpose that a missing device is not part of chunk
> allocation, so as to have lesser impact from the write-hole bug. My
> target is to fix the write hole first, and then this and other bugs.
>
>> [FIX]
>> Just consider the missing devices as WRITABLE, so we allocate new chunks
>> on them to maintain the old profiles.
>
> IMO, in a 3-disk RAID1, when one of the disks fails we still need the
> _new writes_ not to be degraded: just use the two available disks. This
> fix breaks that idea, which is what is being followed now.

So a priority order then: use existing devices, and only when that's
impossible, consider the degraded/missing device?

> Thanks, Anand
diff --git a/fs/btrfs/volumes.c b/fs/btrfs/volumes.c
index 56f751192a6c..cc30b1fa9306 100644
--- a/fs/btrfs/volumes.c
+++ b/fs/btrfs/volumes.c
@@ -7002,6 +7002,18 @@ static int read_one_dev(struct extent_buffer *leaf,
 	fill_device_from_item(leaf, dev_item, device);
 	set_bit(BTRFS_DEV_STATE_IN_FS_METADATA, &device->dev_state);
+
+	/*
+	 * We treat missing devices as writable, so that we can maintain
+	 * the existing profiles without degrading to DUP/SINGLE.
+	 */
+	if (test_bit(BTRFS_DEV_STATE_MISSING, &device->dev_state)) {
+		set_bit(BTRFS_DEV_STATE_WRITEABLE, &device->dev_state);
+		list_add(&device->dev_alloc_list,
+			 &fs_devices->alloc_list);
+		fs_devices->rw_devices++;
+	}
+
 	if (test_bit(BTRFS_DEV_STATE_WRITEABLE, &device->dev_state) &&
 	    !test_bit(BTRFS_DEV_STATE_REPLACE_TGT, &device->dev_state)) {
 		device->fs_devices->total_rw_bytes += device->total_bytes;
[BUG]
There is a long-standing bug that a degraded-mounted btrfs can allocate new
SINGLE/DUP chunks on a RAID1 fs:

#!/bin/bash

dev1=/dev/test/scratch1
dev2=/dev/test/scratch2
mnt=/mnt/btrfs

umount $mnt &> /dev/null
umount $dev1 &> /dev/null
umount $dev2 &> /dev/null

dmesg -C
mkfs.btrfs -f -m raid1 -d raid1 $dev1 $dev2

wipefs -fa $dev2

mount -o degraded $dev1 $mnt
btrfs balance start --full $mnt
umount $mnt
echo "=== chunk after degraded mount ==="
btrfs ins dump-tree -t chunk $dev1 | grep stripe_len.*type

The resulting fs will have chunks with SINGLE and DUP only:

=== chunk after degraded mount ===
length 33554432 owner 2 stripe_len 65536 type SYSTEM
length 1073741824 owner 2 stripe_len 65536 type DATA
length 1073741824 owner 2 stripe_len 65536 type DATA|DUP
length 219676672 owner 2 stripe_len 65536 type METADATA|DUP
length 33554432 owner 2 stripe_len 65536 type SYSTEM|DUP

This behavior greatly breaks the RAID1 tolerance.

Even with the missing device replaced, if a device with DUP/SINGLE chunks
on it goes missing, the whole fs can't be mounted RW any more.
And we already have reports that users can't even mount the fs, as some
essential tree blocks got written to those DUP chunks.

[CAUSE]
The cause is pretty simple: we treat missing devices as non-writable.
Thus when we need to allocate chunks, we can only fall back to single
device profiles (SINGLE and DUP).

[FIX]
Just consider the missing devices as WRITABLE, so we allocate new chunks
on them to maintain the old profiles.

Signed-off-by: Qu Wenruo <wqu@suse.com>
---
 fs/btrfs/volumes.c | 12 ++++++++++++
 1 file changed, 12 insertions(+)