[v2] blktests: replace module removal with patient module removal

A long time ago, in a galaxy far, far away...

I ran into some odd scsi_debug false positives with fstests. This
prompted me to look into them given these false positives prevents
me from moving forward with establishing a test baseline with high
number of cycles. That is, this stupid issue was prevening creating
high confidence in testing.

I reported it [0] and exchanged some ideas with Doug. However, in
the end, despite efforts to help things with scsi_debug there were
still issues lingering which seemed to defy our expectations upstream.
One of the last hanging fruit issues is and always has been that
userspace expectations for proper module removal has been broken,
so in the end I have demonstrated this is a generic issue [1].

Long ago a WAIT option for module removal was added... that was then
removed as it was deemed not needed as folks couldn't figure out when
these races happened. The races are actually pretty easy to trigger, it
was just never properly documented. A simpe blkdev_open() will easily
bump a module refcnt, and these days many thing scan do that sort of
thing.

The proper solution is to implement then a patient module removal
on kmod and patches have been sent for that and those patches are
under review. In the meantime we need a work around to open code a
similar solution for users of old versions of kmod. I sent an open
coded solution for fstests about since August 19th and has been used
there for a few months now. Now that that stuff is merged and tested
in fstests with more exposure, its time to match parity on blktests.

I've tested blktests with this for things which I can run virtually
for a while now. More wider testig is welcomed.

[0] https://bugzilla.kernel.org/show_bug.cgi?id=212337
[1] https://bugzilla.kernel.org/show_bug.cgi?id=214015

Signed-off-by: Luis Chamberlain <mcgrof@kernel.org>
Reviewed-by: Chaitanya Kulkarni <kch@nvidia.com>
---

This v2:

Goes tested with a series of fixes for nvme and srp. It also goes
tested against shellcheck. The biggest change is to adapt the patient
rmmod check support into a helper has_modprobe_patient(). The other
bigger change is to ensure we use nvme_loop when using the patient
removal as the directory we use to for the recfct *is* case sensitive,
we can't use aliases.

I've tested this with as many things I could now and I can't find
any regressions. For those curious my baseline for v5.17-rc can be
found here:

https://github.com/mcgrof/kdevops/blob/master/workflows/blktests/expunges/5.17-rc7/failures.txt

 common/multipath-over-rdma |  11 +--
 common/null_blk            |   9 ++-
 common/rc                  | 153 ++++++++++++++++++++++++++++++++++---
 common/scsi_debug          |   9 ++-
 tests/nvme/rc              |  12 +--
 tests/nvmeof-mp/rc         |  15 ++--
 tests/srp/rc               |  19 ++---
 7 files changed, 176 insertions(+), 52 deletions(-)

Message ID	YlogluONIoc1VTCI@bombadil.infradead.org (mailing list archive)
State	New, archived
Headers	show Return-Path: <linux-block-owner@kernel.org> X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 4EA54C433EF for <linux-block@archiver.kernel.org>; Sat, 16 Apr 2022 02:20:42 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229528AbiDPCWj (ORCPT <rfc822;linux-block@archiver.kernel.org>); Fri, 15 Apr 2022 22:22:39 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:60924 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229767AbiDPCWS (ORCPT <rfc822;linux-block@vger.kernel.org>); Fri, 15 Apr 2022 22:22:18 -0400 Received: from bombadil.infradead.org (bombadil.infradead.org [IPv6:2607:7c80:54:e::133]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id A2BF359386 for <linux-block@vger.kernel.org>; Fri, 15 Apr 2022 19:19:47 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=infradead.org; s=bombadil.20210309; h=Sender:Content-Type:MIME-Version: Message-ID:Subject:Cc:To:From:Date:Reply-To:Content-Transfer-Encoding: Content-ID:Content-Description:In-Reply-To:References; bh=Tm8kf1CVsiCkas7EvRVvZ7JffHNYPtHCjVUn4ME5I2w=; b=GXnUMAFP4ccZ0/tDwsgQxJpeHx ho85L9AWTHVr9G3rOpWlGBXKCgrSO/hm0VwYahP9uGQpgfyG/5uZ4O6kkExxpwRat2q0hBAcRXxc3 Gkj+mwfdE6ay+GBDjckrExEzy9+XTOnuMLTa/BF/36YMUmzglh7y4IlCnGS/tPHOHae6sEtpM19sk kIgkYxuG9TK+O1m5Cd5xiKh4Ix4eBWfbpQg2/aV0xIusxtm53ttY6kMT5ut6rfFf6ZYwMrlE7Lyt7 e4tEd9Z9oCyE+ZwfHXfCY7a0B1oEa6yfuvgOKte1Ke0MImapbx/03f8FnEqsuVH8J+1xRYSB0QOAF qVdchg3A==; Received: from mcgrof by bombadil.infradead.org with local (Exim 4.94.2 #2 (Red Hat Linux)) id 1nfXYk-00BuhQ-Qv; Sat, 16 Apr 2022 01:49:10 +0000 Date: Fri, 15 Apr 2022 18:49:10 -0700 From: Luis Chamberlain <mcgrof@kernel.org> To: osandov@fb.com, Bart Van Assche <bvanassche@acm.org> Cc: linux-block@vger.kernel.org, mcgrof@kernel.org Subject: [PATCH v2] blktests: replace module removal with patient module removal Message-ID: <YlogluONIoc1VTCI@bombadil.infradead.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Sender: Luis Chamberlain <mcgrof@infradead.org> Precedence: bulk List-ID: <linux-block.vger.kernel.org> X-Mailing-List: linux-block@vger.kernel.org
Series	[v2] blktests: replace module removal with patient module removal \| expand [v2] blktests: replace module removal with patient module removal

[v2] blktests: replace module removal with patient module removal

Commit Message

Comments

Patch