From patchwork Sat Jan 30 17:22:44 2016 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Al Viro X-Patchwork-Id: 8171951 Return-Path: X-Original-To: patchwork-linux-fsdevel@patchwork.kernel.org Delivered-To: patchwork-parsemail@patchwork2.web.kernel.org Received: from mail.kernel.org (mail.kernel.org [198.145.29.136]) by patchwork2.web.kernel.org (Postfix) with ESMTP id 5F11BBEEE5 for ; Sat, 30 Jan 2016 17:22:51 +0000 (UTC) Received: from mail.kernel.org (localhost [127.0.0.1]) by mail.kernel.org (Postfix) with ESMTP id 96B6120396 for ; Sat, 30 Jan 2016 17:22:50 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id 5B09420395 for ; Sat, 30 Jan 2016 17:22:49 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932288AbcA3RWs (ORCPT ); Sat, 30 Jan 2016 12:22:48 -0500 Received: from zeniv.linux.org.uk ([195.92.253.2]:58253 "EHLO ZenIV.linux.org.uk" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932215AbcA3RWr (ORCPT ); Sat, 30 Jan 2016 12:22:47 -0500 Received: from viro by ZenIV.linux.org.uk with local (Exim 4.76 #1 (Red Hat Linux)) id 1aPZES-0001Yd-BW; Sat, 30 Jan 2016 17:22:44 +0000 Date: Sat, 30 Jan 2016 17:22:44 +0000 From: Al Viro To: Mike Marshall Cc: Linus Torvalds , linux-fsdevel Subject: Re: Orangefs ABI documentation Message-ID: <20160130172244.GD17997@ZenIV.linux.org.uk> References: <20160123001202.GJ17997@ZenIV.linux.org.uk> <20160123012808.GK17997@ZenIV.linux.org.uk> <20160123191055.GN17997@ZenIV.linux.org.uk> <20160123214006.GO17997@ZenIV.linux.org.uk> <20160124001615.GT17997@ZenIV.linux.org.uk> <20160124040529.GX17997@ZenIV.linux.org.uk> MIME-Version: 1.0 Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-fsdevel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-fsdevel@vger.kernel.org X-Spam-Status: No, score=-6.9 required=5.0 tests=BAYES_00, RCVD_IN_DNSWL_HI, RP_MATCHES_RCVD, UNPARSEABLE_RELAY autolearn=unavailable version=3.3.1 X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on mail.kernel.org X-Virus-Scanned: ClamAV using ClamSMTP On Sun, Jan 24, 2016 at 05:12:30PM -0500, Mike Marshall wrote: > But in my tests, if I kill the client-core bad things happen... > sometimes the client-core doesn't restart, and the kernel gets > sick (hangs or slows way down but no oops). When the client-core > does restart, the activity I had going on (dbench again) fizzles out, > and the filesystem is corrupted... > Anyhow, I don't think the "restart the client-core" code is up to snuff . > > I'll look closer at how the out-of-tree module works, maybe it really > does work and we've broken it with our massive changes to the > upstream version over the last few years. I see that the client (whose > job it is to restart the client-core) and the client-core implement > signal handling with signal(2), whose man page says to use > sigaction(2) instead... Could you try this and see if either WARN_ON() actually triggers? --- To unsubscribe from this list: send the line "unsubscribe linux-fsdevel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html diff --git a/fs/orangefs/file.c b/fs/orangefs/file.c index c585063d..e2ab0d4 100644 --- a/fs/orangefs/file.c +++ b/fs/orangefs/file.c @@ -246,10 +246,7 @@ populate_shared_memory: iter, new_op->downcall.resp.io.amt_complete); if (ret < 0) { - /* - * put error codes in downcall so that handle_io_error() - * preserves it properly - */ + WARN_ON(!op_state_serviced(new_op)); new_op->downcall.status = ret; handle_io_error(); goto out; diff --git a/fs/orangefs/waitqueue.c b/fs/orangefs/waitqueue.c index cdbf57b..191d886 100644 --- a/fs/orangefs/waitqueue.c +++ b/fs/orangefs/waitqueue.c @@ -205,6 +205,7 @@ retry_servicing: /* op uses shared memory */ if (orangefs_get_bufmap_init() == 0) { + WARN_ON(1); /* * This operation uses the shared memory system AND * the system is not yet ready. This situation occurs