From patchwork Thu Nov 17 08:36:14 2016
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: Sven Eckelmann
X-Patchwork-Id: 9433621
X-Patchwork-Delegate: kvalo@adurom.com
From: Sven Eckelmann
To: ath9k-devel@lists.ath9k.org
Cc: linux-wireless@vger.kernel.org, ath9k-devel@qca.qualcomm.com,
 Simon Wunderlich, Sven Eckelmann
Subject: [RFC v2 2/2] ath9k: Reset chip on potential deaf state
Date: Thu, 17 Nov 2016 09:36:14 +0100
Message-Id: <20161117083614.19188-2-sven.eckelmann@open-mesh.com>
X-Mailer: git-send-email 2.10.2
In-Reply-To: <20161117083614.19188-1-sven.eckelmann@open-mesh.com>
References: <20161117083614.19188-1-sven.eckelmann@open-mesh.com>
X-Mailing-List: linux-wireless@vger.kernel.org

From: Simon Wunderlich

The chip seemingly switches at random into a state which can be
described as "deaf". No or nearly no interrupts are generated anymore
for incoming packets. Existing links break down after a while and new
links cannot be established.

The driver doesn't know whether there is no other device available or
whether it ended up in a "deaf" state. Resetting the chip proactively
avoids permanent problems in case the chip really was in its "deaf"
state, but may cause unnecessary resets in case it wasn't "deaf".

Signed-off-by: Simon Wunderlich
[sven.eckelmann@open-mesh.com: port to recent ath9k, add commit message]
Signed-off-by: Sven Eckelmann
---
v2:
 - reduce amount of possible goto-raptor attacks by one (thanks Kalle Valo)

This problem was discovered in mesh setups. It was noticed that some
nodes were not able to see their neighbors (mostly after running for a
while), even when those neighbors received data from them via IBSS. A
simple `iw dev wlan0 scan` fixed the problem for them. But the problem
seems to reappear after a while(tm) in a large enough(tm) mesh.

This patch is a little bit obscure because it requires
CONFIG_ATH9K_DEBUGFS to actually work. But there still seems to be
potential interest in Freifunk communities or Freifunk meta-projects
(e.g. freifunk-gluon). It is currently not known whether it helps them,
but publishing it so that they can test and play around with it will
not hurt :)
---
 drivers/net/wireless/ath/ath9k/ath9k.h |  3 +++
 drivers/net/wireless/ath/ath9k/debug.c |  1 +
 drivers/net/wireless/ath/ath9k/debug.h |  1 +
 drivers/net/wireless/ath/ath9k/link.c  | 48 +++++++++++++++++++++++++++++++++-
 4 files changed, 52 insertions(+), 1 deletion(-)

diff --git a/drivers/net/wireless/ath/ath9k/ath9k.h b/drivers/net/wireless/ath/ath9k/ath9k.h
index 9c6fee7..3987ad5 100644
--- a/drivers/net/wireless/ath/ath9k/ath9k.h
+++ b/drivers/net/wireless/ath/ath9k/ath9k.h
@@ -996,6 +996,9 @@ struct ath_softc {
 	short nbcnvifs;
 	unsigned long ps_usecount;
 
+	unsigned long last_check_time;
+	u32 last_check_interrupts;
+
 	struct ath_rx rx;
 	struct ath_tx tx;
 	struct ath_beacon beacon;
diff --git a/drivers/net/wireless/ath/ath9k/debug.c b/drivers/net/wireless/ath/ath9k/debug.c
index 608b370..6d5c253 100644
--- a/drivers/net/wireless/ath/ath9k/debug.c
+++ b/drivers/net/wireless/ath/ath9k/debug.c
@@ -768,6 +768,7 @@ static int read_file_reset(struct seq_file *file, void *data)
 		[RESET_TX_DMA_ERROR] = "Tx DMA stop error",
 		[RESET_RX_DMA_ERROR] = "Rx DMA stop error",
 		[RESET_TYPE_DEADBEEF] = "deadbeef hang",
+		[RESET_TYPE_DEAF] = "deaf hang",
 	};
 	int i;
 
diff --git a/drivers/net/wireless/ath/ath9k/debug.h b/drivers/net/wireless/ath/ath9k/debug.h
index 0d77abbf6..6f186bd 100644
--- a/drivers/net/wireless/ath/ath9k/debug.h
+++ b/drivers/net/wireless/ath/ath9k/debug.h
@@ -53,6 +53,7 @@ enum ath_reset_type {
 	RESET_TX_DMA_ERROR,
 	RESET_RX_DMA_ERROR,
 	RESET_TYPE_DEADBEEF,
+	RESET_TYPE_DEAF,
 	__RESET_TYPE_MAX
 };
 
diff --git a/drivers/net/wireless/ath/ath9k/link.c b/drivers/net/wireless/ath/ath9k/link.c
index 04195d5..ae99c02 100644
--- a/drivers/net/wireless/ath/ath9k/link.c
+++ b/drivers/net/wireless/ath/ath9k/link.c
@@ -158,13 +158,59 @@ static bool ath_hw_hang_deadbeef(struct ath_softc *sc)
 	return true;
 }
 
+static bool ath_hw_hang_deaf(struct ath_softc *sc)
+{
+#ifndef CONFIG_ATH9K_DEBUGFS
+	return false;
+#else
+	struct ath_common *common = ath9k_hw_common(sc->sc_ah);
+	u32 interrupts, interrupt_per_s;
+	unsigned int interval;
+
+	/* get historic data */
+	interval = jiffies_to_msecs(jiffies - sc->last_check_time);
+	if (sc->sc_ah->caps.hw_caps & ATH9K_HW_CAP_EDMA)
+		interrupts = sc->debug.stats.istats.rxlp;
+	else
+		interrupts = sc->debug.stats.istats.rxok;
+
+	interrupts -= sc->last_check_interrupts;
+
+	/* save current data */
+	sc->last_check_time = jiffies;
+	if (sc->sc_ah->caps.hw_caps & ATH9K_HW_CAP_EDMA)
+		sc->last_check_interrupts = sc->debug.stats.istats.rxlp;
+	else
+		sc->last_check_interrupts = sc->debug.stats.istats.rxok;
+
+	/* sanity check, should be 30 seconds */
+	if (interval > 40000 || interval < 20000)
+		return false;
+
+	/* should be at least one interrupt per second */
+	interrupt_per_s = interrupts / (interval / 1000);
+	if (interrupt_per_s >= 1)
+		return false;
+
+	ath_dbg(common, RESET,
+		"RX deaf hang is detected. Schedule chip reset\n");
+	ath9k_queue_reset(sc, RESET_TYPE_DEAF);
+
+	return true;
+#endif
+}
+
 void ath_hw_hang_work(struct work_struct *work)
 {
 	struct ath_softc *sc = container_of(work, struct ath_softc,
 					    hw_hang_work.work);
 
-	ath_hw_hang_deadbeef(sc);
+	if (ath_hw_hang_deadbeef(sc))
+		goto requeue_worker;
+
+	ath_hw_hang_deaf(sc);
 
+requeue_worker:
 	ieee80211_queue_delayed_work(sc->hw, &sc->hw_hang_work,
 				     msecs_to_jiffies(ATH_HANG_WORK_INTERVAL));
 }
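
For anyone who wants to experiment with the thresholds outside the
driver, the heuristic in ath_hw_hang_deaf() can be approximated in plain
userspace C. This is only a sketch under stated assumptions: the names
`deaf_check_state` and `deaf_check` are hypothetical and not part of
ath9k, timestamps are plain milliseconds instead of jiffies, and the RX
interrupt counter is passed in by the caller instead of being read from
the debugfs statistics.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-in for the two fields the patch adds to struct ath_softc. */
struct deaf_check_state {
	uint32_t last_check_ms;         /* timestamp of the previous check, in ms */
	uint32_t last_check_interrupts; /* RX interrupt counter at the previous check */
};

/*
 * Returns true when the RX interrupt rate since the previous check fell
 * below one interrupt per second. The history is always updated, and the
 * verdict is only trusted when the check interval is between 20 and 40
 * seconds, as in the patch. Unsigned subtraction keeps the deltas correct
 * across counter wrap-around.
 */
bool deaf_check(struct deaf_check_state *st, uint32_t now_ms,
		uint32_t rx_interrupts)
{
	uint32_t interval = now_ms - st->last_check_ms;
	uint32_t delta = rx_interrupts - st->last_check_interrupts;

	/* save current data for the next call */
	st->last_check_ms = now_ms;
	st->last_check_interrupts = rx_interrupts;

	/* sanity check, the worker is expected to run roughly every 30 s */
	if (interval > 40000 || interval < 20000)
		return false;

	/* integer division, same rounding as in the patch */
	return delta / (interval / 1000) < 1;
}
```

Note that because of the integer division, 29 interrupts within a 30
second interval still count as "deaf", while 30 do not. A caller would
invoke deaf_check() once per worker run and schedule a reset whenever it
returns true.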