From patchwork Mon Aug 7 21:26:05 2023 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Saeed Mahameed X-Patchwork-Id: 13345052 X-Patchwork-Delegate: kuba@kernel.org Received: from smtp.kernel.org (aws-us-west-2-korg-mail-1.web.codeaurora.org [10.30.226.201]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 3A3221805B for ; Mon, 7 Aug 2023 21:26:24 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id EB4D9C433B6; Mon, 7 Aug 2023 21:26:23 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1691443584; bh=d7nqTyzUfwZGjRmclL6wPh97MDHAWWdLkKp6UwdXVYI=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=a1gapi0l8kXJtOoCgIXb6nQIxp5mhXGxHhhIlUfj4z8RHFPQ2O+Sk0ffiKhAYSi5D GMPZs482ORJBzavahF6qgJkJAPd0U7Ai38oulkKigg3J4LvhJNe14iWgePF2HGBC3W E27nJgLuWIvHa9uKuP3EumlAdHwX5xNaLYq+uJPgJZh7pOtUDA0Ma5MY1ES1RKSIgb /zrIUnh1WnbVkzXLMDb0FvjmAv5BCmHp42fW91HHK4KrzlJc0seMhn2fJWfNNDcIl4 iaTeBq1RgpqRkogFUfQ4i8PC0EAEQ0DcR1RaEmcV4g3FleuAJKm/bbkr/ZeHhuG7C5 h3iCaJ5YgX4KA== From: Saeed Mahameed To: "David S. Miller" , Jakub Kicinski , Paolo Abeni , Eric Dumazet Cc: Saeed Mahameed , netdev@vger.kernel.org, Tariq Toukan , Moshe Shemesh , Ganesh G R , Aya Levin Subject: [net 09/11] net/mlx5: Skip clock update work when device is in error state Date: Mon, 7 Aug 2023 14:26:05 -0700 Message-ID: <20230807212607.50883-10-saeed@kernel.org> X-Mailer: git-send-email 2.41.0 In-Reply-To: <20230807212607.50883-1-saeed@kernel.org> References: <20230807212607.50883-1-saeed@kernel.org> Precedence: bulk X-Mailing-List: netdev@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 X-Patchwork-Delegate: kuba@kernel.org From: Moshe Shemesh When device is in error state, marked by the flag MLX5_DEVICE_STATE_INTERNAL_ERROR, the HW and PCI may not be accessible and so clock update work should be skipped. Furthermore, such access through PCI in error state, after calling mlx5_pci_disable_device() can result in failing to recover from pci errors. Fixes: ef9814deafd0 ("net/mlx5e: Add HW timestamping (TS) support") Reported-and-tested-by: Ganesh G R Closes: https://lore.kernel.org/netdev/9bdb9b9d-140a-7a28-f0de-2e64e873c068@nvidia.com Signed-off-by: Moshe Shemesh Reviewed-by: Aya Levin Signed-off-by: Saeed Mahameed --- drivers/net/ethernet/mellanox/mlx5/core/lib/clock.c | 5 +++++ 1 file changed, 5 insertions(+) diff --git a/drivers/net/ethernet/mellanox/mlx5/core/lib/clock.c b/drivers/net/ethernet/mellanox/mlx5/core/lib/clock.c index 973babfaff25..377372f0578a 100644 --- a/drivers/net/ethernet/mellanox/mlx5/core/lib/clock.c +++ b/drivers/net/ethernet/mellanox/mlx5/core/lib/clock.c @@ -227,10 +227,15 @@ static void mlx5_timestamp_overflow(struct work_struct *work) clock = container_of(timer, struct mlx5_clock, timer); mdev = container_of(clock, struct mlx5_core_dev, clock); + if (mdev->state == MLX5_DEVICE_STATE_INTERNAL_ERROR) + goto out; + write_seqlock_irqsave(&clock->lock, flags); timecounter_read(&timer->tc); mlx5_update_clock_info_page(mdev); write_sequnlock_irqrestore(&clock->lock, flags); + +out: schedule_delayed_work(&timer->overflow_work, timer->overflow_period); }