[v4,4/6] blk-mq: Introduce blk_mq_delay_run_hw_queue()
diff mbox

Message ID 20170407181654.27836-5-bart.vanassche@sandisk.com
State New
Headers show

Commit Message

Bart Van Assche April 7, 2017, 6:16 p.m. UTC
Introduce a function that runs a hardware queue unconditionally
after a delay. Note: there is already a function that stops and
restarts a hardware queue after a delay, namely blk_mq_delay_queue().

This function will be used in the next patch in this series.

Signed-off-by: Bart Van Assche <bart.vanassche@sandisk.com>
Cc: Christoph Hellwig <hch@lst.de>
Cc: Hannes Reinecke <hare@suse.de>
Cc: Long Li <longli@microsoft.com>
Cc: K. Y. Srinivasan <kys@microsoft.com>
---
 block/blk-mq.c         | 32 ++++++++++++++++++++++++++++++--
 include/linux/blk-mq.h |  2 ++
 2 files changed, 32 insertions(+), 2 deletions(-)

Comments

Christoph Hellwig April 10, 2017, 7:12 a.m. UTC | #1
> +	if (msecs == 0)
> +		kblockd_schedule_work_on(blk_mq_hctx_next_cpu(hctx),
> +					 &hctx->run_work);
> +	else
> +		kblockd_schedule_delayed_work_on(blk_mq_hctx_next_cpu(hctx),
> +						 &hctx->delayed_run_work,
> +						 msecs_to_jiffies(msecs));
> +}

I'd rather make run_work a delayed_work (again) and use
kblockd_schedule_delayed_work_on with a timeout of zero for the immediate
run case instead of having two competing work structs.
Jens Axboe April 10, 2017, 3:02 p.m. UTC | #2
On 04/10/2017 01:12 AM, Christoph Hellwig wrote:
>> +	if (msecs == 0)
>> +		kblockd_schedule_work_on(blk_mq_hctx_next_cpu(hctx),
>> +					 &hctx->run_work);
>> +	else
>> +		kblockd_schedule_delayed_work_on(blk_mq_hctx_next_cpu(hctx),
>> +						 &hctx->delayed_run_work,
>> +						 msecs_to_jiffies(msecs));
>> +}
> 
> I'd rather make run_work a delayed_work (again) and use
> kblockd_schedule_delayed_work_on with a timeout of zero for the immediate
> run case instead of having two competing work structs.

Yeah that's a good point, it'd have to be an incremental patch at this
point though. Also note that blk_mq_stop_hw_queue() isn't currently
canceling the new ->delayed_run_work, that looks like a bug.

And looking at it, right now we have 3 (three!) work items in the
hardware queue. The two delayed items differ in that one of them only
runs the queue if it was previously stopped; that's it. The non-delayed one
is identical to the non-stopped-checking delayed variant.

I'll send out a patch.

Patch
diff mbox

diff --git a/block/blk-mq.c b/block/blk-mq.c
index aff85d41cea3..836e3a17da54 100644
--- a/block/blk-mq.c
+++ b/block/blk-mq.c
@@ -1146,7 +1146,8 @@  static int blk_mq_hctx_next_cpu(struct blk_mq_hw_ctx *hctx)
 	return hctx->next_cpu;
 }
 
-void blk_mq_run_hw_queue(struct blk_mq_hw_ctx *hctx, bool async)
+static void __blk_mq_delay_run_hw_queue(struct blk_mq_hw_ctx *hctx, bool async,
+					unsigned long msecs)
 {
 	if (unlikely(blk_mq_hctx_stopped(hctx) ||
 		     !blk_mq_hw_queue_mapped(hctx)))
@@ -1163,7 +1164,24 @@  void blk_mq_run_hw_queue(struct blk_mq_hw_ctx *hctx, bool async)
 		put_cpu();
 	}
 
-	kblockd_schedule_work_on(blk_mq_hctx_next_cpu(hctx), &hctx->run_work);
+	if (msecs == 0)
+		kblockd_schedule_work_on(blk_mq_hctx_next_cpu(hctx),
+					 &hctx->run_work);
+	else
+		kblockd_schedule_delayed_work_on(blk_mq_hctx_next_cpu(hctx),
+						 &hctx->delayed_run_work,
+						 msecs_to_jiffies(msecs));
+}
+
+void blk_mq_delay_run_hw_queue(struct blk_mq_hw_ctx *hctx, unsigned long msecs)
+{
+	__blk_mq_delay_run_hw_queue(hctx, true, msecs);
+}
+EXPORT_SYMBOL(blk_mq_delay_run_hw_queue);
+
+void blk_mq_run_hw_queue(struct blk_mq_hw_ctx *hctx, bool async)
+{
+	__blk_mq_delay_run_hw_queue(hctx, async, 0);
 }
 
 void blk_mq_run_hw_queues(struct request_queue *q, bool async)
@@ -1266,6 +1284,15 @@  static void blk_mq_run_work_fn(struct work_struct *work)
 	__blk_mq_run_hw_queue(hctx);
 }
 
+static void blk_mq_delayed_run_work_fn(struct work_struct *work)
+{
+	struct blk_mq_hw_ctx *hctx;
+
+	hctx = container_of(work, struct blk_mq_hw_ctx, delayed_run_work.work);
+
+	__blk_mq_run_hw_queue(hctx);
+}
+
 static void blk_mq_delay_work_fn(struct work_struct *work)
 {
 	struct blk_mq_hw_ctx *hctx;
@@ -1866,6 +1893,7 @@  static int blk_mq_init_hctx(struct request_queue *q,
 		node = hctx->numa_node = set->numa_node;
 
 	INIT_WORK(&hctx->run_work, blk_mq_run_work_fn);
+	INIT_DELAYED_WORK(&hctx->delayed_run_work, blk_mq_delayed_run_work_fn);
 	INIT_DELAYED_WORK(&hctx->delay_work, blk_mq_delay_work_fn);
 	spin_lock_init(&hctx->lock);
 	INIT_LIST_HEAD(&hctx->dispatch);
diff --git a/include/linux/blk-mq.h b/include/linux/blk-mq.h
index bdea90d75274..b90c3d5766cd 100644
--- a/include/linux/blk-mq.h
+++ b/include/linux/blk-mq.h
@@ -51,6 +51,7 @@  struct blk_mq_hw_ctx {
 
 	atomic_t		nr_active;
 
+	struct delayed_work	delayed_run_work;
 	struct delayed_work	delay_work;
 
 	struct hlist_node	cpuhp_dead;
@@ -236,6 +237,7 @@  void blk_mq_stop_hw_queues(struct request_queue *q);
 void blk_mq_start_hw_queues(struct request_queue *q);
 void blk_mq_start_stopped_hw_queue(struct blk_mq_hw_ctx *hctx, bool async);
 void blk_mq_start_stopped_hw_queues(struct request_queue *q, bool async);
+void blk_mq_delay_run_hw_queue(struct blk_mq_hw_ctx *hctx, unsigned long msecs);
 void blk_mq_run_hw_queues(struct request_queue *q, bool async);
 void blk_mq_delay_queue(struct blk_mq_hw_ctx *hctx, unsigned long msecs);
 void blk_mq_tagset_busy_iter(struct blk_mq_tag_set *tagset,