block: mq-deadline: Fix write completion handling For a zoned block device using mq-deadline, if a write request for a zone is received while another write was already dispatched for the same zone, dd_dispatch_request() will return NULL and the newly inserted write request is kept in the scheduler queue waiting for the ongoing zone write to complete. With this behavior, when no other request has been dispatched, rq_list in blk_mq_sched_dispatch_requests() is empty and blk_mq_sched_mark_restart_hctx() not called. This in turn leads to __blk_mq_free_request() call of blk_mq_sched_restart() to not run the queue when the already dispatched write request completes. The newly dispatched request stays stuck in the scheduler queue until eventually another request is submitted. This problem does not affect SCSI disk as the SCSI stack handles queue restart on request completion. However, this problem is can be triggered the nullblk driver with zoned mode enabled. Fix this by always requesting a queue restart in dd_dispatch_request() if no request was dispatched while WRITE requests are queued. Fixes: 5700f69178e9 ("mq-deadline: Introduce zone locking support") Cc: <stable@vger.kernel.org> Signed-off-by: Damien Le Moal <damien.lemoal@wdc.com> Add missing export of blk_mq_sched_restart() Signed-off-by: Jens Axboe <axboe@kernel.dk>

commit: 7211aef86f79583e59b88a0aba0bc830566f7e8e [log] [tgz]
author: Damien Le Moal <damien.lemoal@wdc.com> Mon Dec 17 15:14:05 2018 +0900
committer: Jens Axboe <axboe@kernel.dk> Mon Dec 17 11:19:39 2018 -0700
tree: c4204803303baa242dba8562e55dfc781ea5b97a
parent: 7e849dd9cf37bc52aff9b5236377c405040c959c [diff] [blame]
diff --git a/block/mq-deadline.c b/block/mq-deadline.c
index 1bd06ce..14288f8 100644
--- a/block/mq-deadline.c
+++ b/block/mq-deadline.c

@@ -373,9 +373,16 @@ static struct request *__dd_dispatch_request(struct deadline_data *dd)
 
 /*
  * One confusing aspect here is that we get called for a specific
- * hardware queue, but we return a request that may not be for a
+ * hardware queue, but we may return a request that is for a
  * different hardware queue. This is because mq-deadline has shared
  * state for all hardware queues, in terms of sorting, FIFOs, etc.
+ *
+ * For a zoned block device, __dd_dispatch_request() may return NULL
+ * if all the queued write requests are directed at zones that are already
+ * locked due to on-going write requests. In this case, make sure to mark
+ * the queue as needing a restart to ensure that the queue is run again
+ * and the pending writes dispatched once the target zones for the ongoing
+ * write requests are unlocked in dd_finish_request().
  */
 static struct request *dd_dispatch_request(struct blk_mq_hw_ctx *hctx)
 {
@@ -384,6 +391,9 @@ static struct request *dd_dispatch_request(struct blk_mq_hw_ctx *hctx)
 
 	spin_lock(&dd->lock);
 	rq = __dd_dispatch_request(dd);
+	if (!rq && blk_queue_is_zoned(hctx->queue) &&
+	    !list_empty(&dd->fifo_list[WRITE]))
+		blk_mq_sched_mark_restart_hctx(hctx);
 	spin_unlock(&dd->lock);
 
 	return rq;
commit	7211aef86f79583e59b88a0aba0bc830566f7e8e	[log] [tgz]
author	Damien Le Moal <damien.lemoal@wdc.com>	Mon Dec 17 15:14:05 2018 +0900
committer	Jens Axboe <axboe@kernel.dk>	Mon Dec 17 11:19:39 2018 -0700
tree	c4204803303baa242dba8562e55dfc781ea5b97a
parent	7e849dd9cf37bc52aff9b5236377c405040c959c [diff] [blame]