diff mbox

question about request merge

Message ID tencent_4B199E7C402C9C6F6664BDE6@qq.com (mailing list archive)
State New, archived
Headers show

Commit Message

Zhengyuan Liu April 20, 2018, 3:51 a.m. UTC
Hi, Shaohua

I found it indeed doesn't do a front merge when two threads flush their plug lists concurrently. To
reproduce this, I prepared two IO threads, named a0.io and a1.io.
Thread a1.io  uses libaio to write 5 requests : 
    sectors: 16 + 8, 40 + 8, 64 + 8, 88 + 8, 112 + 8
Thread a0.io  uses libaio to write other 5 requests : 
    sectors: 8+ 8, 32 + 8, 56 + 8, 80 + 8, 104 + 8

To meet the condition that thread a1.io flushes all its requests to the queue before thread a0.io
does, I delayed thread a1.io from running the queue until after it had added all its plugged requests to the
queue, as shown in the patch below:


With the patch I restarted "./a1.io & sleep 1 && ./a0.io", and the blktrace result is shown below:

      8,16   2      249   471.608930240  1473  Q  WS 16 + 8 [a1.io]
      8,16   2      250   471.608934940  1473  G  WS 16 + 8 [a1.io]
      8,16   2      252   471.608940680  1473  Q  WS 40 + 8 [a1.io]
      8,16   2      253   471.608942840  1473  G  WS 40 + 8 [a1.io]
      8,16   2      254   471.608946500  1473  Q  WS 64 + 8 [a1.io]
      8,16   2      255   471.608948040  1473  G  WS 64 + 8 [a1.io]
      8,16   2      256   471.608951460  1473  Q  WS 88 + 8 [a1.io]
      8,16   2      257   471.608952960  1473  G  WS 88 + 8 [a1.io]
      8,16   2      258   471.608956340  1473  Q  WS 112 + 8 [a1.io]
      8,16   2      259   471.608957780  1473  G  WS 112 + 8 [a1.io]
      8,16   2      260   471.608959880  1473  I  WS 16 + 8 [a1.io]
      8,16   2      261   471.608975880  1473  I  WS 40 + 8 [a1.io]
      8,16   2      262   471.608986420  1473  I  WS 64 + 8 [a1.io]
      8,16   2      263   471.608995480  1473  I  WS 88 + 8 [a1.io]
      8,16   2      264   471.609003720  1473  I  WS 112 + 8 [a1.io]
      8,16   3      267   471.610200680  1474  Q  WS 8 + 8 [a0.io]
      8,16   3      268   471.610204700  1474  G  WS 8 + 8 [a0.io]
      8,16   3      270   471.610210280  1474  Q  WS 32 + 8 [a0.io]
      8,16   3      271   471.610211840  1474  G  WS 32 + 8 [a0.io]
      8,16   3      272   471.610215660  1474  Q  WS 56 + 8 [a0.io]
      8,16   3      273   471.610217200  1474  G  WS 56 + 8 [a0.io]
      8,16   3      274   471.610220860  1474  Q  WS 80 + 8 [a0.io]
      8,16   3      275   471.610222300  1474  G  WS 80 + 8 [a0.io]
      8,16   3      276   471.610225720  1474  Q  WS 104 + 8 [a0.io]
      8,16   3      277   471.610227540  1474  G  WS 104 + 8 [a0.io]
      8,16   3      278   471.610229100  1474  I  WS 8 + 8 [a0.io]
      8,16   3      279   471.615291420  1474  I  WS 32 + 8 [a0.io]
      8,16   3      280   471.620311200  1474  I  WS 56 + 8 [a0.io]
      8,16   3      281   471.625327620  1474  I  WS 80 + 8 [a0.io]
      8,16   3      282   471.630343080  1474  I  WS 104 + 8 [a0.io]
      8,16   3      284   471.637881600  1474  D  WS 104 + 16 [a0.io]
      8,16   3      285   471.640429120  1474  D  WS 80 + 16 [a0.io]
      8,16   0      397   471.644573100     3  C  WS 104 + 16 [0]
      8,16   0      398   471.647159640     3  D  WS 56 + 16 [ksoftirqd/0]
      8,16   1       49   471.649825940    13  C  WS 80 + 16 [0]
      8,16   1       50   471.652391540    13  D  WS 32 + 16 [ksoftirqd/1]
      8,16   0      399   471.654446580     3  C  WS 56 + 16 [0]
      8,16   0      400   471.657000820     3  D  WS 8 + 16 [ksoftirqd/0]
      8,16   0      401   471.659445000     3  C  WS 32 + 16 [0]
      8,16   0      402   471.663946360     3  C  WS 8 + 16 [0]

Now those adjacent requests get merged.

------------------ Original ------------------
From:  "Zhengyuan Liu"<liuzhengyuan@kylinos.cn>;
Date:  Fri, Apr 13, 2018 04:16 PM
To:  "shli"<shli@fb.com>;
Subject:  question about request merge
 
Hi, Shaohua

A question came up while I was reading the block layer code, so I'm writing to you for help.

When a request from the plug list gets flushed to the queue, it tries to merge into an existing request by
calling the function elv_attempt_insert_merge before being added directly to the queue, as shown below:

          /*
         * Attempt to do an insertion back merge. Only check for the case where
         * we can append 'rq' to an existing request, so we can throw 'rq' away
         * afterwards.
         *
         * Returns true if we merged, false otherwise
         */
        bool elv_attempt_insert_merge(struct request_queue *q, struct request *rq)
        {
                struct request *__rq;
                bool ret;

                if (blk_queue_nomerges(q))
                        return false;

                /*
                 * First try one-hit cache.
                 */
                if (q->last_merge && blk_attempt_req_merge(q, q->last_merge, rq))
                        return true;

                if (blk_queue_noxmerges(q))
                        return false;

                ret = false;
                /*
                 * See if our hash lookup can find a potential backmerge.
                 */
                while (1) {
                        __rq = elv_rqhash_find(q, blk_rq_pos(rq));
                        if (!__rq || !blk_attempt_req_merge(q, __rq, rq))
                                break;

                        /* The merged request could be merged with others, try again */
                        ret = true;
                        rq = __rq;
                }

                return ret;
        }

From the comment and code we can see it only does a backmerge. I think that's enough when there is only
one thread doing IO operations, since we have sorted the requests in the plug list before unplugging — see the
patches from Jianpeng Ma: 422765c("block: Remove should_sort judgement when flush blk_plug") and
975927b("block: Add blk_rq_pos(rq) to sort rq when plushing"). However, when it comes to multiple
threads, let's imagine the IO scenario below, where threads A and B each hold three requests in their plug lists.

    threadA: a+1, a+4, a+7
    threadB: a+2, a+5, a+8

If threadA has flushed all its requests to the queue before threadB does, then requests a+1 and a+2 and
other adjacent pairs have the chance to get merged through a backmerge; but if threadB flushes all its
requests to the queue before threadA, how could those adjacent requests get merged? Or does it not matter
at all? I know your patch bee0393("block: recursive merge requests") solved a multi-thread merge
problem, but not mine.
It's hard to simulate such an IO scenario from userspace, and I can't figure out other useful methods
to verify whether the requests in such an IO scenario really can't get merged. All my thoughts are
speculation, so I hope you can give me some answers. Any reply would be appreciated.

Zhengyuan

Comments

Jens Axboe April 20, 2018, 2:34 p.m. UTC | #1
On 4/19/18 9:51 PM, Zhengyuan Liu wrote:
> Hi, Shaohua
> 
> I found it indeed doesn't do front merge when two threads flush plug list  concurrently.   To 
> reappear , I prepared two IO threads , named a0.io and a1.io .
> Thread a1.io  uses libaio to write 5 requests : 
>     sectors: 16 + 8, 40 + 8, 64 + 8, 88 + 8, 112 + 8
> Thread a0.io  uses libaio to write other 5 requests : 
>     sectors: 8+ 8, 32 + 8, 56 + 8, 80 + 8, 104 + 8

I'm cutting some of the below.

Thanks for the detailed email. It's mostly on purpose that we don't
spend cycles and memory on maintaining a separate front merge hash,
since it's generally not something that happens very often. If you have
a thread pool doing IO and split sequential IO such that you would
benefit a lot from front merging, then I would generally claim that
you're not writing your app in the most optimal manner.

So I'm curious, what's the big interest in front merging?
diff mbox

Patch

--- a/block/blk-core.c
+++ b/block/blk-core.c
@@ -3324,6 +3324,21 @@  void blk_flush_plug_list(struct blk_plug *plug, bool from_schedule)
        /*
         * This drops the queue lock
         */
+
+           if (strcmp(current->comm, "a1.io") == 0)
+           {
+                   spin_unlock_irq(q->queue_lock); 
+                   mdelay(2000);
+                   spin_lock_irq(q->queue_lock);   
+           }
 
Then I used "./a1.io & sleep 1 && ./a0.io" to dispatch the IO operations and used blktrace to
trace the IO processing; the trace result is shown below:

      8,16   1      174   176.733880160  1217  Q  WS 16 + 8 [a1.io]
      8,16   1      175   176.733885300  1217  G  WS 16 + 8 [a1.io]
      8,16   1      177   176.733890820  1217  Q  WS 40 + 8 [a1.io]
      8,16   1      178   176.733892460  1217  G  WS 40 + 8 [a1.io]
      8,16   1      179   176.733896380  1217  Q  WS 64 + 8 [a1.io]
      8,16   1      180   176.733897900  1217  G  WS 64 + 8 [a1.io]
      8,16   1      181   176.733901260  1217  Q  WS 88 + 8 [a1.io]
      8,16   1      182   176.733902760  1217  G  WS 88 + 8 [a1.io]
      8,16   1      183   176.733906200  1217  Q  WS 112 + 8 [a1.io]
      8,16   1      184   176.733907640  1217  G  WS 112 + 8 [a1.io]
      8,16   1      185   176.733909560  1217  I  WS 16 + 8 [a1.io]
      8,16   1      186   176.733925480  1217  I  WS 40 + 8 [a1.io]
      8,16   1      187   176.733936560  1217  I  WS 64 + 8 [a1.io]
      8,16   1      188   176.733945920  1217  I  WS 88 + 8 [a1.io]
      8,16   1      189   176.733953880  1217  I  WS 112 + 8 [a1.io]
      8,16   2      103   176.734317080  1218  Q  WS 8 + 8 [a0.io]
      8,16   2      104   176.734320600  1218  G  WS 8 + 8 [a0.io]
      8,16   2      106   176.734325520  1218  Q  WS 32 + 8 [a0.io]
      8,16   2      107   176.734327140  1218  G  WS 32 + 8 [a0.io]
      8,16   2      108   176.734330620  1218  Q  WS 56 + 8 [a0.io]
      8,16   2      109   176.734332140  1218  G  WS 56 + 8 [a0.io]
      8,16   2      110   176.734335380  1218  Q  WS 80 + 8 [a0.io]
      8,16   2      111   176.734336760  1218  G  WS 80 + 8 [a0.io]
      8,16   2      112   176.734340300  1218  Q  WS 104 + 8 [a0.io]
      8,16   2      113   176.734341900  1218  G  WS 104 + 8 [a0.io]
      8,16   2      114   176.734343520  1218  I  WS 8 + 8 [a0.io]
      8,16   2      115   176.734357280  1218  I  WS 32 + 8 [a0.io]
      8,16   2      116   176.734366560  1218  I  WS 56 + 8 [a0.io]
      8,16   2      117   176.734374800  1218  I  WS 80 + 8 [a0.io]
      8,16   2      118   176.734382460  1218  I  WS 104 + 8 [a0.io]
      8,16   2      120   176.738012960  1218  D  WS 112 + 8 [a0.io]
      8,16   0      151   176.740923200     3  C  WS 112 + 8 [0]
      8,16   2      121   176.741804960  1218  D  WS 88 + 8 [a0.io]
      8,16   2      122   176.741826060  1218  R  WS 88 + 8 [0]
      8,16   2      123   176.741827740  1218  I  WS 88 + 8 [a0.io]
      8,16   0      152   176.745259080     3  D  WS 88 + 8 [ksoftirqd/0]
      8,16   0      153   176.747824380     3  D  WS 64 + 8 [ksoftirqd/0]
      8,16   0      154   176.748921220     3  C  WS 88 + 8 [0]
      8,16   0      155   176.751472640     3  D  WS 40 + 8 [ksoftirqd/0]
      8,16   0      156   176.753172500     3  C  WS 64 + 8 [0]
      8,16   0      157   176.755720740     3  D  WS 16 + 8 [ksoftirqd/0]
      8,16   0      158   176.757920780     3  C  WS 40 + 8 [0]
      8,16   0      159   176.760471320     3  D  WS 104 + 8 [ksoftirqd/0]
      8,16   0      160   176.763172180     3  C  WS 16 + 8 [0]
      8,16   0      161   176.765717560     3  D  WS 80 + 8 [ksoftirqd/0]
      8,16   0      162   176.766795400     3  C  WS 104 + 8 [0]
      8,16   0      163   176.769341020     3  D  WS 56 + 8 [ksoftirqd/0]
      8,16   2      124   176.770926380    18  C  WS 80 + 8 [0]
      8,16   0      164   176.774335640   924  C  WS 56 + 8 [0]
      8,16   2      125   176.774642140    18  D  WS 32 + 8 [ksoftirqd/2]
      8,16   2      126   176.774651300    18  R  WS 32 + 8 [0]
      8,16   2      127   176.774652900    18  I  WS 32 + 8 [ksoftirqd/2]
      8,16   0      165   176.778546360   924  D  WS 32 + 8 [sshd]
      8,16   0      166   176.782498120   924  D  WS 8 + 8 [sshd]
      8,16   0      167   176.785298980     3  C  WS 32 + 8 [0]
      8,16   0      168   176.788671240     3  C  WS 8 + 8 [0]

We can see from the result that the adjacent requests didn't get merged.
To make those adjacent requests get merged, I tried writing a simple patch:

diff --git a/block/elevator.c b/block/elevator.c
index c3555c9c6..40acd0e84 100644
--- a/block/elevator.c
+++ b/block/elevator.c
@@ -245,6 +245,7 @@  EXPORT_SYMBOL(elevator_exit);
 static inline void __elv_rqhash_del(struct request *rq)
 {
        hash_del(&rq->hash);
+       hash_del(&rq->front_hash);
        rq->cmd_flags &= ~REQ_HASHED;
 }
 
@@ -252,6 +253,7 @@  static void elv_rqhash_del(struct request_queue *q, struct request *rq)
 {
        if (ELV_ON_HASH(rq))
                __elv_rqhash_del(rq);
+
 }
 
 static void elv_rqhash_add(struct request_queue *q, struct request *rq)
@@ -260,6 +262,7 @@  static void elv_rqhash_add(struct request_queue *q, struct request *rq)
 
        BUG_ON(ELV_ON_HASH(rq));
        hash_add(e->hash, &rq->hash, rq_hash_key(rq));
+       hash_add(e->front_hash, &rq->front_hash, blk_rq_pos(rq));
        rq->cmd_flags |= REQ_HASHED;
 }
 
@@ -290,6 +293,28 @@  static struct request *elv_rqhash_find(struct request_queue *q, sector_t offset)
        return NULL;
 }
 
+static struct request *elv_fronthash_find(struct request_queue *q, sector_t offset)
+{
+        struct elevator_queue *e = q->elevator;
+        struct hlist_node *next;
+        struct request *rq;
+
+        hash_for_each_possible_safe(e->front_hash, rq, next, front_hash, offset) {
+                BUG_ON(!ELV_ON_HASH(rq));
+
+                if (unlikely(!rq_mergeable(rq))) {
+                        __elv_rqhash_del(rq);
+                        continue;
+                }
+
+                if (blk_rq_pos(rq) == offset)
+                        return rq;
+        }
+
+        return NULL;
+}
+
+
 /*
  * RB-tree support functions for inserting/lookup/removal of requests
  * in a sorted RB tree.
@@ -493,6 +518,39 @@  static bool elv_attempt_insert_merge(struct request_queue *q,
 
        return ret;
 }
+/*
+ * Attempt to do an insertion front merge.
+ * Returns true if we merged, false otherwise
+ */
+static bool elv_attempt_front_merge(struct request_queue *q,
+                                    struct request *rq)
+{
+       struct request *__rq;
+       bool ret;
+
+       if (blk_queue_nomerges(q))
+               return false;
+
+       /*
+        * First try one-hit cache.
+        */
+       if (q->last_merge && blk_attempt_req_merge(q, rq, q->last_merge))
+               return true;
+
+       if (blk_queue_noxmerges(q))
+               return false;
+
+       ret = false;
+       /*
+        * See if our hash lookup can find a potential frontmerge.
+        */
+
+       __rq = elv_fronthash_find(q, rq_hash_key(rq));
+       if (__rq && blk_attempt_req_merge(q, rq, __rq)) {
+               ret = true;
+       }
+       return ret;
+}
 
 void elv_merged_request(struct request_queue *q, struct request *rq, int type)
 {
@@ -657,6 +715,8 @@  void __elv_add_request(struct request_queue *q, struct request *rq, int where)
                 * elevator_add_req_fn.
                 */
                q->elevator->type->ops.elevator_add_req_fn(q, rq);
+               /* do a front merge after we added it to elevator */
+               elv_attempt_front_merge(q, rq);
                break;
 
        case ELEVATOR_INSERT_FLUSH:
diff --git a/include/linux/blkdev.h b/include/linux/blkdev.h
index 085a03f67..9fe25be1b 100644
--- a/include/linux/blkdev.h
+++ b/include/linux/blkdev.h
@@ -120,6 +120,8 @@  struct request {
                struct list_head ipi_list;
        };
 
+       struct hlist_node front_hash; /* front merge hash */
+
        /*
         * The rb_node is only used inside the io scheduler, requests
         * are pruned when moved to the dispatch queue. So let the
diff --git a/include/linux/elevator.h b/include/linux/elevator.h
index 638b324f0..1567e5412 100644
--- a/include/linux/elevator.h
+++ b/include/linux/elevator.h
@@ -114,6 +114,7 @@  struct elevator_queue
        struct mutex sysfs_lock;
        unsigned int registered:1;
        DECLARE_HASHTABLE(hash, ELV_HASH_BITS);
+       DECLARE_HASHTABLE(front_hash, ELV_HASH_BITS);
 };
 
 /*