[5/5] mm: Target compaction on pageblocks that were recently fragmented

Despite the earlier patches, external fragmentation events are still
inevitable as not all callers can stall or are appropriate to stall
(e.g. unmovable allocations that kswapd reclaim will not necessarily
help). In the event there is a mixed pageblock, it's desirable to move all
movable pages from that block so that unmovable/unreclaimable allocations
do not further pollute the address space.

This patch queues such pageblocks for early compaction and relies on
kswapd to wake kcompactd when some pages are reclaimed. Waking kcompactd
after kswapd makes progress is so that the compaction is more likely to
have a suitable migration destination.

This patch may be controversial as there are multiple other design
decisions that can be made. We could refuse to change pageblock ownership
in some cases but great care would need to be taken to avoid premature
OOMs or a livelock. Similarly, we could tag pageblocks as mixed and
search for them but that would increase scanning costs. Finally, there
is a corner case that a mixed pageblock that is after the point where a
free scanner can operate may fail to clean the pageblock but addressing
that would require a fundamental alteration to how compaction works.

Unlike the previous series, this one is harder to prove that it is a benefit
because it ideally require a very long-lived workload that is fragmenting
to show if it's really effective. The timing of such an allocation stream
would be critical and detecting the change would be difficult can be
within the noise. Hence, the potential benefit of this patch is more
conceptual than quantitive even though there are some positive results.

1-socket Skylake machine
config-global-dhp__workload_thpfioscale XFS (no special madvise)
4 fio threads, 1 THP allocating thread
--------------------------------------

4.19 extfrag events < order 0:  71227
4.19+patch1:                    36456 (49% reduction)
4.19+patch1-3:                   4510 (94% reduction)
4.19+patch1-4:                    548 (99% reduction)
4.19+patch1-5:                    422 (99% reduction)

                                       4.19.0                 4.19.0
                                   stall-v1r6         proactive-v1r6
Amean     fault-base-1      839.48 (   0.00%)      860.89 *  -2.55%*
Amean     fault-huge-1      172.74 (   0.00%)      159.49 (   7.67%)

                                  4.19.0                 4.19.0
                              stall-v1r6         proactive-v1r6
Percentage huge-1        1.04 (   0.00%)        2.29 ( 119.35%)

While there is an improvement in the reduction of fragmentation events
and allocation success rates, the differences are marginal enough that
it may not be significant.

1-socket Skylake machine
global-dhp__workload_thpfioscale-madvhugepage-xfs (MADV_HUGEPAGE)
-----------------------------------------------------------------

4.19 extfrag events < order 0:  40761
4.19+patch1:                    36085 (11% reduction)
4.19+patch1-3:                   1887 (95% reduction)
4.19+patch1-4:                    394 (99% reduction)
4.19+patch1-5:                    440 (99% reduction)

thpfioscale Fault Latencies
                                       4.19.0                 4.19.0
                                   stall-v1r6         proactive-v1r6
Amean     fault-base-1     3943.28 (   0.00%)     2704.46 *  31.42%*
Amean     fault-huge-1     2739.80 (   0.00%)     2552.13 *   6.85%*

thpfioscale Percentage Faults Huge
                                  4.19.0                 4.19.0
                              stall-v1r6         proactive-v1r6
Percentage huge-1       98.55 (   0.00%)       98.76 (   0.20%)

Slight increase in fragmentation events albeit very small. The latency
is much improved as well as a slight increase in allocation success
rates but this may be a co-incidence of the system state.

2-socket Haswell machine
config-global-dhp__workload_thpfioscale XFS (no special madvise)
4 fio threads, 5 THP allocating threads
----------------------------------------------------------------

4.19 extfrag events < order 0:  882868
4.19+patch1:                    476937 (46% reduction)
4.19+patch1-3:                   29044 (97% reduction)
4.19+patch1-4:                   29290 (97% reduction)
4.19+patch1-5:                   30791 (97% reduction)

thpfioscale Fault Latencies
                                       4.19.0                 4.19.0
                                   stall-v1r6         proactive-v1r6
Amean     fault-base-5     1773.24 (   0.00%)     1519.89 *  14.29%*
Amean     fault-huge-5    17791.20 (   0.00%)      536.44 (  96.98%)

                                  4.19.0                 4.19.0
                              stall-v1r6         proactive-v1r6
Percentage huge-5        0.17 (   0.00%)        0.98 ( 490.00%)

Again, the fragmentation causing events is slightly increased although
this is likely within the noise. The latency is massively improved but
the success rate is only marginally improved. Given the low success rate,
it may be a co-incidence of the exact system state during the test but
the fact it happened on both 1 and 2 socket machines is encouraging.

2-socket Haswell machine
global-dhp__workload_thpfioscale-madvhugepage-xfs (MADV_HUGEPAGE)
-----------------------------------------------------------------

4.19 extfrag events < order 0: 803099
4.19+patch1:                   654671 (23% reduction)
4.19+patch1-3:                  24352 (97% reduction)
4.19+patch1-4:                  16698 (98% reduction)
4.19+patch1-5:                  32623 (96% reduction)

thpfioscale Fault Latencies
                                       4.19.0                 4.19.0
                                   stall-v1r6         proactive-v1r6
Amean     fault-base-5     8649.60 (   0.00%)    13074.71 * -51.16%*
Amean     fault-huge-5     2799.82 (   0.00%)     3410.02 * -21.79%*

thpfioscale Percentage Faults Huge
                                  4.19.0                 4.19.0
                              stall-v1r6         proactive-v1r6
Percentage huge-5       77.80 (   0.00%)       83.30 (   7.06%)

This shows an increase in both fragmentation events and latency. However
it is somewhat balanced by the higher allocation success rates which in
themselves can increase fragmentation pressure.

This is less an obvious universal win. It does control fragmentation
better to some extent in that pageblocks can be found faster in some
cases but the nature of the workload makes it less clear-cut.

Signed-off-by: Mel Gorman <mgorman@techsingularity.net>
---
 include/linux/compaction.h        |   4 ++
 include/linux/migrate.h           |   7 +-
 include/linux/mmzone.h            |   4 ++
 include/trace/events/compaction.h |  62 ++++++++++++++++
 mm/compaction.c                   | 146 +++++++++++++++++++++++++++++++++++---
 mm/migrate.c                      |   6 +-
 mm/page_alloc.c                   |   7 ++
 7 files changed, 225 insertions(+), 11 deletions(-)

Message ID	20181031160645.7633-6-mgorman@techsingularity.net (mailing list archive)
State	New, archived
Headers	show Return-Path: <owner-linux-mm@kvack.org> Received: from mail.wl.linuxfoundation.org (pdx-wl-mail.web.codeaurora.org [172.30.200.125]) by pdx-korg-patchwork-2.web.codeaurora.org (Postfix) with ESMTP id E722114DE for <patchwork-linux-mm@patchwork.kernel.org>; Wed, 31 Oct 2018 16:07:10 +0000 (UTC) Received: from mail.wl.linuxfoundation.org (localhost [127.0.0.1]) by mail.wl.linuxfoundation.org (Postfix) with ESMTP id D518E26D08 for <patchwork-linux-mm@patchwork.kernel.org>; Wed, 31 Oct 2018 16:07:10 +0000 (UTC) Received: by mail.wl.linuxfoundation.org (Postfix, from userid 486) id C637E29EB2; Wed, 31 Oct 2018 16:07:10 +0000 (UTC) X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on pdx-wl-mail.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-2.9 required=2.0 tests=BAYES_00,MAILING_LIST_MULTI, RCVD_IN_DNSWL_NONE autolearn=ham version=3.3.1 Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by mail.wl.linuxfoundation.org (Postfix) with ESMTP id 7A3B226D08 for <patchwork-linux-mm@patchwork.kernel.org>; Wed, 31 Oct 2018 16:07:09 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id C95B96B026B; Wed, 31 Oct 2018 12:06:50 -0400 (EDT) Delivered-To: linux-mm-outgoing@kvack.org Received: by kanga.kvack.org (Postfix, from userid 40) id C45A66B026E; Wed, 31 Oct 2018 12:06:50 -0400 (EDT) X-Original-To: int-list-linux-mm@kvack.org X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 9AEDF6B026F; Wed, 31 Oct 2018 12:06:50 -0400 (EDT) X-Original-To: linux-mm@kvack.org X-Delivered-To: linux-mm@kvack.org Received: from mail-ed1-f70.google.com (mail-ed1-f70.google.com [209.85.208.70]) by kanga.kvack.org (Postfix) with ESMTP id 1988F6B026A for <linux-mm@kvack.org>; Wed, 31 Oct 2018 12:06:50 -0400 (EDT) Received: by mail-ed1-f70.google.com with SMTP id x1-v6so10976061eds.16 for <linux-mm@kvack.org>; Wed, 31 Oct 2018 09:06:50 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-original-authentication-results:x-gm-message-state:from:to:cc :subject:date:message-id:in-reply-to:references; bh=GAUan2SuAD2Tp76K2SuQfT4XdYilWRU+Ql0ZD4Ov4r4=; b=kOq6EdJRde4Chui7NAOLvVsB8yz3iG72I+I/qO4qRTP2FVYXG1dkqu3Gj6crHono6F +JXXfogKE/tat5pgeZEdacAhR/DZxbSZY6rSSUPsZ77yjDAYtG6cnHKWr/xKIOeWrYd7 6L7AamcxUTkIZGQag19pvRYT6aHIC92u2YvkUHuEmJM8kB0reH2DWsOQhwgPXOpfmR46 tKcHJmbJjXWPCHVFS/Ut1NLbgvwi7oKoCQKaO9f3Irbwis02KncXmYHU+rgRiXJJpFGr QMDofLpKDHjB1cpNtDFqECmvmaaeI2UJ7ptPXzzjWuhG6yxSOPl0Lun/jyTsn0sQVDyS US9Q== X-Original-Authentication-Results: mx.google.com; spf=pass (google.com: domain of mgorman@techsingularity.net designates 81.17.249.35 as permitted sender) smtp.mailfrom=mgorman@techsingularity.net X-Gm-Message-State: AGRZ1gITJ/GpU/ejst3c/tStMSPCwSnfpNydzTfJ2kMRfVd6xvFQdlRS fQmMIzFHK0PEoSsyuGWPYYuR5JYb9AXL1Yfl005XSPNejL71NNxEEruwWbX0NireGcFd3laJ11s SYyndkN2KRzov3/Y8BX7Zuhq4tmT8/K1ybGIJ6uOYlKOk4D+dWsxn1UMpw3+wQpz4iw== X-Received: by 2002:a17:906:f14e:: with SMTP id gw14-v6mr1997760ejb.231.1541002009456; Wed, 31 Oct 2018 09:06:49 -0700 (PDT) X-Google-Smtp-Source: AJdET5eswuw5sKbzazoQZ3u5/sH2CC5wbCdOjIZn9ad2G9B+1xFhAi8RNVrDSTbW9R78S3fCRin7 X-Received: by 2002:a17:906:f14e:: with SMTP id gw14-v6mr1997669ejb.231.1541002007414; Wed, 31 Oct 2018 09:06:47 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1541002007; cv=none; d=google.com; s=arc-20160816; b=T+xoU0TwFRTbVI7lxNNAB+KzHtun60kmNHcdHyb6unFT1iblqqGC9YL/sA7aFnZoz6 sFNb+Rkqmr9BrP7CEJPNGuBiqaj8P+cJ6ocmWjdPC6GJdiYxBjyK1M62likzT8m/ZAT3 s1VV5EhabvqRycy3EDLkP+lGwzInCPXO+ULuFbqr1tE7jyFJNFU+b4hoPzXuiU7VVe+C U17siK6HHT9Y6CxuJkoC6cT95OG2sRWU5rU8SWG6ddo4S7lQvyvv8tJn0vw/wpbSh+k/ cAg/Y2DlK7AIUJAqvR3RyWgWeGJ9Kc/n5PCGgpT7WTYueBMMOV8bdQJO53QsKFvo9x2f YUOA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=references:in-reply-to:message-id:date:subject:cc:to:from; bh=GAUan2SuAD2Tp76K2SuQfT4XdYilWRU+Ql0ZD4Ov4r4=; b=j9A5zTl9wdgjbsmKYrUsGyPPjWUv2aXKktSDfonzHtxoWMho77LSiMY2PaxC9Jn9Ux LVWfsFGMS6HcbwKGTySRGcpiwTXeD3aJDwcbDZJPnNECobmYFaMs7KRhtSGSLV0TS66Y WHGV9fViweWWZALpU5GN0clJSfvY64SbhjrotQ35Hl5kHQwAT48tGPqtSPcBZuVMA8Df ihm02SkuSTbmD9/xl3enomRGrcSsYKsI48fXC6XUl9+wyGYxIFxjP7lIoVr06fm83aQ1 B+VfWBPV9HTeFnVUAhrjtOVx4VMdCFB/NphcsfD51xThr2ZIFdIkEhoOX0XwXq9Lefbr ++cA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of mgorman@techsingularity.net designates 81.17.249.35 as permitted sender) smtp.mailfrom=mgorman@techsingularity.net Received: from outbound-smtp04.blacknight.com (outbound-smtp04.blacknight.com. [81.17.249.35]) by mx.google.com with ESMTPS id a2si481537edv.415.2018.10.31.09.06.47 for <linux-mm@kvack.org> (version=TLS1 cipher=AES128-SHA bits=128/128); Wed, 31 Oct 2018 09:06:47 -0700 (PDT) Received-SPF: pass (google.com: domain of mgorman@techsingularity.net designates 81.17.249.35 as permitted sender) client-ip=81.17.249.35; Authentication-Results: mx.google.com; spf=pass (google.com: domain of mgorman@techsingularity.net designates 81.17.249.35 as permitted sender) smtp.mailfrom=mgorman@techsingularity.net Received: from mail.blacknight.com (pemlinmail04.blacknight.ie [81.17.254.17]) by outbound-smtp04.blacknight.com (Postfix) with ESMTPS id 1A24B9896F for <linux-mm@kvack.org>; Wed, 31 Oct 2018 16:06:47 +0000 (UTC) Received: (qmail 5659 invoked from network); 31 Oct 2018 16:06:47 -0000 Received: from unknown (HELO stampy.163woodhaven.lan) (mgorman@techsingularity.net@[37.228.229.142]) by 81.17.254.9 with ESMTPA; 31 Oct 2018 16:06:47 -0000 From: Mel Gorman <mgorman@techsingularity.net> To: Linux-MM <linux-mm@kvack.org> Cc: Andrew Morton <akpm@linux-foundation.org>, Vlastimil Babka <vbabka@suse.cz>, David Rientjes <rientjes@google.com>, Andrea Arcangeli <aarcange@redhat.com>, Zi Yan <zi.yan@cs.rutgers.edu>, LKML <linux-kernel@vger.kernel.org>, Mel Gorman <mgorman@techsingularity.net> Subject: [PATCH 5/5] mm: Target compaction on pageblocks that were recently fragmented Date: Wed, 31 Oct 2018 16:06:45 +0000 Message-Id: <20181031160645.7633-6-mgorman@techsingularity.net> X-Mailer: git-send-email 2.16.4 In-Reply-To: <20181031160645.7633-1-mgorman@techsingularity.net> References: <20181031160645.7633-1-mgorman@techsingularity.net> X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: <linux-mm.kvack.org> X-Virus-Scanned: ClamAV using ClamSMTP
Series	Fragmentation avoidance improvements \| expand [0/5] Fragmentation avoidance improvements [1/5] mm, page_alloc: Spread allocations across zones before introducing fragmentation [2/5] mm: Move zone watermark accesses behind an accessor [3/5] mm: Reclaim small amounts of memory when an external fragmentation event occurs [4/5] mm: Stall movable allocations until kswapd progresses during serious external fragmentation e… [5/5] mm: Target compaction on pageblocks that were recently fragmented

[5/5] mm: Target compaction on pageblocks that were recently fragmented

Commit Message

Patch