From patchwork Mon Jun 14 20:14:02 2021 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Matthew Wilcox X-Patchwork-Id: 12319733 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-16.6 required=3.0 tests=BAYES_00,DKIM_INVALID, DKIM_SIGNED,HEADER_FROM_DIFFERENT_DOMAINS,INCLUDES_PATCH,MAILING_LIST_MULTI, MENTIONS_GIT_HOSTING,SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED,USER_AGENT_GIT autolearn=unavailable autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 9A733C2B9F4 for ; Mon, 14 Jun 2021 20:15:20 +0000 (UTC) Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by mail.kernel.org (Postfix) with ESMTP id 1BF9C6134F for ; Mon, 14 Jun 2021 20:15:20 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 1BF9C6134F Authentication-Results: mail.kernel.org; dmarc=none (p=none dis=none) header.from=infradead.org Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=owner-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix) id A53FE6B006C; Mon, 14 Jun 2021 16:15:19 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id A05426B006E; Mon, 14 Jun 2021 16:15:19 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 8A4B96B0070; Mon, 14 Jun 2021 16:15:19 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from forelay.hostedemail.com (smtprelay0146.hostedemail.com [216.40.44.146]) by kanga.kvack.org (Postfix) with ESMTP id 584276B006C for ; Mon, 14 Jun 2021 16:15:19 -0400 (EDT) Received: from smtpin36.hostedemail.com (10.5.19.251.rfc1918.com [10.5.19.251]) by forelay03.hostedemail.com (Postfix) with ESMTP id D56D18249980 for ; Mon, 14 Jun 2021 20:15:18 +0000 (UTC) X-FDA: 78253433916.36.2FCC184 Received: from casper.infradead.org (casper.infradead.org [90.155.50.34]) by imf06.hostedemail.com (Postfix) with ESMTP id 469AEC0091AE for ; Mon, 14 Jun 2021 20:15:08 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=infradead.org; s=casper.20170209; h=Content-Transfer-Encoding:MIME-Version: Message-Id:Date:Subject:Cc:To:From:Sender:Reply-To:Content-Type:Content-ID: Content-Description:In-Reply-To:References; bh=T+uDU9PS2lm9skXG3INq+pyo45zjQkF1zuHsAD6rWAU=; b=LL8upfxRi7VG0e5aQTrl3u+WdI g/I0PILX90LqfpgvtIFv0iAMkFOknPvU0hOTluqyRIpTFWBTAQ0ZExWe1Tj012RTWEsW4JzR4srtL bwsjTgThSgJXkJJIlpTulwc67e2CcZv3bMH9gCEkHf9/XIWiO5WGzmHBLO25weovUEzlKSoPioP1G Ix8r1RGGhWJEWY1hCjdYjIoOvqulexXCEAoVzYf/ZndhIkzshbUWcvKyKFqZmpqHHQdy7dpKXpbHd X/yhfL1fdkZfgqTu2HuIRPrEIM2R2wfgou/DWKe+A/PswLQLSmPt9IkrL+rsZTqWCfdXRMtYoawaq RQPrVanA==; Received: from willy by casper.infradead.org with local (Exim 4.94 #2 (Red Hat Linux)) id 1lssyi-005mnj-5m; Mon, 14 Jun 2021 20:14:41 +0000 From: "Matthew Wilcox (Oracle)" To: akpm@linux-foundation.org Cc: "Matthew Wilcox (Oracle)" , linux-fsdevel@vger.kernel.org, linux-mm@kvack.org, linux-kernel@vger.kernel.org Subject: [PATCH v11 00/33] Memory folios Date: Mon, 14 Jun 2021 21:14:02 +0100 Message-Id: <20210614201435.1379188-1-willy@infradead.org> X-Mailer: git-send-email 2.30.2 MIME-Version: 1.0 Authentication-Results: imf06.hostedemail.com; dkim=pass header.d=infradead.org header.s=casper.20170209 header.b=LL8upfxR; dmarc=none; spf=none (imf06.hostedemail.com: domain of willy@infradead.org has no SPF policy when checking 90.155.50.34) smtp.mailfrom=willy@infradead.org X-Stat-Signature: d8ee6bjef1mwnjepe4usseubdrmbrcmi X-Rspamd-Server: rspam04 X-Rspamd-Queue-Id: 469AEC0091AE X-HE-Tag: 1623701708-821892 X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: Managing memory in 4KiB pages is a serious overhead. Many benchmarks benefit from a larger "page size". As an example, an earlier iteration of this idea which used compound pages (and wasn't particularly tuned) got a 7% performance boost when compiling the kernel. Using compound pages or THPs exposes a weakness of our type system. Functions are often unprepared for compound pages to be passed to them, and may only act on PAGE_SIZE chunks. Even functions which are aware of compound pages may expect a head page, and do the wrong thing if passed a tail page. We also waste a lot of instructions ensuring that we're not looking at a tail page. Almost every call to PageFoo() contains one or more hidden calls to compound_head(). This also happens for get_page(), put_page() and many more functions. There does not appear to be a way to tell gcc that it can cache the result of compound_head(), nor is there a way to tell it that compound_head() is idempotent. This patch series uses a new type, the struct folio, to manage memory. It provides some basic infrastructure that's worthwhile in its own right, shrinking the kernel by about 6kB of text. The full patch series is considerably larger (~200 patches), and enables XFS to use large pages. It can be found at https://git.infradead.org/users/willy/pagecache.git/shortlog/refs/heads/folio (not everything there is in good shape for upstream submission, but if you go as far as "mm/readahead: Add multi-page folio readahead", it passes xfstests). An earlier version of this patch set found it was worth about a 7% reduction of wall-clock time on kernel compiles. Since v11: - Rebase onto 5.13-rc4 plus eight patches from me currently in -mm. - Remove thp_head() (Vlastimil) - Add folio_memcg_rcu() - Make mem_cgroup_folio_lruvec() follow the new calling convention that's in mmotm, even though it's not upstream yet. - Make get_page_unless_zero() return bool. - Add folio_zonenum() - Add __folio_clear_lru_flags() - Add folio_lru_list() - Add folio_add_to_lru_list() and folio_del_from_lru_list() - Make add_page_to_lru_list() and add_page_to_lru_list_tail() work on compound pages of arbitrary order. - Add folio_mapcount_ptr() (Christoph) - Improve the comment on folio_page() (William) - Change indentation for PageFoo() (Vlastimil) - Fix folio_file_page() for HugeTLBfs (Vlastimil) - Remove externs from folio_wait_bit() - Remove set_page_private_2() - Add check that PG_locked is in the first byte of the flags - Rewrap some comments to avoid line length problems - Convert pagevec_move_tail_fn() to use a folio internally - Drop folio idle/young conversion from this patchset v10: https://lore.kernel.org/linux-mm/20210511214735.1836149-1-willy@infradead.org/ v9: https://lore.kernel.org/linux-mm/20210505150628.111735-1-willy@infradead.org/ v8: https://lore.kernel.org/linux-mm/20210430180740.2707166-1-willy@infradead.org/ Matthew Wilcox (Oracle) (33): mm: Convert get_page_unless_zero() to return bool mm: Introduce struct folio mm: Add folio_pgdat(), folio_zone() and folio_zonenum() mm/vmstat: Add functions to account folio statistics mm/debug: Add VM_BUG_ON_FOLIO() and VM_WARN_ON_ONCE_FOLIO() mm: Add folio reference count functions mm: Add folio_put() mm: Add folio_get() mm: Add folio_try_get_rcu() mm: Add folio flag manipulation functions mm/lru: Add folio LRU functions mm: Handle per-folio private data mm/filemap: Add folio_index(), folio_file_page() and folio_contains() mm/filemap: Add folio_next_index() mm/filemap: Add folio_offset() and folio_file_offset() mm/util: Add folio_mapping() and folio_file_mapping() mm/memcg: Add folio wrappers for various functions mm/filemap: Add folio_unlock() mm/filemap: Add folio_lock() mm/filemap: Add folio_lock_killable() mm/filemap: Add __folio_lock_async() mm/filemap: Add folio_wait_locked() mm/filemap: Add __folio_lock_or_retry() mm/swap: Add folio_rotate_reclaimable() mm/filemap: Add folio_end_writeback() mm/writeback: Add folio_wait_writeback() mm/writeback: Add folio_wait_stable() mm/filemap: Add folio_wait_bit() mm/filemap: Add folio_wake_bit() mm/filemap: Convert page wait queues to be folios mm/filemap: Add folio private_2 functions fs/netfs: Add folio fscache functions mm: Add folio_mapped() Documentation/core-api/mm-api.rst | 4 + Documentation/filesystems/netfs_library.rst | 2 + fs/afs/write.c | 9 +- fs/cachefiles/rdwr.c | 16 +- fs/io_uring.c | 2 +- include/linux/huge_mm.h | 15 - include/linux/memcontrol.h | 72 ++++ include/linux/mm.h | 165 +++++++-- include/linux/mm_inline.h | 85 +++-- include/linux/mm_types.h | 77 ++++ include/linux/mmdebug.h | 20 ++ include/linux/netfs.h | 77 ++-- include/linux/page-flags.h | 245 +++++++++---- include/linux/page_ref.h | 158 +++++++- include/linux/pagemap.h | 377 +++++++++++--------- include/linux/swap.h | 7 +- include/linux/vmstat.h | 107 ++++++ mm/Makefile | 2 +- mm/filemap.c | 321 +++++++++-------- mm/folio-compat.c | 43 +++ mm/internal.h | 1 + mm/memory.c | 8 +- mm/page-writeback.c | 72 ++-- mm/page_io.c | 4 +- mm/swap.c | 30 +- mm/swapfile.c | 8 +- mm/util.c | 59 +-- 27 files changed, 1411 insertions(+), 575 deletions(-) create mode 100644 mm/folio-compat.c