[088/178] NUMA balancing: reduce TLB flush via delaying mapping on hint page fault

From: Huang Ying <ying.huang@intel.com>

DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 960586147D
Date: Thu, 29 Apr 2021 22:57:41 -0700
From: Andrew Morton <akpm@linux-foundation.org>
To: akpm@linux-foundation.org, arjunroy@google.com, hannes@cmpxchg.org,
 kirill.shutemov@linux.intel.com, linux-mm@kvack.org, mgorman@suse.de,
 mm-commits@vger.kernel.org, peterx@redhat.com, peterz@infradead.org,
 torvalds@linux-foundation.org, vbabka@suse.cz, walken@google.com,
 will@kernel.org, willy@infradead.org, ying.huang@intel.com
Subject: [patch 088/178] NUMA balancing: reduce TLB flush via
 delaying mapping on hint page fault
Message-ID: <20210430055741.u1pjk2j5l%akpm@linux-foundation.org>
In-Reply-To: <20210429225251.02b6386d21b69255b4f6c163@linux-foundation.org>
User-Agent: s-nail v14.8.16
Received-SPF: none (linux-foundation.org>: No applicable sender policy
 available) receiver=imf22; identity=mailfrom;
 envelope-from="<akpm@linux-foundation.org>"; helo=mail.kernel.org;
 client-ip=198.145.29.99
Sender: owner-linux-mm@kvack.org
Precedence: bulk

Message ID	20210430055741.u1pjk2j5l%akpm@linux-foundation.org (mailing list archive)
State	New, archived
Headers	show Return-Path: <SRS0=xn6i=J3=kvack.org=owner-linux-mm@kernel.org> DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 960586147D Date: Thu, 29 Apr 2021 22:57:41 -0700 From: Andrew Morton <akpm@linux-foundation.org> To: akpm@linux-foundation.org, arjunroy@google.com, hannes@cmpxchg.org, kirill.shutemov@linux.intel.com, linux-mm@kvack.org, mgorman@suse.de, mm-commits@vger.kernel.org, peterx@redhat.com, peterz@infradead.org, torvalds@linux-foundation.org, vbabka@suse.cz, walken@google.com, will@kernel.org, willy@infradead.org, ying.huang@intel.com Subject: [patch 088/178] NUMA balancing: reduce TLB flush via delaying mapping on hint page fault Message-ID: <20210430055741.u1pjk2j5l%akpm@linux-foundation.org> In-Reply-To: <20210429225251.02b6386d21b69255b4f6c163@linux-foundation.org> User-Agent: s-nail v14.8.16 Received-SPF: none (linux-foundation.org>: No applicable sender policy available) receiver=imf22; identity=mailfrom; envelope-from="<akpm@linux-foundation.org>"; helo=mail.kernel.org; client-ip=198.145.29.99 Sender: owner-linux-mm@kvack.org Precedence: bulk
Series	[001/178] arch/ia64/kernel/head.S: remove duplicate include \| expand [001/178] arch/ia64/kernel/head.S: remove duplicate include [002/178] arch/ia64/kernel/fsys.S: fix typos [003/178] arch/ia64/include/asm/pgtable.h: minor typo fixes [004/178] ia64: ensure proper NUMA distance and possible map initialization [005/178] ia64: drop unused IA64_FW_EMU ifdef [006/178] ia64: simplify code flow around swiotlb init [007/178] ia64: trivial spelling fixes [008/178] ia64: fix EFI_DEBUG build [009/178] ia64: mca: always make IA64_MCA_DEBUG an expression [010/178] ia64: drop marked broken DISCONTIGMEM and VIRTUAL_MEM_MAP [011/178] ia64: module: fix symbolizer crash on fdescr [012/178] include/linux/compiler-gcc.h: sparse can do constant folding of __builtin_bswap() [013/178] scripts/spelling.txt: add entries for recent discoveries [014/178] scripts: a new script for checking duplicate struct declaration [015/178] arch/sh/include/asm/tlb.h: remove duplicate include [016/178] ocfs2: replace DEFINE_SIMPLE_ATTRIBUTE with DEFINE_DEBUGFS_ATTRIBUTE [017/178] ocfs2: map flags directly in flags_to_o2dlm() [018/178] ocfs2: fix a typo [019/178] ocfs2/dlm: remove unused function [020/178] kfifo: fix ternary sign extension bugs [021/178] vfs: fs_parser: clean up kernel-doc warnings [022/178] watchdog: rename __touch_watchdog() to a better descriptive name [023/178] watchdog: explicitly update timestamp when reporting softlockup [024/178] watchdog/softlockup: report the overall time of softlockups [025/178] watchdog/softlockup: remove logic that tried to prevent repeated reports [026/178] watchdog: fix barriers when printing backtraces from all CPUs [027/178] watchdog: cleanup handling of false positives [028/178] mm/slab_common: provide "slab_merge" option for !IS_ENABLED(CONFIG_SLAB_MERGE_DEFAULT) bu… [029/178] mm, slub: enable slub_debug static key when creating cache with explicit debug flags [030/178] kunit: add a KUnit test for SLUB debugging functionality [031/178] slub: remove resiliency_test() function [032/178] mm/slub.c: trivial typo fixes [033/178] mm/kmemleak.c: fix a typo [034/178] mm/page_owner: record the timestamp of all pages during free [035/178] mm, page_owner: remove unused parameter in __set_page_owner_handle [036/178] mm: page_owner: fetch backtrace only for tracked pages [037/178] mm: page_owner: use kstrtobool() to parse bool option [038/178] mm: page_owner: detect page_owner recursion via task_struct [039/178] mm: page_poison: print page info when corruption is caught [040/178] mm/memtest: add ARCH_USE_MEMTEST [041/178] mm: provide filemap_range_needs_writeback() helper [042/178] mm: use filemap_range_needs_writeback() for O_DIRECT reads [043/178] iomap: use filemap_range_needs_writeback() for O_DIRECT reads [044/178] mm/filemap: use filemap_read_page in filemap_fault [045/178] mm/filemap: drop check for truncated page after I/O [046/178] mm: page-writeback: simplify memcg handling in test_clear_page_writeback() [047/178] mm: move page_mapping_file to pagemap.h [048/178] mm/filemap: update stale comment [049/178] mm/msync: exit early when the flags is an MS_ASYNC and start < vm_start [050/178] mm/gup: add compound page list iterator [051/178] mm/gup: decrement head page once for group of subpages [052/178] mm/gup: add a range variant of unpin_user_pages_dirty_lock() [053/178] RDMA/umem: batch page unpin in __ib_umem_release() [054/178] mm: gup: remove FOLL_SPLIT [055/178] mm/memremap.c: fix improper SPDX comment style [056/178] mm: memcontrol: fix kernel stack account [057/178] memcg: cleanup root memcg checks [058/178] memcg: enable memcg oom-kill for __GFP_NOFAIL [059/178] mm: memcontrol: fix cpuhotplug statistics flushing [060/178] mm: memcontrol: kill mem_cgroup_nodeinfo() [061/178] mm: memcontrol: privatize memcg_page_state query functions [062/178] cgroup: rstat: support cgroup1 [063/178] cgroup: rstat: punt root-level optimization to individual controllers [064/178] mm: memcontrol: switch to rstat [065/178] mm: memcontrol: consolidate lruvec stat flushing [066/178] kselftests: cgroup: update kmem test for new vmstat implementation [067/178] memcg: charge before adding to swapcache on swapin [068/178] mm: memcontrol: slab: fix obtain a reference to a freeing memcg [069/178] mm: memcontrol: introduce obj_cgroup_{un}charge_pages [070/178] mm: memcontrol: directly access page->memcg_data in mm/page_alloc.c [071/178] mm: memcontrol: change ug->dummy_page only if memcg changed [072/178] mm: memcontrol: use obj_cgroup APIs to charge kmem pages [073/178] mm: memcontrol: inline __memcg_kmem_{un}charge() into obj_cgroup_{un}charge_pages() [074/178] mm: memcontrol: move PageMemcgKmem to the scope of CONFIG_MEMCG_KMEM [075/178] linux/memcontrol.h: remove duplicate struct declaration [076/178] mm: page_counter: mitigate consequences of a page_counter underflow [077/178] mm/memory.c: do_numa_page(): delete bool "migrated" [078/178] mm/interval_tree: add comments to improve code readability [079/178] x86/vmemmap: drop handling of 4K unaligned vmemmap range [080/178] x86/vmemmap: drop handling of 1GB vmemmap ranges [081/178] x86/vmemmap: handle unpopulated sub-pmd ranges [082/178] x86/vmemmap: optimize for consecutive sections in partial populated PMDs [083/178] mm, tracing: improve rss_stat tracepoint message [084/178] mm: add remap_pfn_range_notrack [085/178] mm: add a io_mapping_map_user helper [086/178] i915: use io_mapping_map_user [087/178] i915: fix remap_io_sg to verify the pgprot [088/178] NUMA balancing: reduce TLB flush via delaying mapping on hint page fault [089/178] mm: extend MREMAP_DONTUNMAP to non-anonymous mappings [090/178] Revert "mremap: don't allow MREMAP_DONTUNMAP on special_mappings and aio" [091/178] selftests: add a MREMAP_DONTUNMAP selftest for shmem [092/178] mm/dmapool: switch from strlcpy to strscpy [093/178] mm/sparse: add the missing sparse_buffer_fini() in error branch [094/178] samples/vfio-mdev/mdpy: use remap_vmalloc_range [095/178] mm: unexport remap_vmalloc_range_partial [096/178] mm/vmalloc: use rb_tree instead of list for vread() lookups [097/178] ARM: mm: add missing pud_page define to 2-level page tables [098/178] mm/vmalloc: fix HUGE_VMAP regression by enabling huge pages in vmalloc_to_page [099/178] mm: apply_to_pte_range warn and fail if a large pte is encountered [100/178] mm/vmalloc: rename vmap__range vmap_pages__range [101/178] mm/ioremap: rename ioremap__range to vmap_*_range [102/178] mm: HUGE_VMAP arch support cleanup [103/178] powerpc: inline huge vmap supported functions [104/178] arm64: inline huge vmap supported functions [105/178] x86: inline huge vmap supported functions [106/178] mm/vmalloc: provide fallback arch huge vmap support functions [107/178] mm: move vmap_range from mm/ioremap.c to mm/vmalloc.c [108/178] mm/vmalloc: add vmap_range_noflush variant [109/178] mm/vmalloc: hugepage vmalloc mappings [110/178] mm/vmalloc: remove map_kernel_range [111/178] kernel/dma: remove unnecessary unmap_kernel_range [112/178] powerpc/xive: remove unnecessary unmap_kernel_range [113/178] mm/vmalloc: remove unmap_kernel_range [114/178] mm/vmalloc: improve allocation failure error messages [115/178] mm: vmalloc: prevent use after free in _vm_unmap_aliases [116/178] lib/test_vmalloc.c: remove two kvfree_rcu() tests [117/178] lib/test_vmalloc.c: add a new 'nr_threads' parameter [118/178] vm/test_vmalloc.sh: adapt for updated driver interface [119/178] mm/vmalloc: refactor the preloading loagic [120/178] mm/vmalloc: remove an empty line [121/178] mm/doc: fix fault_flag_allow_retry_first kerneldoc [122/178] mm/doc: fix page_maybe_dma_pinned kerneldoc [123/178] mm/doc: turn fault flags into an enum [124/178] mm/doc: add mm.h and mm_types.h to the mm-api document [125/178] MAINTAINERS: assign pagewalk.h to MEMORY MANAGEMENT [126/178] pagewalk: prefix struct kernel-doc descriptions [127/178] mm/kasan: switch from strlcpy to strscpy [128/178] kasan: fix kasan_byte_accessible() to be consistent with actual checks [129/178] kasan: initialize shadow to TAG_INVALID for SW_TAGS [130/178] mm, kasan: don't poison boot memory with tag-based modes [131/178] arm64: kasan: allow to init memory when setting tags [132/178] kasan: init memory in kasan_(un)poison for HW_TAGS [133/178] kasan, mm: integrate page_alloc init with HW_TAGS [134/178] kasan, mm: integrate slab init_on_alloc with HW_TAGS [135/178] kasan, mm: integrate slab init_on_free with HW_TAGS [136/178] kasan: docs: clean up sections [137/178] kasan: docs: update overview section [138/178] kasan: docs: update usage section [139/178] kasan: docs: update error reports section [140/178] kasan: docs: update boot parameters section [141/178] kasan: docs: update GENERIC implementation details section [142/178] kasan: docs: update SW_TAGS implementation details section [143/178] kasan: docs: update HW_TAGS implementation details section [144/178] kasan: docs: update shadow memory section [145/178] kasan: docs: update ignoring accesses section [146/178] kasan: docs: update tests section [147/178] kasan: record task_work_add() call stack [148/178] kasan: detect false-positives in tests [149/178] irq_work: record irq_work_queue() call stack [150/178] mm: move mem_init_print_info() into mm_init() [151/178] mm/page_alloc: drop pr_info_ratelimited() in alloc_contig_range() [152/178] mm: remove lru_add_drain_all in alloc_contig_range [153/178] include/linux/page-flags-layout.h: correctly determine LAST_CPUPID_WIDTH [154/178] include/linux/page-flags-layout.h: cleanups [155/178] mm/page_alloc: rename alloc_mask to alloc_gfp [156/178] mm/page_alloc: rename gfp_mask to gfp [157/178] mm/page_alloc: combine __alloc_pages and __alloc_pages_nodemask [158/178] mm/mempolicy: rename alloc_pages_current to alloc_pages [159/178] mm/mempolicy: rewrite alloc_pages documentation [160/178] mm/mempolicy: rewrite alloc_pages_vma documentation [161/178] mm/mempolicy: fix mpol_misplaced kernel-doc [162/178] mm: page_alloc: dump migrate-failed pages [163/178] mm/Kconfig: remove default DISCONTIGMEM_MANUAL [164/178] mm, page_alloc: avoid page_to_pfn() in move_freepages() [165/178] mm/page_alloc: duplicate include linux/vmalloc.h [166/178] mm/page_alloc: rename alloced to allocated [167/178] mm/page_alloc: add a bulk page allocator [168/178] mm/page_alloc: add an array-based interface to the bulk page allocator [169/178] mm/page_alloc: optimize code layout for __alloc_pages_bulk [170/178] mm/page_alloc: inline __rmqueue_pcplist [171/178] SUNRPC: set rq_page_end differently [172/178] SUNRPC: refresh rq_pages using a bulk page allocator [173/178] net: page_pool: refactor dma_map into own function page_pool_dma_map [174/178] net: page_pool: use alloc_pages_bulk in refill code path [175/178] mm: page_alloc: ignore init_on_free=1 for debug_pagealloc=1 [176/178] mm/page_alloc: redundant definition variables of pfn in for loop [177/178] mm/mmzone.h: fix existing kernel-doc comments and link them to core-api [178/178] mm/memory-failure: unnecessary amount of unmapping

[088/178] NUMA balancing: reduce TLB flush via delaying mapping on hint page fault

Commit Message

Patch