From patchwork Mon Oct 21 17:34:55 2024
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: Roman Gushchin
X-Patchwork-Id: 13844473
From: Roman Gushchin
To: Andrew Morton
Cc: linux-mm@kvack.org, Vlastimil Babka, linux-kernel@vger.kernel.org,
 Roman Gushchin, stable@vger.kernel.org, Hugh Dickins, Matthew Wilcox
Subject: [PATCH v2] mm: page_alloc: move mlocked flag clearance into free_pages_prepare()
Date: Mon, 21 Oct 2024 17:34:55 +0000
Message-ID: <20241021173455.2691973-1-roman.gushchin@linux.dev>
Sender: owner-linux-mm@kvack.org
Precedence: bulk

Syzbot reported a bad page state problem caused by a page that was freed
using free_page() but still had the mlocked flag set at the
free_pages_prepare() stage:

  BUG: Bad page state in process syz.0.15 pfn:1137bb
  page: refcount:0 mapcount:0 mapping:0000000000000000 index:0xffff8881137bb870 pfn:0x1137bb
  flags: 0x400000000080000(mlocked|node=0|zone=1)
  raw: 0400000000080000 0000000000000000 dead000000000122 0000000000000000
  raw: ffff8881137bb870 0000000000000000 00000000ffffffff 0000000000000000
  page dumped because: PAGE_FLAGS_CHECK_AT_FREE flag(s) set
  page_owner tracks the page as allocated
  page last allocated via order 0, migratetype Unmovable, gfp_mask 0x400dc0(GFP_KERNEL_ACCOUNT|__GFP_ZERO), pid 3005, tgid 3004 (syz.0.15), ts 61546 608067, free_ts 61390082085
   set_page_owner include/linux/page_owner.h:32 [inline]
   post_alloc_hook+0x1f3/0x230 mm/page_alloc.c:1537
   prep_new_page mm/page_alloc.c:1545 [inline]
   get_page_from_freelist+0x3008/0x31f0 mm/page_alloc.c:3457
   __alloc_pages_noprof+0x292/0x7b0 mm/page_alloc.c:4733
   alloc_pages_mpol_noprof+0x3e8/0x630 mm/mempolicy.c:2265
   kvm_coalesced_mmio_init+0x1f/0xf0 virt/kvm/coalesced_mmio.c:99
   kvm_create_vm virt/kvm/kvm_main.c:1235 [inline]
   kvm_dev_ioctl_create_vm virt/kvm/kvm_main.c:5500 [inline]
   kvm_dev_ioctl+0x13bb/0x2320 virt/kvm/kvm_main.c:5542
   vfs_ioctl fs/ioctl.c:51 [inline]
   __do_sys_ioctl fs/ioctl.c:907 [inline]
   __se_sys_ioctl+0xf9/0x170 fs/ioctl.c:893
   do_syscall_x64 arch/x86/entry/common.c:52 [inline]
   do_syscall_64+0x69/0x110 arch/x86/entry/common.c:83
   entry_SYSCALL_64_after_hwframe+0x76/0x7e
  page last free pid 951 tgid 951 stack trace:
   reset_page_owner include/linux/page_owner.h:25 [inline]
   free_pages_prepare mm/page_alloc.c:1108 [inline]
   free_unref_page+0xcb1/0xf00 mm/page_alloc.c:2638
   vfree+0x181/0x2e0 mm/vmalloc.c:3361
   delayed_vfree_work+0x56/0x80 mm/vmalloc.c:3282
   process_one_work kernel/workqueue.c:3229 [inline]
   process_scheduled_works+0xa5c/0x17a0 kernel/workqueue.c:3310
   worker_thread+0xa2b/0xf70 kernel/workqueue.c:3391
   kthread+0x2df/0x370 kernel/kthread.c:389
   ret_from_fork+0x4b/0x80 arch/x86/kernel/process.c:147
   ret_from_fork_asm+0x1a/0x30 arch/x86/entry/entry_64.S:244

A reproducer is available here:
https://syzkaller.appspot.com/x/repro.c?x=1437939f980000

The problem was originally introduced by commit b109b87050df
("mm/munlock: replace clear_page_mlock() by final clearance"): it was
focused on handling pagecache and anonymous memory and wasn't suitable
for lower-level get_page()/free_page() APIs used, for example, by KVM,
as with this reproducer.

Fix it by moving the mlocked flag clearance down to
free_pages_prepare().

The bug itself is fairly old and harmless (aside from generating these
warnings).

Closes: https://syzkaller.appspot.com/x/report.txt?x=169a47d0580000
Fixes: b109b87050df ("mm/munlock: replace clear_page_mlock() by final clearance")
Signed-off-by: Roman Gushchin
Cc:
Cc: Hugh Dickins
Cc: Matthew Wilcox
Cc: Vlastimil Babka
Acked-by: Hugh Dickins
---
 mm/page_alloc.c | 15 +++++++++++++++
 mm/swap.c       | 14 --------------
 2 files changed, 15 insertions(+), 14 deletions(-)

diff --git a/mm/page_alloc.c b/mm/page_alloc.c
index bc55d39eb372..7535d78862ab 100644
--- a/mm/page_alloc.c
+++ b/mm/page_alloc.c
@@ -1044,6 +1044,7 @@ __always_inline bool free_pages_prepare(struct page *page,
 	bool skip_kasan_poison = should_skip_kasan_poison(page);
 	bool init = want_init_on_free();
 	bool compound = PageCompound(page);
+	struct folio *folio = page_folio(page);
 
 	VM_BUG_ON_PAGE(PageTail(page), page);
 
@@ -1053,6 +1054,20 @@ __always_inline bool free_pages_prepare(struct page *page,
 	if (memcg_kmem_online() && PageMemcgKmem(page))
 		__memcg_kmem_uncharge_page(page, order);
 
+	/*
+	 * In rare cases, when truncation or holepunching raced with
+	 * munlock after VM_LOCKED was cleared, Mlocked may still be
+	 * found set here. This does not indicate a problem, unless
+	 * "unevictable_pgs_cleared" appears worryingly large.
+	 */
+	if (unlikely(folio_test_mlocked(folio))) {
+		long nr_pages = folio_nr_pages(folio);
+
+		__folio_clear_mlocked(folio);
+		zone_stat_mod_folio(folio, NR_MLOCK, -nr_pages);
+		count_vm_events(UNEVICTABLE_PGCLEARED, nr_pages);
+	}
+
 	if (unlikely(PageHWPoison(page)) && !order) {
 		/* Do not let hwpoison pages hit pcplists/buddy */
 		reset_page_owner(page, order);
diff --git a/mm/swap.c b/mm/swap.c
index 835bdf324b76..7cd0f4719423 100644
--- a/mm/swap.c
+++ b/mm/swap.c
@@ -78,20 +78,6 @@ static void __page_cache_release(struct folio *folio, struct lruvec **lruvecp,
 		lruvec_del_folio(*lruvecp, folio);
 		__folio_clear_lru_flags(folio);
 	}
-
-	/*
-	 * In rare cases, when truncation or holepunching raced with
-	 * munlock after VM_LOCKED was cleared, Mlocked may still be
-	 * found set here. This does not indicate a problem, unless
-	 * "unevictable_pgs_cleared" appears worryingly large.
-	 */
-	if (unlikely(folio_test_mlocked(folio))) {
-		long nr_pages = folio_nr_pages(folio);
-
-		__folio_clear_mlocked(folio);
-		zone_stat_mod_folio(folio, NR_MLOCK, -nr_pages);
-		count_vm_events(UNEVICTABLE_PGCLEARED, nr_pages);
-	}
 }
 
 /*