From patchwork Fri Feb 24 14:11:43 2023
X-Patchwork-Submitter: "Huang, Ying"
X-Patchwork-Id: 13151287
From: Huang Ying
To: Andrew Morton
Cc: linux-mm@kvack.org, linux-kernel@vger.kernel.org, Huang Ying,
 Hugh Dickins, "Xu, Pengfei", Christoph Hellwig, Stefan Roesch, Tejun Heo,
 Xin Hao, Zi Yan, Yang Shi, Baolin Wang, Matthew Wilcox, Mike Kravetz
Subject: [PATCH 1/3] migrate_pages: fix deadlock in batched migration
Date: Fri, 24 Feb 2023 22:11:43 +0800
Message-Id: <20230224141145.96814-2-ying.huang@intel.com>
X-Mailer: git-send-email 2.39.1
In-Reply-To: <20230224141145.96814-1-ying.huang@intel.com>
References: <20230224141145.96814-1-ying.huang@intel.com>

Two deadlock bugs were reported for the migrate_pages() batching
series.  Thanks Hugh and Pengfei!  For example, in the following
deadlock trace snippet,

 INFO: task kworker/u4:0:9 blocked for more than 147 seconds.
       Not tainted 6.2.0-rc4-kvm+ #1314
 "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
 task:kworker/u4:0 state:D stack:0 pid:9 ppid:2 flags:0x00004000
 Workqueue: loop4 loop_rootcg_workfn
 Call Trace:
  __schedule+0x43b/0xd00
  schedule+0x6a/0xf0
  io_schedule+0x4a/0x80
  folio_wait_bit_common+0x1b5/0x4e0
  ? __pfx_wake_page_function+0x10/0x10
  __filemap_get_folio+0x73d/0x770
  shmem_get_folio_gfp+0x1fd/0xc80
  shmem_write_begin+0x91/0x220
  generic_perform_write+0x10e/0x2e0
  __generic_file_write_iter+0x17e/0x290
  ? generic_write_checks+0x12b/0x1a0
  generic_file_write_iter+0x97/0x180
  ? __sanitizer_cov_trace_const_cmp4+0x1a/0x20
  do_iter_readv_writev+0x13c/0x210
  ? __sanitizer_cov_trace_const_cmp4+0x1a/0x20
  do_iter_write+0xf6/0x330
  vfs_iter_write+0x46/0x70
  loop_process_work+0x723/0xfe0
  loop_rootcg_workfn+0x28/0x40
  process_one_work+0x3cc/0x8d0
  worker_thread+0x66/0x630
  ? __pfx_worker_thread+0x10/0x10
  kthread+0x153/0x190
  ? __pfx_kthread+0x10/0x10
  ret_from_fork+0x29/0x50

 INFO: task repro:1023 blocked for more than 147 seconds.
       Not tainted 6.2.0-rc4-kvm+ #1314
 "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
 task:repro state:D stack:0 pid:1023 ppid:360 flags:0x00004004
 Call Trace:
  __schedule+0x43b/0xd00
  schedule+0x6a/0xf0
  io_schedule+0x4a/0x80
  folio_wait_bit_common+0x1b5/0x4e0
  ? compaction_alloc+0x77/0x1150
  ? __pfx_wake_page_function+0x10/0x10
  folio_wait_bit+0x30/0x40
  folio_wait_writeback+0x2e/0x1e0
  migrate_pages_batch+0x555/0x1ac0
  ? __pfx_compaction_alloc+0x10/0x10
  ? __pfx_compaction_free+0x10/0x10
  ? __this_cpu_preempt_check+0x17/0x20
  ? lock_is_held_type+0xe6/0x140
  migrate_pages+0x100e/0x1180
  ? __pfx_compaction_free+0x10/0x10
  ? __pfx_compaction_alloc+0x10/0x10
  compact_zone+0xe10/0x1b50
  ? lock_is_held_type+0xe6/0x140
  ? check_preemption_disabled+0x80/0xf0
  compact_node+0xa3/0x100
  ? __sanitizer_cov_trace_const_cmp8+0x1c/0x30
  ? _find_first_bit+0x7b/0x90
  sysctl_compaction_handler+0x5d/0xb0
  proc_sys_call_handler+0x29d/0x420
  proc_sys_write+0x2b/0x40
  vfs_write+0x3a3/0x780
  ksys_write+0xb7/0x180
  __x64_sys_write+0x26/0x30
  do_syscall_64+0x3b/0x90
  entry_SYSCALL_64_after_hwframe+0x72/0xdc
 RIP: 0033:0x7f3a2471f59d
 RSP: 002b:00007ffe567f7288 EFLAGS: 00000217 ORIG_RAX: 0000000000000001
 RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007f3a2471f59d
 RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000005
 RBP: 00007ffe567f72a0 R08: 0000000000000010 R09: 0000000000000010
 R10: 0000000000000010 R11: 0000000000000217 R12: 00000000004012e0
 R13: 00007ffe567f73e0 R14: 0000000000000000 R15: 0000000000000000

The page migration task holds the lock of shmem folio A and is
waiting for the writeback of folio B, which belongs to the file system
on the loop block device, to complete.  Meanwhile, the loop worker task
that writes back folio B is waiting to lock shmem folio A, because
folio A backs folio B in the loop device.  Thus a deadlock is
triggered.

In general, if we have locked folios other than the one we are
migrating, it is not safe to wait synchronously, for example, to wait
for writeback to complete or to wait to lock a buffer head.
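
The rule in the paragraph above can be illustrated outside the kernel.
What follows is only a minimal userspace sketch, assuming POSIX threads;
lock_for_batch() and its nr_held parameter are hypothetical names used
for illustration and are not kernel APIs.  The helper sleeps on a lock
only when the caller holds nothing else, and otherwise makes a
non-blocking attempt so that the caller can back off.

```c
#include <assert.h>
#include <errno.h>
#include <pthread.h>

/*
 * Hypothetical helper (not a kernel API): take `lock` as part of a batch.
 * `nr_held` is the number of locks the caller already holds.
 *
 * Returns 0 on success, -EAGAIN if the lock is contended and blocking
 * would be unsafe, or another negative errno on a locking error.
 */
static int lock_for_batch(pthread_mutex_t *lock, int nr_held)
{
	int err;

	if (nr_held == 0) {
		/* Nothing else is held, so sleeping here cannot close a cycle. */
		err = pthread_mutex_lock(lock);
		return err ? -err : 0;
	}

	/* Other locks are held: never sleep, only try. */
	err = pthread_mutex_trylock(lock);
	if (err == EBUSY)
		return -EAGAIN;
	return err ? -err : 0;
}
```

A caller that gets -EAGAIN would release or finish with the locks it
already holds and retry later, which is loosely analogous to what
MIGRATE_ASYNC mode does by not waiting.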

To fix the deadlock, in this patch, we avoid batching the page
migration except in MIGRATE_ASYNC mode.  In MIGRATE_ASYNC mode,
synchronous waiting is avoided.

The fix can be improved further.  We will do that as soon as possible.

Link: https://lore.kernel.org/linux-mm/87a6c8c-c5c1-67dc-1e32-eb30831d6e3d@google.com/
Link: https://lore.kernel.org/linux-mm/874jrg7kke.fsf@yhuang6-desk2.ccr.corp.intel.com/
Signed-off-by: "Huang, Ying"
Reported-by: Hugh Dickins
Reported-by: "Xu, Pengfei"
Cc: Christoph Hellwig
Cc: Stefan Roesch
Cc: Tejun Heo
Cc: Xin Hao
Cc: Zi Yan
Cc: Yang Shi
Cc: Baolin Wang
Cc: Matthew Wilcox
Cc: Mike Kravetz
---
 mm/migrate.c | 62 ++++++++++++++++------------------------------------
 1 file changed, 19 insertions(+), 43 deletions(-)

diff --git a/mm/migrate.c b/mm/migrate.c
index 37865f85df6d..7ac37dbbf307 100644
--- a/mm/migrate.c
+++ b/mm/migrate.c
@@ -1106,7 +1106,7 @@ static void migrate_folio_done(struct folio *src,
 /* Obtain the lock on page, remove all ptes. */
 static int migrate_folio_unmap(new_page_t get_new_page, free_page_t put_new_page,
 			       unsigned long private, struct folio *src,
-			       struct folio **dstp, int force, bool avoid_force_lock,
+			       struct folio **dstp, int force,
 			       enum migrate_mode mode, enum migrate_reason reason,
 			       struct list_head *ret)
 {
@@ -1157,17 +1157,6 @@ static int migrate_folio_unmap(new_page_t get_new_page, free_page_t put_new_page
 		if (current->flags & PF_MEMALLOC)
 			goto out;
 
-		/*
-		 * We have locked some folios and are going to wait to lock
-		 * this folio.  To avoid a potential deadlock, let's bail
-		 * out and not do that. The locked folios will be moved and
-		 * unlocked, then we can wait to lock this folio.
-		 */
-		if (avoid_force_lock) {
-			rc = -EDEADLOCK;
-			goto out;
-		}
-
 		folio_lock(src);
 	}
 	locked = true;
@@ -1247,7 +1236,7 @@ static int migrate_folio_unmap(new_page_t get_new_page, free_page_t put_new_page
 		/* Establish migration ptes */
 		VM_BUG_ON_FOLIO(folio_test_anon(src) &&
 			       !folio_test_ksm(src) && !anon_vma, src);
-		try_to_migrate(src, TTU_BATCH_FLUSH);
+		try_to_migrate(src, mode == MIGRATE_ASYNC ? TTU_BATCH_FLUSH : 0);
 		page_was_mapped = 1;
 	}
 
@@ -1261,7 +1250,7 @@ static int migrate_folio_unmap(new_page_t get_new_page, free_page_t put_new_page
 	 * A folio that has not been unmapped will be restored to
 	 * right list unless we want to retry.
 	 */
-	if (rc == -EAGAIN || rc == -EDEADLOCK)
+	if (rc == -EAGAIN)
 		ret = NULL;
 
 	migrate_folio_undo_src(src, page_was_mapped, anon_vma, locked, ret);
@@ -1634,11 +1623,9 @@ static int migrate_pages_batch(struct list_head *from, new_page_t get_new_page,
 	LIST_HEAD(dst_folios);
 	bool nosplit = (reason == MR_NUMA_MISPLACED);
 	bool no_split_folio_counting = false;
-	bool avoid_force_lock;
 
 retry:
 	rc_saved = 0;
-	avoid_force_lock = false;
 	retry = 1;
 	for (pass = 0;
 	     pass < NR_MAX_MIGRATE_PAGES_RETRY && (retry || large_retry);
@@ -1683,15 +1670,14 @@ static int migrate_pages_batch(struct list_head *from, new_page_t get_new_page,
 			}
 
 			rc = migrate_folio_unmap(get_new_page, put_new_page, private,
-						 folio, &dst, pass > 2, avoid_force_lock,
-						 mode, reason, ret_folios);
+						 folio, &dst, pass > 2, mode,
+						 reason, ret_folios);
 			/*
 			 * The rules are:
 			 *	Success: folio will be freed
 			 *	Unmap: folio will be put on unmap_folios list,
 			 *	       dst folio put on dst_folios list
 			 *	-EAGAIN: stay on the from list
-			 *	-EDEADLOCK: stay on the from list
 			 *	-ENOMEM: stay on the from list
 			 *	Other errno: put on ret_folios list
 			 */
@@ -1743,14 +1729,6 @@ static int migrate_pages_batch(struct list_head *from, new_page_t get_new_page,
 					goto out;
 				else
 					goto move;
-			case -EDEADLOCK:
-				/*
-				 * The folio cannot be locked for potential deadlock.
-				 * Go move (and unlock) all locked folios.  Then we can
-				 * try again.
-				 */
-				rc_saved = rc;
-				goto move;
 			case -EAGAIN:
 				if (is_large) {
 					large_retry++;
@@ -1765,11 +1743,6 @@ static int migrate_pages_batch(struct list_head *from, new_page_t get_new_page,
 				stats->nr_thp_succeeded += is_thp;
 				break;
 			case MIGRATEPAGE_UNMAP:
-				/*
-				 * We have locked some folios, don't force lock
-				 * to avoid deadlock.
-				 */
-				avoid_force_lock = true;
 				list_move_tail(&folio->lru, &unmap_folios);
 				list_add_tail(&dst->lru, &dst_folios);
 				break;
@@ -1894,17 +1867,15 @@ static int migrate_pages_batch(struct list_head *from, new_page_t get_new_page,
 		 */
 		list_splice_init(from, ret_folios);
 		list_splice_init(&split_folios, from);
+		/*
+		 * Force async mode to avoid to wait lock or bit when we have
+		 * locked more than one folios.
+		 */
+		mode = MIGRATE_ASYNC;
 		no_split_folio_counting = true;
 		goto retry;
 	}
 
-	/*
-	 * We have unlocked all locked folios, so we can force lock now, let's
-	 * try again.
-	 */
-	if (rc == -EDEADLOCK)
-		goto retry;
-
 	return rc;
 }
 
@@ -1939,7 +1910,7 @@ int migrate_pages(struct list_head *from, new_page_t get_new_page,
 		enum migrate_mode mode, int reason, unsigned int *ret_succeeded)
 {
 	int rc, rc_gather;
-	int nr_pages;
+	int nr_pages, batch;
 	struct folio *folio, *folio2;
 	LIST_HEAD(folios);
 	LIST_HEAD(ret_folios);
@@ -1953,6 +1924,11 @@ int migrate_pages(struct list_head *from, new_page_t get_new_page,
 				     mode, reason, &stats, &ret_folios);
 	if (rc_gather < 0)
 		goto out;
+
+	if (mode == MIGRATE_ASYNC)
+		batch = NR_MAX_BATCHED_MIGRATION;
+	else
+		batch = 1;
 again:
 	nr_pages = 0;
 	list_for_each_entry_safe(folio, folio2, from, lru) {
@@ -1963,11 +1939,11 @@ int migrate_pages(struct list_head *from, new_page_t get_new_page,
 		}
 
 		nr_pages += folio_nr_pages(folio);
-		if (nr_pages > NR_MAX_BATCHED_MIGRATION)
+		if (nr_pages >= batch)
 			break;
 	}
-	if (nr_pages > NR_MAX_BATCHED_MIGRATION)
-		list_cut_before(&folios, from, &folio->lru);
+	if (nr_pages >= batch)
+		list_cut_before(&folios, from, &folio2->lru);
 	else
 		list_splice_init(from, &folios);
 	rc = migrate_pages_batch(&folios, get_new_page, put_new_page, private,

From patchwork Fri Feb 24 14:11:44 2023
X-Patchwork-Submitter: "Huang, Ying"
X-Patchwork-Id: 13151288
From: Huang Ying
To: Andrew Morton
Cc: linux-mm@kvack.org, linux-kernel@vger.kernel.org, Huang Ying,
 Hugh Dickins, "Xu, Pengfei", Christoph Hellwig, Stefan Roesch, Tejun Heo,
 Xin Hao, Zi Yan, Yang Shi, Baolin Wang, Matthew Wilcox, Mike Kravetz
Subject: [PATCH 2/3] migrate_pages: move split folios processing out of
 migrate_pages_batch()
Date: Fri, 24 Feb 2023 22:11:44 +0800
Message-Id: <20230224141145.96814-3-ying.huang@intel.com>
X-Mailer: git-send-email 2.39.1
In-Reply-To: <20230224141145.96814-1-ying.huang@intel.com>
References: <20230224141145.96814-1-ying.huang@intel.com>

To simplify the code logic and reduce the number of lines.
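
The restructuring named in the subject line, where the batch function
hands split pieces back to its caller instead of looping over them
itself, can be pictured with a toy model.  This is only a sketch under
stated assumptions: struct item_list, item_list_add(), process_batch()
and process_all() are hypothetical names, an "item" stands in for a
folio, and its integer size stands in for the number of pages.

```c
#include <assert.h>
#include <stddef.h>

#define MAX_ITEMS 16

/* Hypothetical stand-in for a folio list: a fixed-size array of sizes. */
struct item_list {
	int size[MAX_ITEMS];
	size_t nr;
};

static void item_list_add(struct item_list *l, int size)
{
	if (l->nr < MAX_ITEMS)
		l->size[l->nr++] = size;
}

/*
 * Toy batch step: items of size 1 "migrate"; larger items are split into
 * size-1 pieces that are handed to the caller through `split` instead of
 * being retried internally.  Returns the number of items migrated.
 */
static int process_batch(const struct item_list *from, struct item_list *split)
{
	int done = 0;

	for (size_t i = 0; i < from->nr; i++) {
		if (from->size[i] == 1) {
			done++;
			continue;
		}
		/* The second round passes split == NULL: no further splitting. */
		if (!split)
			continue;
		for (int j = 0; j < from->size[i]; j++)
			item_list_add(split, 1);
	}
	return done;
}

/* The caller drives both rounds, so the batch step needs no internal retry. */
static int process_all(const struct item_list *from)
{
	struct item_list split = { .nr = 0 };
	int done = process_batch(from, &split);

	if (split.nr)
		done += process_batch(&split, NULL);
	return done;
}
```

With the input sizes {1, 4, 1}, the first round migrates two items and
yields four split pieces, and the second round migrates those four.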

Signed-off-by: "Huang, Ying"
Cc: Hugh Dickins
Cc: "Xu, Pengfei"
Cc: Christoph Hellwig
Cc: Stefan Roesch
Cc: Tejun Heo
Cc: Xin Hao
Cc: Zi Yan
Cc: Yang Shi
Cc: Baolin Wang
Cc: Matthew Wilcox
Cc: Mike Kravetz
Reviewed-by: Baolin Wang
---
 mm/migrate.c | 76 ++++++++++++++++++----------------------------------
 1 file changed, 26 insertions(+), 50 deletions(-)

diff --git a/mm/migrate.c b/mm/migrate.c
index 7ac37dbbf307..91198b487e49 100644
--- a/mm/migrate.c
+++ b/mm/migrate.c
@@ -1605,9 +1605,10 @@ static int migrate_hugetlbs(struct list_head *from, new_page_t get_new_page,
 static int migrate_pages_batch(struct list_head *from, new_page_t get_new_page,
 		free_page_t put_new_page, unsigned long private,
 		enum migrate_mode mode, int reason, struct list_head *ret_folios,
-		struct migrate_pages_stats *stats)
+		struct list_head *split_folios, struct migrate_pages_stats *stats,
+		int nr_pass)
 {
-	int retry;
+	int retry = 1;
 	int large_retry = 1;
 	int thp_retry = 1;
 	int nr_failed = 0;
@@ -1617,19 +1618,12 @@ static int migrate_pages_batch(struct list_head *from, new_page_t get_new_page,
 	bool is_large = false;
 	bool is_thp = false;
 	struct folio *folio, *folio2, *dst = NULL, *dst2;
-	int rc, rc_saved, nr_pages;
-	LIST_HEAD(split_folios);
+	int rc, rc_saved = 0, nr_pages;
 	LIST_HEAD(unmap_folios);
 	LIST_HEAD(dst_folios);
 	bool nosplit = (reason == MR_NUMA_MISPLACED);
-	bool no_split_folio_counting = false;
 
-retry:
-	rc_saved = 0;
-	retry = 1;
-	for (pass = 0;
-	     pass < NR_MAX_MIGRATE_PAGES_RETRY && (retry || large_retry);
-	     pass++) {
+	for (pass = 0; pass < nr_pass && (retry || large_retry); pass++) {
 		retry = 0;
 		large_retry = 0;
 		thp_retry = 0;
@@ -1660,7 +1654,7 @@ static int migrate_pages_batch(struct list_head *from, new_page_t get_new_page,
 			if (!thp_migration_supported() && is_thp) {
 				nr_large_failed++;
 				stats->nr_thp_failed++;
-				if (!try_split_folio(folio, &split_folios)) {
+				if (!try_split_folio(folio, split_folios)) {
 					stats->nr_thp_split++;
 					continue;
 				}
@@ -1692,7 +1686,7 @@ static int migrate_pages_batch(struct list_head *from, new_page_t get_new_page,
 					stats->nr_thp_failed += is_thp;
 					/* Large folio NUMA faulting doesn't split to retry. */
 					if (!nosplit) {
-						int ret = try_split_folio(folio, &split_folios);
+						int ret = try_split_folio(folio, split_folios);
 
 						if (!ret) {
 							stats->nr_thp_split += is_thp;
@@ -1709,18 +1703,11 @@ static int migrate_pages_batch(struct list_head *from, new_page_t get_new_page,
 							break;
 						}
 					}
-				} else if (!no_split_folio_counting) {
+				} else {
 					nr_failed++;
 				}
 
 				stats->nr_failed_pages += nr_pages + nr_retry_pages;
-				/*
-				 * There might be some split folios of fail-to-migrate large
-				 * folios left in split_folios list. Move them to ret_folios
-				 * list so that they could be put back to the right list by
-				 * the caller otherwise the folio refcnt will be leaked.
-				 */
-				list_splice_init(&split_folios, ret_folios);
 				/* nr_failed isn't updated for not used */
 				nr_large_failed += large_retry;
 				stats->nr_thp_failed += thp_retry;
@@ -1733,7 +1720,7 @@ static int migrate_pages_batch(struct list_head *from, new_page_t get_new_page,
 				if (is_large) {
 					large_retry++;
 					thp_retry += is_thp;
-				} else if (!no_split_folio_counting) {
+				} else {
 					retry++;
 				}
 				nr_retry_pages += nr_pages;
@@ -1756,7 +1743,7 @@ static int migrate_pages_batch(struct list_head *from, new_page_t get_new_page,
 				if (is_large) {
 					nr_large_failed++;
 					stats->nr_thp_failed += is_thp;
-				} else if (!no_split_folio_counting) {
+				} else {
 					nr_failed++;
 				}
 
@@ -1774,9 +1761,7 @@ static int migrate_pages_batch(struct list_head *from, new_page_t get_new_page,
 	try_to_unmap_flush();
 
 	retry = 1;
-	for (pass = 0;
-	     pass < NR_MAX_MIGRATE_PAGES_RETRY && (retry || large_retry);
-	     pass++) {
+	for (pass = 0; pass < nr_pass && (retry || large_retry); pass++) {
 		retry = 0;
 		large_retry = 0;
 		thp_retry = 0;
@@ -1805,7 +1790,7 @@ static int migrate_pages_batch(struct list_head *from, new_page_t get_new_page,
 				if (is_large) {
 					large_retry++;
 					thp_retry += is_thp;
-				} else if (!no_split_folio_counting) {
+				} else {
 					retry++;
 				}
 				nr_retry_pages += nr_pages;
@@ -1818,7 +1803,7 @@ static int migrate_pages_batch(struct list_head *from, new_page_t get_new_page,
 				if (is_large) {
 					nr_large_failed++;
 					stats->nr_thp_failed += is_thp;
-				} else if (!no_split_folio_counting) {
+				} else {
 					nr_failed++;
 				}
 
@@ -1855,27 +1840,6 @@ static int migrate_pages_batch(struct list_head *from, new_page_t get_new_page,
 		dst2 = list_next_entry(dst, lru);
 	}
 
-	/*
-	 * Try to migrate split folios of fail-to-migrate large folios, no
-	 * nr_failed counting in this round, since all split folios of a
-	 * large folio is counted as 1 failure in the first round.
-	 */
-	if (rc >= 0 && !list_empty(&split_folios)) {
-		/*
-		 * Move non-migrated folios (after NR_MAX_MIGRATE_PAGES_RETRY
-		 * retries) to ret_folios to avoid migrating them again.
-		 */
-		list_splice_init(from, ret_folios);
-		list_splice_init(&split_folios, from);
-		/*
-		 * Force async mode to avoid to wait lock or bit when we have
-		 * locked more than one folios.
-		 */
-		mode = MIGRATE_ASYNC;
-		no_split_folio_counting = true;
-		goto retry;
-	}
-
 	return rc;
 }
 
@@ -1914,6 +1878,7 @@ int migrate_pages(struct list_head *from, new_page_t get_new_page,
 	struct folio *folio, *folio2;
 	LIST_HEAD(folios);
 	LIST_HEAD(ret_folios);
+	LIST_HEAD(split_folios);
 	struct migrate_pages_stats stats;
 
 	trace_mm_migrate_pages_start(mode, reason);
@@ -1947,12 +1912,23 @@ int migrate_pages(struct list_head *from, new_page_t get_new_page,
 	else
 		list_splice_init(from, &folios);
 	rc = migrate_pages_batch(&folios, get_new_page, put_new_page, private,
-				 mode, reason, &ret_folios, &stats);
+				 mode, reason, &ret_folios, &split_folios, &stats,
+				 NR_MAX_MIGRATE_PAGES_RETRY);
 	list_splice_tail_init(&folios, &ret_folios);
 	if (rc < 0) {
 		rc_gather = rc;
+		list_splice_tail(&split_folios, &ret_folios);
 		goto out;
 	}
+	if (!list_empty(&split_folios)) {
+		/*
+		 * Failure isn't counted since all split folios of a large folio
+		 * is counted as 1 failure already.
+ */ + migrate_pages_batch(&split_folios, get_new_page, put_new_page, private, + MIGRATE_ASYNC, reason, &ret_folios, NULL, &stats, 1); + list_splice_tail_init(&split_folios, &ret_folios); + } rc_gather += rc; if (!list_empty(from)) goto again; From patchwork Fri Feb 24 14:11:45 2023 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: "Huang, Ying" X-Patchwork-Id: 13151289 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 9F7E6C61DA3 for ; Fri, 24 Feb 2023 14:12:27 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id DE0B76B0075; Fri, 24 Feb 2023 09:12:26 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id D41376B0078; Fri, 24 Feb 2023 09:12:26 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id C08CA6B007B; Fri, 24 Feb 2023 09:12:26 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0013.hostedemail.com [216.40.44.13]) by kanga.kvack.org (Postfix) with ESMTP id B21456B0075 for ; Fri, 24 Feb 2023 09:12:26 -0500 (EST) Received: from smtpin19.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay02.hostedemail.com (Postfix) with ESMTP id 74FCF120AF1 for ; Fri, 24 Feb 2023 14:12:26 +0000 (UTC) X-FDA: 80502375492.19.A5D519D Received: from mga04.intel.com (mga04.intel.com [192.55.52.120]) by imf19.hostedemail.com (Postfix) with ESMTP id 5AEA91A0002 for ; Fri, 24 Feb 2023 14:12:24 +0000 (UTC) Authentication-Results: imf19.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b=OY+agIl3; dmarc=pass (policy=none) header.from=intel.com; spf=pass (imf19.hostedemail.com: domain of ying.huang@intel.com designates 192.55.52.120 as permitted sender) smtp.mailfrom=ying.huang@intel.com 
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1677247944; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=jtIb4ZWHwjQeYauy02UTZy5dAdRH53XVqaRUoqPshEM=; b=TJjcDg365e1tQva8H/mV4+KdjGZIpsKv246RgZFOdyzb7nWuydq2mmAsqIEM2gcMUDzGXR JtxInMqWC4w+UHvcbGrQSPiONRl2KTBeiHWhmX4EsWvptVLlpkxYpuD/2sBqr8w04lBByX HgBLFxcgq4okLS7Py2i0gQZGUcsdcEs=
ARC-Authentication-Results: i=1; imf19.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b=OY+agIl3; dmarc=pass (policy=none) header.from=intel.com; spf=pass (imf19.hostedemail.com: domain of ying.huang@intel.com designates 192.55.52.120 as permitted sender) smtp.mailfrom=ying.huang@intel.com
ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1677247944; a=rsa-sha256; cv=none; b=ThlWgioArYYq3xYE1ysojKqPo5+YXt1LKxDO2MTtiPQWr45b6IeppsIlkdO8kHHwIAlo+W hNkfq/OOmJDBKpirJGuscvZ9HYwXaK1PSQ+KeUQASiANlIlhwV9SJXRqXBIBM7Quo+OgjW 76T/OtFo45GcfSmiQs+6xnA3QQSI/kI=
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1677247944; x=1708783944; h=from:to:cc:subject:date:message-id:in-reply-to: references:mime-version:content-transfer-encoding; bh=Dxby7yNPzwalm19ztka4+4UYeHwqmfCE4JJYZx6x6X0=; b=OY+agIl3Ab4Ka0BH0Ou/Qieku9/hZfjq70j/TkyYQ8jEWMJkiYIBuD6g o5St+rbNX7L1LgkXTN9/FsPQRKkemCA9wDf2ZgRVqRiqBltDYRvPgaLgi 8NfQLGNsTtap4VwwgYuXCnBR81y4CVUVgEfbM3qdAmclqVy+qX1uhOei6 trSVRMow0F5Mhov0cNF9suQJMbeZ6yJM+LhTpkjowi0/UzI24MPRzdfvX h1sHvYYglDISjHIN5ycTpL2fpFNCkC/rZi+Jfni2K4nYcAT4+xeUvRS6S x2BfCqJ3ciQecT8VQUfwgNiAQZ9jK0cW1582zUejoajjsDQga7NfJeu1P A==;
X-IronPort-AV: E=McAfee;i="6500,9779,10630"; a="332167738"
X-IronPort-AV: E=Sophos;i="5.97,324,1669104000"; d="scan'208";a="332167738"
Received: from fmsmga004.fm.intel.com ([10.253.24.48]) by fmsmga104.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 24 Feb 2023 06:12:22 -0800
X-IronPort-AV: E=McAfee;i="6500,9779,10630"; a="741684670"
X-IronPort-AV: E=Sophos;i="5.97,324,1669104000"; d="scan'208";a="741684670"
Received: from bingqili-mobl2.ccr.corp.intel.com (HELO yhuang6-mobl2.ccr.corp.intel.com) ([10.255.28.19]) by fmsmga004-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 24 Feb 2023 06:12:19 -0800
From: Huang Ying
To: Andrew Morton
Cc: linux-mm@kvack.org, linux-kernel@vger.kernel.org, Huang Ying, Hugh Dickins, "Xu, Pengfei", Christoph Hellwig, Stefan Roesch, Tejun Heo, Xin Hao, Zi Yan, Yang Shi, Baolin Wang, Matthew Wilcox, Mike Kravetz
Subject: [PATCH 3/3] migrate_pages: try migrate in batch asynchronously firstly
Date: Fri, 24 Feb 2023 22:11:45 +0800
Message-Id: <20230224141145.96814-4-ying.huang@intel.com>
X-Mailer: git-send-email 2.39.1
In-Reply-To: <20230224141145.96814-1-ying.huang@intel.com>
References: <20230224141145.96814-1-ying.huang@intel.com>
MIME-Version: 1.0
X-Rspam-User:
X-Rspamd-Server: rspam02
X-Rspamd-Queue-Id: 5AEA91A0002
X-Stat-Signature: wjpssjbedm46wdbmhbpp4rucm9w8ek64
X-HE-Tag: 1677247944-116700
X-HE-Meta:
 U2FsdGVkX18lkLFPmvz+f7oouq8PQJlXPh//FG0xevgsAFwb8L2yUXncJmxnSuaF9NCSIbHvZJI5Tf2LjcoVldVs2ri66k0ryxOmflVylC6+bfFIaEDnXu0vnqIJLj9GcIVJLCdp7tG0xCtWYwGLBYGdzCZ1YouUuztbJDMzsNua4+n0Tzi0MK7sQLwR7NBkmr6XS5Apakqgw/1vxlQNjdY1heXkGaeU05YefZNQgeGrY/4N+bRqf5Bc8KNYzJvcZCQGK5D4+A0WKfb2NcqLyDUsKWrM29YT6QdvXWR0oa5K0i/7nr058D5STJslZ5nPXZ3Cr+TzoNfvcfE1+0eQMQmsdHe/nutIsKDdR4uKnqcoUnmPRv6aMg3oiOXwaO2VNULXwbrtW3z2oN13mdKF6NVxj1kb86d4nkDa88gWra4YP8pNfxwH7mSjoYYFG01E6la1wgXtjuJQr5WpAY2NWsZ3hbSZA1nRk5/7PZYb6X5FPP3nyTnJx0mnpHd0yl8pAWTboVEqgTG1I+b0JzuZnOq5+AV2f4C6jBmSSGIXirrcd6x7190sPcLwR4tZCJKV+ydZRItasw3Ztk004pwPh1s4xP5twwH7XvZEUGy8C3dHJxWwgutyCaUCRaZN45euFyfjm8f9owxURV9Imcw+3dE139tPVtwbupOrqG0vFe/KlgH1pF+iC13g9TOjot7xfEMz7FmmzgLM51uOPFA6LYJac+Fjkb99He3PczYOftnuXcwHb8lPQtprpyd9L0sDN2f67NR+nSYYx+J1FHX61RYeDlkwUEwbZ1mwlm82CxY66w0i9rnmocuuFcTo1z3vVCfN23BOx1GcWHcD6YsquVMzWm03KLkuDthcbsh3lU4EKQpy0mu/dy+Z/ZtbuvpeBsnmaNcr3X/FhjDzCbVd7UB5BfUzl+C8BXTMxMx3L9mzo06p/DLtg+IGrqlvCC+8OjHaZeH6b7t8H4+NGts 3YJzUAG8 mO0+RxUR6rjHnGT3nncm4+Ah+lpujq7jcV848mgjvz0NX6wgxNrOHXf4V4bk6l2XgHwYXRP8wOmTKc9TXLHh631Ka7wzZWwEKt1+EvTsWejXRHJj1/gzRRfgTRqSaB2rdH4aE9d6FiMaHwCIx9lmw95+aE0qDxOvIW4Vbl9DhSbhruWnZMuB+edVniDz74fEEC0LsY+3kVkVW7KncxtFiQFOonQMfO6UzT8ppCNgrDAUhZwchtrL/U5ILROcunfpxFu0Dz/Zd+lMLTmHRINFGE4j29hwPSDmiJRqBZP9wqBu+K73MJ77RPLgzunWyeR+8MmjEgVGnSOwAPX3Pt4yqIyZhEXGMzmfIALbBxNkbQAo3wjttN3+JuWPX66hNwPqdrxd0nDptbEbfVmBtt0Ghxl/jH7AGsfdg8hKFkZ7FOj2TGCwA+IFse7QEOTYBl2O6yldVOeUdevplaik=
X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4
Sender: owner-linux-mm@kvack.org
Precedence: bulk
X-Loop: owner-majordomo@kvack.org
List-ID:

When more than one folio is locked, we cannot wait synchronously for a
lock or bit (e.g., page lock, buffer head lock, writeback bit);
otherwise a deadlock may be triggered.  This makes it hard to batch
synchronous migration directly.

This patch re-enables batching for synchronous migration by first
trying to migrate the batch asynchronously.
Any folios that fail to migrate asynchronously are then migrated
synchronously, one by one.

Tests show that this effectively restores the TLB flush batching
performance for synchronous migration.

Signed-off-by: "Huang, Ying"
Cc: Hugh Dickins
Cc: "Xu, Pengfei"
Cc: Christoph Hellwig
Cc: Stefan Roesch
Cc: Tejun Heo
Cc: Xin Hao
Cc: Zi Yan
Cc: Yang Shi
Cc: Baolin Wang
Cc: Matthew Wilcox
Cc: Mike Kravetz
Tested-by: Hugh Dickins
Reviewed-by: Baolin Wang
---
 mm/migrate.c | 65 ++++++++++++++++++++++++++++++++++++++++++++--------
 1 file changed, 55 insertions(+), 10 deletions(-)

diff --git a/mm/migrate.c b/mm/migrate.c
index 91198b487e49..c17ce5ee8d92 100644
--- a/mm/migrate.c
+++ b/mm/migrate.c
@@ -1843,6 +1843,51 @@ static int migrate_pages_batch(struct list_head *from, new_page_t get_new_page,
 	return rc;
 }
 
+static int migrate_pages_sync(struct list_head *from, new_page_t get_new_page,
+		free_page_t put_new_page, unsigned long private,
+		enum migrate_mode mode, int reason, struct list_head *ret_folios,
+		struct list_head *split_folios, struct migrate_pages_stats *stats)
+{
+	int rc, nr_failed = 0;
+	LIST_HEAD(folios);
+	struct migrate_pages_stats astats;
+
+	memset(&astats, 0, sizeof(astats));
+	/* Try to migrate in batch with MIGRATE_ASYNC mode firstly */
+	rc = migrate_pages_batch(from, get_new_page, put_new_page, private, MIGRATE_ASYNC,
+				 reason, &folios, split_folios, &astats,
+				 NR_MAX_MIGRATE_PAGES_RETRY);
+	stats->nr_succeeded += astats.nr_succeeded;
+	stats->nr_thp_succeeded += astats.nr_thp_succeeded;
+	stats->nr_thp_split += astats.nr_thp_split;
+	if (rc < 0) {
+		stats->nr_failed_pages += astats.nr_failed_pages;
+		stats->nr_thp_failed += astats.nr_thp_failed;
+		list_splice_tail(&folios, ret_folios);
+		return rc;
+	}
+	stats->nr_thp_failed += astats.nr_thp_split;
+	nr_failed += astats.nr_thp_split;
+	/*
+	 * Fall back to migrate all failed folios one by one synchronously. All
+	 * failed folios except split THPs will be retried, so their failure
+	 * isn't counted
+	 */
+	list_splice_tail_init(&folios, from);
+	while (!list_empty(from)) {
+		list_move(from->next, &folios);
+		rc = migrate_pages_batch(&folios, get_new_page, put_new_page,
+					 private, mode, reason, ret_folios,
+					 split_folios, stats, NR_MAX_MIGRATE_PAGES_RETRY);
+		list_splice_tail_init(&folios, ret_folios);
+		if (rc < 0)
+			return rc;
+		nr_failed += rc;
+	}
+
+	return nr_failed;
+}
+
 /*
  * migrate_pages - migrate the folios specified in a list, to the free folios
  *		   supplied as the target for the page migration
@@ -1874,7 +1919,7 @@ int migrate_pages(struct list_head *from, new_page_t get_new_page,
 		enum migrate_mode mode, int reason, unsigned int *ret_succeeded)
 {
 	int rc, rc_gather;
-	int nr_pages, batch;
+	int nr_pages;
 	struct folio *folio, *folio2;
 	LIST_HEAD(folios);
 	LIST_HEAD(ret_folios);
@@ -1890,10 +1935,6 @@ int migrate_pages(struct list_head *from, new_page_t get_new_page,
 	if (rc_gather < 0)
 		goto out;
 
-	if (mode == MIGRATE_ASYNC)
-		batch = NR_MAX_BATCHED_MIGRATION;
-	else
-		batch = 1;
 again:
 	nr_pages = 0;
 	list_for_each_entry_safe(folio, folio2, from, lru) {
@@ -1904,16 +1945,20 @@ int migrate_pages(struct list_head *from, new_page_t get_new_page,
 		}
 
 		nr_pages += folio_nr_pages(folio);
-		if (nr_pages >= batch)
+		if (nr_pages >= NR_MAX_BATCHED_MIGRATION)
 			break;
 	}
-	if (nr_pages >= batch)
+	if (nr_pages >= NR_MAX_BATCHED_MIGRATION)
 		list_cut_before(&folios, from, &folio2->lru);
 	else
 		list_splice_init(from, &folios);
-	rc = migrate_pages_batch(&folios, get_new_page, put_new_page, private,
-				 mode, reason, &ret_folios, &split_folios, &stats,
-				 NR_MAX_MIGRATE_PAGES_RETRY);
+	if (mode == MIGRATE_ASYNC)
+		rc = migrate_pages_batch(&folios, get_new_page, put_new_page, private,
+				mode, reason, &ret_folios, &split_folios, &stats,
+				NR_MAX_MIGRATE_PAGES_RETRY);
+	else
+		rc = migrate_pages_sync(&folios, get_new_page, put_new_page, private,
+				mode, reason, &ret_folios, &split_folios, &stats);
 	list_splice_tail_init(&folios, &ret_folios);
 	if (rc < 0) {
 		rc_gather = rc;