diff mbox series

mm/migrate: make migrate_pages_batch() stats consistent.

Message ID 20240618134151.29214-1-zi.yan@sent.com (mailing list archive)
State New
Headers show
Series mm/migrate: make migrate_pages_batch() stats consistent. | expand

Commit Message

Zi Yan June 18, 2024, 1:41 p.m. UTC
From: Zi Yan <ziy@nvidia.com>

As Ying pointed out in [1], stats->nr_thp_failed needs to be updated to
avoid stats inconsistency between MIGRATE_SYNC and MIGRATE_ASYNC when
calling migrate_pages_batch().

[1] https://lore.kernel.org/linux-mm/87msnq7key.fsf@yhuang6-desk2.ccr.corp.intel.com/

Suggested-by: "Huang, Ying" <ying.huang@intel.com>
Signed-off-by: Zi Yan <ziy@nvidia.com>
---
 mm/migrate.c | 5 +++++
 1 file changed, 5 insertions(+)


base-commit: 14d7c92f8df9c0964ae6f8b813c1b3ac38120825

Comments

Andrew Morton June 18, 2024, 7:44 p.m. UTC | #1
On Tue, 18 Jun 2024 09:41:51 -0400 Zi Yan <zi.yan@sent.com> wrote:

> From: Zi Yan <ziy@nvidia.com>
> 
> As Ying pointed out in [1], stats->nr_thp_failed needs to be updated to
> avoid stats inconsistency between MIGRATE_SYNC and MIGRATE_ASYNC when
> calling migrate_pages_batch().
> 

Thanks.  I'll add

Fixes: 7262f208ca68 ("mm/migrate: split source folio if it is on deferred split list")

and shall queue it for 6.10-rcX.

Please don't forget such details.
Zi Yan June 18, 2024, 7:48 p.m. UTC | #2
On 18 Jun 2024, at 15:44, Andrew Morton wrote:

> On Tue, 18 Jun 2024 09:41:51 -0400 Zi Yan <zi.yan@sent.com> wrote:
>
>> From: Zi Yan <ziy@nvidia.com>
>>
>> As Ying pointed out in [1], stats->nr_thp_failed needs to be updated to
>> avoid stats inconsistency between MIGRATE_SYNC and MIGRATE_ASYNC when
>> calling migrate_pages_batch().
>>
>
> Thanks.  I'll add
>
> Fixes: 7262f208ca68 ("mm/migrate: split source folio if it is on deferred split list")
>
> and shall queue it for 6.10-rcX.
>
> Please don't forget such details.

I thought about the Fixes tag. Since Hugh's patch fixes the
BUG_ON and VM_BUG_ON, I did not add it here. But yeah stats inconsistency
is something needs to be fixed. Thanks for adding it.

--
Best Regards,
Yan, Zi
Huang, Ying June 19, 2024, 7:25 a.m. UTC | #3
Zi Yan <zi.yan@sent.com> writes:

> From: Zi Yan <ziy@nvidia.com>
>
> As Ying pointed out in [1], stats->nr_thp_failed needs to be updated to
> avoid stats inconsistency between MIGRATE_SYNC and MIGRATE_ASYNC when
> calling migrate_pages_batch().
>
> [1] https://lore.kernel.org/linux-mm/87msnq7key.fsf@yhuang6-desk2.ccr.corp.intel.com/

IMHO, we shouldn't ask people to use link to understand the patch
description or commit message.  If so, we can make the link less
outstanding and maybe add more words about the issue.

> Suggested-by: "Huang, Ying" <ying.huang@intel.com>
> Signed-off-by: Zi Yan <ziy@nvidia.com>

Otherwise, LGTM, Thanks!

[snip]

--
Best Regards,
Huang, Ying
Zi Yan June 19, 2024, 6:13 p.m. UTC | #4
On 19 Jun 2024, at 3:25, Huang, Ying wrote:

> Zi Yan <zi.yan@sent.com> writes:
>
>> From: Zi Yan <ziy@nvidia.com>
>>
>> As Ying pointed out in [1], stats->nr_thp_failed needs to be updated to
>> avoid stats inconsistency between MIGRATE_SYNC and MIGRATE_ASYNC when
>> calling migrate_pages_batch().

Because if not, when migrate_pages_batch() is called via
migrate_pages(MIGRATE_ASYNC), nr_thp_failed will not be increased and
when migrate_pages_batch() is called via migrate_pages(MIGRATE_SYNC*),
nr_thp_failed will be increase in migrate_pages_sync() by
stats->nr_thp_failed += astats.nr_thp_split.

>>
>> [1] https://lore.kernel.org/linux-mm/87msnq7key.fsf@yhuang6-desk2.ccr.corp.intel.com/
>
> IMHO, we shouldn't ask people to use link to understand the patch
> description or commit message.  If so, we can make the link less
> outstanding and maybe add more words about the issue.

I summarized your explanation from the link. You think the summary is not
enough? I can add more text. Let me know if the above text looks
good to you, then I can resend the patch. Thanks.

--
Best Regards,
Yan, Zi
Huang, Ying June 20, 2024, 1:23 a.m. UTC | #5
Zi Yan <ziy@nvidia.com> writes:

> On 19 Jun 2024, at 3:25, Huang, Ying wrote:
>
>> Zi Yan <zi.yan@sent.com> writes:
>>
>>> From: Zi Yan <ziy@nvidia.com>
>>>
>>> As Ying pointed out in [1], stats->nr_thp_failed needs to be updated to
>>> avoid stats inconsistency between MIGRATE_SYNC and MIGRATE_ASYNC when
>>> calling migrate_pages_batch().
>
> Because if not, when migrate_pages_batch() is called via
> migrate_pages(MIGRATE_ASYNC), nr_thp_failed will not be increased and
> when migrate_pages_batch() is called via migrate_pages(MIGRATE_SYNC*),
> nr_thp_failed will be increase in migrate_pages_sync() by
> stats->nr_thp_failed += astats.nr_thp_split.
>
>>>
>>> [1] https://lore.kernel.org/linux-mm/87msnq7key.fsf@yhuang6-desk2.ccr.corp.intel.com/
>>
>> IMHO, we shouldn't ask people to use link to understand the patch
>> description or commit message.  If so, we can make the link less
>> outstanding and maybe add more words about the issue.
>
> I summarized your explanation from the link. You think the summary is not
> enough? I can add more text. Let me know if the above text looks
> good to you, then I can resend the patch. Thanks.

It looks good to me, Thanks!

--
Best Regards,
Huang, Ying
diff mbox series

Patch

diff --git a/mm/migrate.c b/mm/migrate.c
index 2cc5a68f6843..20cb9f5f7446 100644
--- a/mm/migrate.c
+++ b/mm/migrate.c
@@ -1659,6 +1659,10 @@  static int migrate_pages_batch(struct list_head *from,
 			 * migrate_pages() may report success with (split but
 			 * unmigrated) pages still on its fromlist; whereas it
 			 * always reports success when its fromlist is empty.
+			 * stats->nr_thp_failed should be increased too,
+			 * otherwise stats inconsistency will happen when
+			 * migrate_pages_batch is called via migrate_pages()
+			 * with MIGRATE_SYNC and MIGRATE_ASYNC.
 			 *
 			 * Only check it without removing it from the list.
 			 * Since the folio can be on deferred_split_scan()
@@ -1675,6 +1679,7 @@  static int migrate_pages_batch(struct list_head *from,
 			   !list_empty(&folio->_deferred_list)) {
 				if (try_split_folio(folio, split_folios) == 0) {
 					nr_failed++;
+					stats->nr_thp_failed += is_thp;
 					stats->nr_thp_split += is_thp;
 					stats->nr_split++;
 					continue;