[RFC,v2,03/17] mm: Introduce try_vma_alloc_movable_folio()

Message ID	20230414130303.2345383-4-ryan.roberts@arm.com (mailing list archive)
State	New
Headers

Return-Path: <owner-linux-mm@kvack.org>
From: Ryan Roberts <ryan.roberts@arm.com>
To: Andrew Morton <akpm@linux-foundation.org>,
	"Matthew Wilcox (Oracle)" <willy@infradead.org>,
	Yu Zhao <yuzhao@google.com>,
	"Yin, Fengwei" <fengwei.yin@intel.com>
Cc: Ryan Roberts <ryan.roberts@arm.com>,
	linux-mm@kvack.org,
	linux-arm-kernel@lists.infradead.org
Subject: [RFC v2 PATCH 03/17] mm: Introduce try_vma_alloc_movable_folio()
Date: Fri, 14 Apr 2023 14:02:49 +0100
Message-Id: <20230414130303.2345383-4-ryan.roberts@arm.com>
In-Reply-To: <20230414130303.2345383-1-ryan.roberts@arm.com>
References: <20230414130303.2345383-1-ryan.roberts@arm.com>
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
Sender: owner-linux-mm@kvack.org
Precedence: bulk

Series

variable-order, large folios for anonymous memory

[RFC,v2,00/17] variable-order, large folios for anonymous memory
[RFC,v2,01/17] mm: Expose clear_huge_page() unconditionally
[RFC,v2,02/17] mm: pass gfp flags and order to vma_alloc_zeroed_movable_folio()
[RFC,v2,03/17] mm: Introduce try_vma_alloc_movable_folio()
[RFC,v2,04/17] mm: Implement folio_add_new_anon_rmap_range()
[RFC,v2,05/17] mm: Routines to determine max anon folio allocation order
[RFC,v2,06/17] mm: Allocate large folios for anonymous memory
[RFC,v2,07/17] mm: Allow deferred splitting of arbitrary large anon folios
[RFC,v2,08/17] mm: Implement folio_move_anon_rmap_range()
[RFC,v2,09/17] mm: Update wp_page_reuse() to operate on range of pages
[RFC,v2,10/17] mm: Reuse large folios for anonymous memory
[RFC,v2,11/17] mm: Split __wp_page_copy_user() into 2 variants
[RFC,v2,12/17] mm: ptep_clear_flush_range_notify() macro for batch operation
[RFC,v2,13/17] mm: Implement folio_remove_rmap_range()
[RFC,v2,14/17] mm: Copy large folios for anonymous memory
[RFC,v2,15/17] mm: Convert zero page to large folios on write
[RFC,v2,16/17] mm: mmap: Align unhinted maps to highest anon folio order
[RFC,v2,17/17] mm: Batch-zap large anonymous folio PTE mappings

Commit Message

Ryan Roberts April 14, 2023, 1:02 p.m. UTC

Opportunistically attempt to allocate high-order folios in highmem,
optionally zeroed. Retry with lower orders all the way to order-0, until
success. Although, of note, order-1 allocations are skipped since a
large folio must be at least order-2 to work with the THP machinery. The
user must check what they got with folio_order().

This will be used to opportunistically allocate large folios for
anonymous memory with a sensible fallback under memory pressure.

For attempts to allocate non-0 orders, we set __GFP_NORETRY to prevent
high latency due to reclaim, instead preferring to just try for a lower
order. The same approach is used by the readahead code when allocating
large folios.

Signed-off-by: Ryan Roberts <ryan.roberts@arm.com>
---
 mm/memory.c | 33 +++++++++++++++++++++++++++++++++
 1 file changed, 33 insertions(+)

--
2.25.1
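
To illustrate the fallback described in the commit message, here is a minimal userspace sketch in C. It is not the kernel code: model_try_alloc(), alloc_fn, alloc_up_to() and model_nr_pages() are hypothetical names introduced purely for illustration, and the stub allocator only stands in for the real page allocator.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for the page allocator: returns nonzero on success. */
typedef int (*alloc_fn)(int order, void *ctx);

/*
 * Userspace model of the fallback loop: try the requested order, then each
 * lower order down to 2, then order-0. Order-1 is never attempted.
 * Returns the order that succeeded, or -1 if even order-0 failed.
 */
static int model_try_alloc(int order, alloc_fn alloc, void *ctx)
{
	assert(order >= 0 && alloc != NULL);

	for (; order > 1; order--) {
		if (alloc(order, ctx))
			return order;
	}

	return alloc(0, ctx) ? 0 : -1;
}

/* Example stub: succeeds only for orders up to the limit stored in ctx. */
static int alloc_up_to(int order, void *ctx)
{
	return order <= *(const int *)ctx;
}

/* Caller-side check, in the spirit of folio_order(): pages covered. */
static long model_nr_pages(int got_order)
{
	return got_order < 0 ? 0 : 1L << got_order;
}
```

With a stub that only succeeds up to order-2, a request starting at order-4 yields an order-2 result covering 4 pages. The caller learns this only by inspecting the returned order, which is the role folio_order() plays for the real helper.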

Comments

Yin Fengwei April 17, 2023, 8:49 a.m. UTC | #1

On 4/14/2023 9:02 PM, Ryan Roberts wrote:
> Opportunistically attempt to allocate high-order folios in highmem,
> optionally zeroed. Retry with lower orders all the way to order-0, until
> success. Although, of note, order-1 allocations are skipped since a
> large folio must be at least order-2 to work with the THP machinery. The
> user must check what they got with folio_order().
> 
> This will be used to opportunistically allocate large folios for
> anonymous memory with a sensible fallback under memory pressure.
> 
> For attempts to allocate non-0 orders, we set __GFP_NORETRY to prevent
> high latency due to reclaim, instead preferring to just try for a lower
> order. The same approach is used by the readahead code when allocating
> large folios.
I am not sure whether anonymous pages can share the same approach as the
page cache. The latency of adding a new page cache page is dominated by I/O,
so it may not be a big deal to retry with different orders a few times.

Retrying too many times could add latency to anonymous page allocation.

Regards
Yin, Fengwei

> 
> Signed-off-by: Ryan Roberts <ryan.roberts@arm.com>
> ---
>  mm/memory.c | 33 +++++++++++++++++++++++++++++++++
>  1 file changed, 33 insertions(+)
> 
> diff --git a/mm/memory.c b/mm/memory.c
> index 9d5e8be49f3b..ca32f59acef2 100644
> --- a/mm/memory.c
> +++ b/mm/memory.c
> @@ -2989,6 +2989,39 @@ static vm_fault_t fault_dirty_shared_page(struct vm_fault *vmf)
>  	return 0;
>  }
> 
> +static inline struct folio *vma_alloc_movable_folio(struct vm_area_struct *vma,
> +				unsigned long vaddr, int order, bool zeroed)
> +{
> +	gfp_t gfp = order > 0 ? __GFP_NORETRY | __GFP_NOWARN : 0;
> +
> +	if (zeroed)
> +		return vma_alloc_zeroed_movable_folio(vma, vaddr, gfp, order);
> +	else
> +		return vma_alloc_folio(GFP_HIGHUSER_MOVABLE | gfp, order, vma,
> +								vaddr, false);
> +}
> +
> +/*
> + * Opportunistically attempt to allocate high-order folios, retrying with lower
> + * orders all the way to order-0, until success. order-1 allocations are skipped
> + * since a folio must be at least order-2 to work with the THP machinery. The
> + * user must check what they got with folio_order(). vaddr can be any virtual
> + * address that will be mapped by the allocated folio.
> + */
> +static struct folio *try_vma_alloc_movable_folio(struct vm_area_struct *vma,
> +				unsigned long vaddr, int order, bool zeroed)
> +{
> +	struct folio *folio;
> +
> +	for (; order > 1; order--) {
> +		folio = vma_alloc_movable_folio(vma, vaddr, order, zeroed);
> +		if (folio)
> +			return folio;
> +	}
> +
> +	return vma_alloc_movable_folio(vma, vaddr, 0, zeroed);
> +}
> +
>  /*
>   * Handle write page faults for pages that can be reused in the current vma
>   *
> --
> 2.25.1
>
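
The latency concern raised in this comment can be made concrete by counting attempts. The sketch below is a userspace illustration only; model_max_attempts() is a hypothetical name, and it assumes the loop shape shown in the quoted patch (one attempt per order from the starting order down to 2, then a final order-0 attempt).

```c
#include <assert.h>

/*
 * Worst-case number of allocation attempts made by the fallback loop when
 * every high-order attempt fails: one per order from the start order down
 * to 2, plus the final order-0 attempt.
 */
static int model_max_attempts(int start_order)
{
	assert(start_order >= 0);

	if (start_order < 2)
		return 1;		/* only the order-0 attempt */

	return (start_order - 1) + 1;	/* orders start..2, then order-0 */
}
```

Under these assumptions a fault that starts at order-9 can make up to 9 allocator calls before it gets a page, whereas a plain order-0 fault makes one. How expensive each failed __GFP_NORETRY attempt is in practice is not something this model captures.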

Ryan Roberts April 17, 2023, 10:11 a.m. UTC | #2

On 17/04/2023 09:49, Yin, Fengwei wrote:
> 
> 
> On 4/14/2023 9:02 PM, Ryan Roberts wrote:
>> Opportunistically attempt to allocate high-order folios in highmem,
>> optionally zeroed. Retry with lower orders all the way to order-0, until
>> success. Although, of note, order-1 allocations are skipped since a
>> large folio must be at least order-2 to work with the THP machinery. The
>> user must check what they got with folio_order().
>>
>> This will be used to opportunistically allocate large folios for
>> anonymous memory with a sensible fallback under memory pressure.
>>
>> For attempts to allocate non-0 orders, we set __GFP_NORETRY to prevent
>> high latency due to reclaim, instead preferring to just try for a lower
>> order. The same approach is used by the readahead code when allocating
>> large folios.
> I am not sure whether anonymous pages can share the same approach as the
> page cache. The latency of adding a new page cache page is dominated by I/O,
> so it may not be a big deal to retry with different orders a few times.
> 
> Retrying too many times could add latency to anonymous page allocation.

Perhaps I'm better off just using vma_thp_gfp_mask(), or at least taking
inspiration from it?

> 
> Regards
> Yin, Fengwei
> 
>>
>> Signed-off-by: Ryan Roberts <ryan.roberts@arm.com>
>> ---
>>  mm/memory.c | 33 +++++++++++++++++++++++++++++++++
>>  1 file changed, 33 insertions(+)
>>
>> diff --git a/mm/memory.c b/mm/memory.c
>> index 9d5e8be49f3b..ca32f59acef2 100644
>> --- a/mm/memory.c
>> +++ b/mm/memory.c
>> @@ -2989,6 +2989,39 @@ static vm_fault_t fault_dirty_shared_page(struct vm_fault *vmf)
>>  	return 0;
>>  }
>>
>> +static inline struct folio *vma_alloc_movable_folio(struct vm_area_struct *vma,
>> +				unsigned long vaddr, int order, bool zeroed)
>> +{
>> +	gfp_t gfp = order > 0 ? __GFP_NORETRY | __GFP_NOWARN : 0;
>> +
>> +	if (zeroed)
>> +		return vma_alloc_zeroed_movable_folio(vma, vaddr, gfp, order);
>> +	else
>> +		return vma_alloc_folio(GFP_HIGHUSER_MOVABLE | gfp, order, vma,
>> +								vaddr, false);
>> +}
>> +
>> +/*
>> + * Opportunistically attempt to allocate high-order folios, retrying with lower
>> + * orders all the way to order-0, until success. order-1 allocations are skipped
>> + * since a folio must be at least order-2 to work with the THP machinery. The
>> + * user must check what they got with folio_order(). vaddr can be any virtual
>> + * address that will be mapped by the allocated folio.
>> + */
>> +static struct folio *try_vma_alloc_movable_folio(struct vm_area_struct *vma,
>> +				unsigned long vaddr, int order, bool zeroed)
>> +{
>> +	struct folio *folio;
>> +
>> +	for (; order > 1; order--) {
>> +		folio = vma_alloc_movable_folio(vma, vaddr, order, zeroed);
>> +		if (folio)
>> +			return folio;
>> +	}
>> +
>> +	return vma_alloc_movable_folio(vma, vaddr, 0, zeroed);
>> +}
>> +
>>  /*
>>   * Handle write page faults for pages that can be reused in the current vma
>>   *
>> --
>> 2.25.1
>>

diff --git a/mm/memory.c b/mm/memory.c
index 9d5e8be49f3b..ca32f59acef2 100644
--- a/mm/memory.c
+++ b/mm/memory.c
@@ -2989,6 +2989,39 @@  static vm_fault_t fault_dirty_shared_page(struct vm_fault *vmf)
 	return 0;
 }

+static inline struct folio *vma_alloc_movable_folio(struct vm_area_struct *vma,
+				unsigned long vaddr, int order, bool zeroed)
+{
+	gfp_t gfp = order > 0 ? __GFP_NORETRY | __GFP_NOWARN : 0;
+
+	if (zeroed)
+		return vma_alloc_zeroed_movable_folio(vma, vaddr, gfp, order);
+	else
+		return vma_alloc_folio(GFP_HIGHUSER_MOVABLE | gfp, order, vma,
+								vaddr, false);
+}
+
+/*
+ * Opportunistically attempt to allocate high-order folios, retrying with lower
+ * orders all the way to order-0, until success. order-1 allocations are skipped
+ * since a folio must be at least order-2 to work with the THP machinery. The
+ * user must check what they got with folio_order(). vaddr can be any virtual
+ * address that will be mapped by the allocated folio.
+ */
+static struct folio *try_vma_alloc_movable_folio(struct vm_area_struct *vma,
+				unsigned long vaddr, int order, bool zeroed)
+{
+	struct folio *folio;
+
+	for (; order > 1; order--) {
+		folio = vma_alloc_movable_folio(vma, vaddr, order, zeroed);
+		if (folio)
+			return folio;
+	}
+
+	return vma_alloc_movable_folio(vma, vaddr, 0, zeroed);
+}
+
 /*
  * Handle write page faults for pages that can be reused in the current vma
  *
