Message ID | 20250304150444.3788920-1-ryan.roberts@arm.com (mailing list archive) |
---|---|
Headers | show |
Series | Perf improvements for hugetlb and vmalloc on arm64 | expand |
On Tue, Mar 04, 2025 at 03:04:30PM +0000, Ryan Roberts wrote: > Hi All, > > This is v3 of a series to improve performance for hugetlb and vmalloc on arm64. > Although some of these patches are core-mm, advice from Andrew was to go via the > arm64 tree. Hopefully I can get some ACKs from mm folks. > > The 2 key performance improvements are 1) enabling the use of contpte-mapped > blocks in the vmalloc space when appropriate (which reduces TLB pressure). There > were already hooks for this (used by powerpc) but they required some tidying and > extending for arm64. And 2) batching up barriers when modifying the vmalloc > address space for upto 30% reduction in time taken in vmalloc(). > > vmalloc() performance was measured using the test_vmalloc.ko module. Tested on > Apple M2 and Ampere Altra. Each test had loop count set to 500000 and the whole > test was repeated 10 times. > I will have a look and review just give me some time :) -- Uladzislau Rezki
On 27/03/2025 08:16, Uladzislau Rezki wrote: > On Tue, Mar 04, 2025 at 03:04:30PM +0000, Ryan Roberts wrote: >> Hi All, >> >> This is v3 of a series to improve performance for hugetlb and vmalloc on arm64. >> Although some of these patches are core-mm, advice from Andrew was to go via the >> arm64 tree. Hopefully I can get some ACKs from mm folks. >> >> The 2 key performance improvements are 1) enabling the use of contpte-mapped >> blocks in the vmalloc space when appropriate (which reduces TLB pressure). There >> were already hooks for this (used by powerpc) but they required some tidying and >> extending for arm64. And 2) batching up barriers when modifying the vmalloc >> address space for upto 30% reduction in time taken in vmalloc(). >> >> vmalloc() performance was measured using the test_vmalloc.ko module. Tested on >> Apple M2 and Ampere Altra. Each test had loop count set to 500000 and the whole >> test was repeated 10 times. >> > I will have a look and review just give me some time :) Thanks for the reviews - appreciate it! > > -- > Uladzislau Rezki