Message ID | 20231204105440.61448-1-ryan.roberts@arm.com (mailing list archive) |
---|---|
Series | Transparent Contiguous PTEs for User Mappings |

On 12/4/23 02:54, Ryan Roberts wrote:
> Hi All,
>
> This is v3 of a series to opportunistically and transparently use contpte
> mappings (set the contiguous bit in ptes) for user memory when those mappings
> meet the requirements. It is part of a wider effort to improve performance by
> allocating and mapping variable-sized blocks of memory (folios). One aim is for
> the 4K kernel to approach the performance of the 16K kernel, but without
> breaking compatibility and without the associated increase in memory. Another
> aim is to benefit the 16K and 64K kernels by enabling 2M THP, since this is the
> contpte size for those kernels. We have good performance data that demonstrates
> both aims are being met (see below).
>
> Of course this is only one half of the change. We require the mapped physical
> memory to be the correct size and alignment for this to actually be useful (i.e.
> 64K for 4K pages, or 2M for 16K/64K pages). Fortunately folios are solving this
> problem for us. Filesystems that support it (XFS, AFS, EROFS, tmpfs, ...) will
> allocate large folios up to the PMD size today, and more filesystems are coming.
> And the other half of my work, to enable "multi-size THP" (large folios) for
> anonymous memory, makes contpte sized folios prevalent for anonymous memory too
> [3].
>

Hi Ryan,

Using a couple of Armv8 systems, I've tested this patchset. Details are in my
reply to the mTHP patchset [1]. So for this patchset, please feel free to add:

Tested-by: John Hubbard <jhubbard@nvidia.com>

[1] https://lore.kernel.org/all/2be046e1-ef95-4244-ae23-e56071ae1218@nvidia.com/

thanks,