@@ -4556,10 +4556,21 @@ static vm_fault_t do_numa_page(struct vm_fault *vmf)
* to record page access time. So use default value.
*/
if ((sysctl_numa_balancing_mode & NUMA_BALANCING_MEMORY_TIERING) &&
- !node_is_toptier(page_nid))
+ !node_is_toptier(page_nid)) {
last_cpupid = (-1 & LAST_CPUPID_MASK);
- else
+ /*
+ * According to the data locality for some workloads, the
+ * probability of accessing some data soon after some nearby
+ * data has been accessed. So for tiered memory systems, we
+ * can update the sequential page's age located on slow memory
+ * type, to try to promote it to fast memory in advance to
+ * improve the performance.
+ */
+ if (vmf->address != fault_address)
+ xchg_page_access_time(page, jiffies_to_msecs(jiffies));
+ } else {
last_cpupid = page_cpupid_last(page);
+ }
target_nid = numa_migrate_prep(page, vma, fault_address, page_nid,
&flags);
if (target_nid == NUMA_NO_NODE) {
On some systems with different memory types, including fast memory (DRAM) and slow memory (persistent memory), which will rely on the numa balancing to promote slow and hot memory to fast memory to improve performance. After supporting the speculative numa fault, we can update the next pages' accessing time to help to promote it to fast memory node easily to improve the performance. Signed-off-by: Baolin Wang <baolin.wang@linux.alibaba.com> --- mm/memory.c | 15 +++++++++++++-- 1 file changed, 13 insertions(+), 2 deletions(-)