[36/44] drm/amdkfd: Fix spurious restore failures

Message ID 20210322105900.14068-37-Felix.Kuehling@amd.com (mailing list archive)
Series Add HMM-based SVM memory manager to KFD v2

Felix Kuehling March 22, 2021, 10:58 a.m. UTC
Restore can appear to fail if the svms->evicted_ranges counter changes
before the function can acquire the necessary locks. Re-read the counter
after acquiring the lock to minimize the chances of having to reschedule
the restore work.

Change-Id: I236b912bddf106583be264abde2f6bd1a5d5a083
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
 drivers/gpu/drm/amd/amdkfd/kfd_svm.c | 2 ++
 1 file changed, 2 insertions(+)
diff --git a/drivers/gpu/drm/amd/amdkfd/kfd_svm.c b/drivers/gpu/drm/amd/amdkfd/kfd_svm.c
index 0fbc037b06e3..49aca4664411 100644
--- a/drivers/gpu/drm/amd/amdkfd/kfd_svm.c
+++ b/drivers/gpu/drm/amd/amdkfd/kfd_svm.c
@@ -1429,6 +1429,8 @@  static void svm_range_restore_work(struct work_struct *work)
 	svm_range_list_lock_and_flush_work(svms, mm);
+	evicted_ranges = atomic_read(&svms->evicted_ranges);
 	list_for_each_entry(prange, &svms->list, list) {
 		invalid = atomic_read(&prange->invalid);
 		if (!invalid)
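
The following is a minimal userspace sketch of the pattern this patch describes, not the driver code. It assumes, without the hunk showing it, that the worker later compares its snapshot of the counter against the live value and reschedules on a mismatch. The names `range_list`, `evict_one`, and `restore_pass` are hypothetical, and C11 atomics and a pthread mutex stand in for the kernel's `atomic_t` and locking helpers.

```c
#include <assert.h>
#include <pthread.h>
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical stand-in for the per-process SVM range list. */
struct range_list {
	pthread_mutex_t lock;
	atomic_int evicted_ranges;	/* bumped by evictions without taking the lock */
};

/* Simulates one eviction racing with the restore worker. */
static void evict_one(struct range_list *s)
{
	atomic_fetch_add(&s->evicted_ranges, 1);
}

/*
 * One restore pass. racing_evictions simulates evictions that land between
 * the early snapshot and the lock acquisition. Returns true if the counter
 * was reset to zero, false if the pass would have to be rescheduled.
 */
static bool restore_pass(struct range_list *s, int racing_evictions,
			 bool reread_after_lock)
{
	int snapshot, expected, i;
	bool done;

	assert(racing_evictions >= 0);

	snapshot = atomic_load(&s->evicted_ranges);	/* early snapshot */
	if (!snapshot)
		return true;				/* nothing to restore */

	for (i = 0; i < racing_evictions; i++)		/* race window */
		evict_one(s);

	pthread_mutex_lock(&s->lock);
	if (reread_after_lock)				/* the idea behind the fix */
		snapshot = atomic_load(&s->evicted_ranges);

	/* ... ranges would be revalidated here, under the lock ... */

	/* Assumed completion check: reset only if nothing changed since the snapshot. */
	expected = snapshot;
	done = atomic_compare_exchange_strong(&s->evicted_ranges, &expected, 0);
	pthread_mutex_unlock(&s->lock);

	return done;
}
```

With `reread_after_lock` set to false, an eviction inside the race window leaves the snapshot stale, so the final compare fails even though the pass did its work. Re-reading under the lock shrinks that window to the time spent restoring, which is why the commit message speaks of minimizing rather than eliminating reschedules.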