From patchwork Tue Oct 22 09:27:29 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Yicong Yang X-Patchwork-Id: 13845466 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from bombadil.infradead.org (bombadil.infradead.org [198.137.202.133]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.lore.kernel.org (Postfix) with ESMTPS id 630F5D1CDCA for ; Tue, 22 Oct 2024 09:50:00 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=lists.infradead.org; s=bombadil.20210309; h=Sender:List-Subscribe:List-Help :List-Post:List-Archive:List-Unsubscribe:List-Id:Content-Type: Content-Transfer-Encoding:MIME-Version:Message-ID:Date:Subject:CC:To:From: Reply-To:Content-ID:Content-Description:Resent-Date:Resent-From:Resent-Sender :Resent-To:Resent-Cc:Resent-Message-ID:In-Reply-To:References:List-Owner; bh=3+bg4W6p8j3w8dsc7XpXmn7YkN4oYDzpflM3eqBT04s=; b=iXJ8Jl4JgEJXP+ISnzY/DssuFm W76x1q5uAhVijh5vDaQZ6nFToY9jzugpLIHuBCyf+xg49zjLDsrf5V190vCDvyBBFFSYA8C7fvu/9 jDL/qEYLTbdNz1y6pK9jQKQw+iTpNIHDe+N2LH7JaayVEYPLv8mqxXCSPDkcu0rui+Eu0Xk+lBltY VOl5DqkhTcBANJlX2lq3p2llVYTGi+X8so3oPtFdhmbyJdULE0uYGPLKU1vY4UmquHjV+M+X+Ilzx 3S5zFCAo7fdnafXEIfBh1x3/Q+FuNq4OCnlqK2x2uxTvAu0dp51qrOyaXlTyF5G4NvOjkdzs7l2Ks 7odjjJLg==; Received: from localhost ([::1] helo=bombadil.infradead.org) by bombadil.infradead.org with esmtp (Exim 4.98 #2 (Red Hat Linux)) id 1t3BWN-0000000AQSN-0kAy; Tue, 22 Oct 2024 09:49:47 +0000 Received: from szxga05-in.huawei.com ([45.249.212.191]) by bombadil.infradead.org with esmtps (Exim 4.98 #2 (Red Hat Linux)) id 1t3BBM-0000000ALsO-1eCH for linux-arm-kernel@lists.infradead.org; Tue, 22 Oct 2024 09:28:06 +0000 Received: from mail.maildlp.com (unknown [172.19.163.17]) by szxga05-in.huawei.com (SkyGuard) with ESMTP id 4XXmvD4Jztz1HLQj; Tue, 22 Oct 2024 17:23:36 +0800 (CST) Received: from kwepemd200014.china.huawei.com (unknown [7.221.188.8]) by mail.maildlp.com (Postfix) with ESMTPS id 626C71A0190; Tue, 22 Oct 2024 17:27:56 +0800 (CST) Received: from localhost.localdomain (10.50.165.33) by kwepemd200014.china.huawei.com (7.221.188.8) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.1258.34; Tue, 22 Oct 2024 17:27:55 +0800 From: Yicong Yang To: , , , , , CC: , , , , , , , , Subject: [PATCH v3 0/5] Support Armv8.9/v9.4 FEAT_HAFT Date: Tue, 22 Oct 2024 17:27:29 +0800 Message-ID: <20241022092734.59984-1-yangyicong@huawei.com> X-Mailer: git-send-email 2.31.0 MIME-Version: 1.0 X-Originating-IP: [10.50.165.33] X-ClientProxiedBy: dggems702-chm.china.huawei.com (10.3.19.179) To kwepemd200014.china.huawei.com (7.221.188.8) X-CRM114-Version: 20100106-BlameMichelson ( TRE 0.8.0 (BSD) ) MR-646709E3 X-CRM114-CacheID: sfid-20241022_022804_970989_78760948 X-CRM114-Status: GOOD ( 12.71 ) X-BeenThere: linux-arm-kernel@lists.infradead.org X-Mailman-Version: 2.1.34 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: "linux-arm-kernel" Errors-To: linux-arm-kernel-bounces+linux-arm-kernel=archiver.kernel.org@lists.infradead.org From: Yicong Yang This series adds basic support for FEAT_HAFT introduced in Armv8.9/v9.4 and enable ARCH_HAS_NONLEAF_PMD_YOUNG. The latter will be used in lru-gen aging. Tested with lru-gen in below steps: 1. Generate a 1GiB workingset by `stress-ng --vm 1`. Then hang the task to stop accessing the memory. (AF bit won't be updated) 2. try to age the memory by /sys/kernel/debug/lru_gen Run above steps with LRU_GEN_NONLEAF_YOUNG(0x4) and not respectively (switching by /sys/kernel/mm/lru_gen/enabled). LRU_GEN_NONLEAF_YOUNG will clear and test the PMD AF bit on page walking for aging, otherwise will clear and test the PTE AF bit for aging. In this case LRU_GEN_NONLEAF_YOUNG will improve the efficiency of page scanning since pages won't be accessed and we don't need to scan each PTE. Observed ~40% time saved for 1GiB memory on our emulated platform with LRU_GEN_NONLEAF_YOUNG. For lru-gen aging: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/Documentation/admin-guide/mm/multigen_lru.rst?h=v6.11-rc1#n94 Change since v2: - Address comments per Will and Catalin: o detect and enable the feature in __cpu_setup() o allow online the CPU that doesn't have this feature and mismatch with the boot CPU o only advertise the feature if it's enabled system widely o set AF bit for kernel page table entries to save later hardware update o warn unexpected pmdp_test_and_clear_young() - Update all the new AA64MMFR1_EL1 fields per Mark Link: https://lore.kernel.org/linux-arm-kernel/20240814092333.7727-1-yangyicong@huawei.com/ Change since v1: - Address comments from Marc, improve comments/Kconfig, clean code. Thanks for the comments. Link: https://lore.kernel.org/linux-arm-kernel/20240802093458.32683-1-yangyicong@huawei.com/ Yicong Yang (5): arm64/sysreg: Update ID_AA64MMFR1_EL1 register arm64: setup: name 'tcr2' register arm64: Add support for FEAT_HAFT arm64: Enable ARCH_HAS_NONLEAF_PMD_YOUNG arm64: pgtable: Warn unexpected pmdp_test_and_clear_young() arch/arm64/Kconfig | 16 ++++++++++++++++ arch/arm64/include/asm/cpufeature.h | 24 ++++++++++++++++++++++++ arch/arm64/include/asm/pgalloc.h | 9 +++++---- arch/arm64/include/asm/pgtable-hwdef.h | 4 ++++ arch/arm64/include/asm/pgtable.h | 16 ++++++++++++++-- arch/arm64/kernel/cpufeature.c | 23 +++++++++++++++++++++++ arch/arm64/mm/fixmap.c | 9 ++++++--- arch/arm64/mm/mmu.c | 8 ++++---- arch/arm64/mm/proc.S | 20 +++++++++++++++++--- arch/arm64/tools/cpucaps | 1 + arch/arm64/tools/sysreg | 4 ++++ 11 files changed, 118 insertions(+), 16 deletions(-)