From patchwork Tue Dec 15 03:07:40 2020 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Andrew Morton X-Patchwork-Id: 11973755 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-15.7 required=3.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS,INCLUDES_CR_TRAILER, INCLUDES_PATCH,MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id A2490C4361B for ; Tue, 15 Dec 2020 03:07:45 +0000 (UTC) Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by mail.kernel.org (Postfix) with ESMTP id 4EF16223E4 for ; Tue, 15 Dec 2020 03:07:45 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 4EF16223E4 Authentication-Results: mail.kernel.org; dmarc=none (p=none dis=none) header.from=linux-foundation.org Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=owner-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix) id DD2336B00B9; Mon, 14 Dec 2020 22:07:44 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id D5BC38D000D; Mon, 14 Dec 2020 22:07:44 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id C4F028D000C; Mon, 14 Dec 2020 22:07:44 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from forelay.hostedemail.com (smtprelay0002.hostedemail.com [216.40.44.2]) by kanga.kvack.org (Postfix) with ESMTP id A7F0F6B00B9 for ; Mon, 14 Dec 2020 22:07:44 -0500 (EST) Received: from smtpin16.hostedemail.com (10.5.19.251.rfc1918.com [10.5.19.251]) by forelay03.hostedemail.com (Postfix) with ESMTP id 786A98249980 for ; Tue, 15 Dec 2020 03:07:44 +0000 (UTC) X-FDA: 77594031648.16.berry36_17154bc27420 Received: from filter.hostedemail.com (10.5.16.251.rfc1918.com [10.5.16.251]) by smtpin16.hostedemail.com (Postfix) with ESMTP id 53077100E6903 for ; Tue, 15 Dec 2020 03:07:44 +0000 (UTC) X-HE-Tag: berry36_17154bc27420 X-Filterd-Recvd-Size: 5031 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by imf44.hostedemail.com (Postfix) with ESMTP for ; Tue, 15 Dec 2020 03:07:43 +0000 (UTC) Date: Mon, 14 Dec 2020 19:07:40 -0800 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=linux-foundation.org; s=korg; t=1608001663; bh=4gg4bAJ89U4ncUMe/ZEmKTwiujvoFv1IdSjfaZ4etZo=; h=From:To:Subject:In-Reply-To:From; b=LSwvzq8hNMK8aiDDWNMJFbafeZZ7sNWtO0fkyeXjrIwtWNeCKfdU0Gh67tNYoCn3J JijXIMavjs+FSAWCL420d7eig7np2TGIjadOehey+5u4Qq+1Rzm6cVdLdCBOhOJJGz mf+jbQLmIXQKYloqSvaI977dWxsTaEe1fqNFXJbA= From: Andrew Morton To: akpm@linux-foundation.org, almasrymina@google.com, aneesh.kumar@linux.ibm.com, anshuman.khandual@arm.com, arnd@arndb.de, bgeffon@google.com, bp@alien8.de, catalin.marinas@arm.com, christian.brauner@ubuntu.com, dave.hansen@intel.com, frederic@kernel.org, gshan@redhat.com, hnaveed@wavecomp.com, hpa@zytor.com, jhubbard@nvidia.com, justin.he@arm.com, kaleshsingh@google.com, keescook@chromium.org, kirill.shutemov@linux.intel.com, krzk@kernel.org, linux-mm@kvack.org, linuxram@us.ibm.com, lokeshgidra@google.com, mark.rutland@arm.com, masahiroy@kernel.org, mhiramat@kernel.org, minchan@google.com, mingo@redhat.com, mm-commits@vger.kernel.org, peterz@infradead.org, rcampbell@nvidia.com, rppt@kernel.org, samitolvanen@google.com, sandipan@linux.ibm.com, shuah@kernel.org, sjpark@amazon.de, steven.price@arm.com, surenb@google.com, tglx@linutronix.de, torvalds@linux-foundation.org, will@kernel.org, ziy@nvidia.com Subject: [patch 077/200] x86: mremap speedup - Enable HAVE_MOVE_PUD Message-ID: <20201215030740.oGU8cEfZM%akpm@linux-foundation.org> In-Reply-To: <20201214190237.a17b70ae14f129e2dca3d204@linux-foundation.org> User-Agent: s-nail v14.8.16 X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: From: Kalesh Singh Subject: x86: mremap speedup - Enable HAVE_MOVE_PUD HAVE_MOVE_PUD enables remapping pages at the PUD level if both the source and destination addresses are PUD-aligned. With HAVE_MOVE_PUD enabled it can be inferred that there is approximately a 13x improvement in performance on x86. (See data below). ------- Test Results --------- The following results were obtained using a 5.4 kernel, by remapping a PUD-aligned, 1GB sized region to a PUD-aligned destination. The results from 10 iterations of the test are given below: Total mremap times for 1GB data on x86. All times are in nanoseconds. Control HAVE_MOVE_PUD 180394 15089 235728 14056 238931 25741 187330 13838 241742 14187 177925 14778 182758 14728 160872 14418 205813 15107 245722 13998 205721.5 15594 <-- Mean time in nanoseconds A 1GB mremap completion time drops from ~205 microseconds to ~15 microseconds on x86. (~13x speed up). Link: https://lkml.kernel.org/r/20201014005320.2233162-6-kaleshsingh@google.com Signed-off-by: Kalesh Singh Acked-by: Kirill A. Shutemov Acked-by: Ingo Molnar Cc: Thomas Gleixner Cc: Borislav Petkov Cc: H. Peter Anvin Cc: Aneesh Kumar K.V Cc: Anshuman Khandual Cc: Arnd Bergmann Cc: Brian Geffon Cc: Catalin Marinas Cc: Christian Brauner Cc: Dave Hansen Cc: Frederic Weisbecker Cc: Gavin Shan Cc: Hassan Naveed Cc: Jia He Cc: John Hubbard Cc: Kees Cook Cc: Krzysztof Kozlowski Cc: Lokesh Gidra Cc: Mark Rutland Cc: Masahiro Yamada Cc: Masami Hiramatsu Cc: Mike Rapoport Cc: Mina Almasry Cc: Minchan Kim Cc: Peter Zijlstra (Intel) Cc: Ralph Campbell Cc: Ram Pai Cc: Sami Tolvanen Cc: Sandipan Das Cc: SeongJae Park Cc: Shuah Khan Cc: Steven Price Cc: Suren Baghdasaryan Cc: Will Deacon Cc: Zi Yan Signed-off-by: Andrew Morton --- arch/x86/Kconfig | 1 + 1 file changed, 1 insertion(+) --- a/arch/x86/Kconfig~x86-mremap-speedup-enable-have_move_pud +++ a/arch/x86/Kconfig @@ -199,6 +199,7 @@ config X86 select HAVE_MIXED_BREAKPOINTS_REGS select HAVE_MOD_ARCH_SPECIFIC select HAVE_MOVE_PMD + select HAVE_MOVE_PUD select HAVE_NMI select HAVE_OPROFILE select HAVE_OPTPROBES