From patchwork Fri Dec 7 06:16:18 2018 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Nicolas Boichat X-Patchwork-Id: 10717487 Return-Path: Received: from mail.wl.linuxfoundation.org (pdx-wl-mail.web.codeaurora.org [172.30.200.125]) by pdx-korg-patchwork-2.web.codeaurora.org (Postfix) with ESMTP id 3691314E2 for ; Fri, 7 Dec 2018 06:16:52 +0000 (UTC) Received: from mail.wl.linuxfoundation.org (localhost [127.0.0.1]) by mail.wl.linuxfoundation.org (Postfix) with ESMTP id 23F002EB43 for ; Fri, 7 Dec 2018 06:16:52 +0000 (UTC) Received: by mail.wl.linuxfoundation.org (Postfix, from userid 486) id 1797D2EB2B; Fri, 7 Dec 2018 06:16:52 +0000 (UTC) X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on pdx-wl-mail.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-3.0 required=2.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,MAILING_LIST_MULTI,RCVD_IN_DNSWL_NONE autolearn=ham version=3.3.1 Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by mail.wl.linuxfoundation.org (Postfix) with ESMTP id 64FFC2EB43 for ; Fri, 7 Dec 2018 06:16:51 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 3F3C36B7EE5; Fri, 7 Dec 2018 01:16:50 -0500 (EST) Delivered-To: linux-mm-outgoing@kvack.org Received: by kanga.kvack.org (Postfix, from userid 40) id 379C76B7EE6; Fri, 7 Dec 2018 01:16:50 -0500 (EST) X-Original-To: int-list-linux-mm@kvack.org X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 1F5C66B7EE7; Fri, 7 Dec 2018 01:16:50 -0500 (EST) X-Original-To: linux-mm@kvack.org X-Delivered-To: linux-mm@kvack.org Received: from mail-pf1-f197.google.com (mail-pf1-f197.google.com [209.85.210.197]) by kanga.kvack.org (Postfix) with ESMTP id CDF0D6B7EE5 for ; Fri, 7 Dec 2018 01:16:49 -0500 (EST) Received: by mail-pf1-f197.google.com with SMTP id n17so2428727pfk.23 for ; Thu, 06 Dec 2018 22:16:49 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:dkim-signature:from:to:cc:subject:date :message-id:in-reply-to:references:mime-version :content-transfer-encoding; bh=NzK+v8SD4i+VFYwyauHYt8uHII5JYwJbxV/psygL2NI=; b=VPx7ESQmH29gU75/VMIN2L605KTLKRiNY1kIzRAFSWtHwxJIrnMnlGYsbWuSJHe/iF Z4GW+J+auAPvuc0jgYy0Ql3PuuyuLKoapT2dqD1M1ck0RGxSpsApBzJGx894C4B/2dWo oo26ZQsn5/kT89Uif2R6ItVgbzlNKa26Vw7UyVWGGlEyPYNuVBzmNw+Xsqb0m501pcJl YzyZJEHTVVepP51MrcIkl/N8P2SVvp73WVCoabW5KM30pDHUom4aVTslO4/54O+08qRq 9u02EdExFhg3RE7KvBkdnEvNqdgEwKIVscrGyMNaAwV3bOHB1nLWd8/FURjlRVdqw3zW kSHg== X-Gm-Message-State: AA+aEWaCfZ5XPTzt4kZ1/ZZwAtlFDXXIysgXIjf9OrOdp6BI1kBRgHmf mOrWK9oeitC0mFCCJJU2e9nhaN1+e6g60TZ/CV9T3QpP9dYOtidSuPvYJE//dszotPNI8e7DAgB QJ3CL58s/B77MMzII30GpOPiMJSHiRwZyVbFoTapy0gR04oWZeaP+767lGKEOR5FF49uNXPc2sg EshP+NS4c4EN9MhM/moEZciuXCDFN0nlyWdH4epe37DuSQeWG1umbVhGi98WwUhiA956y4suJsF 5dko42+SC9gSJ9hXsNWmVghfDBa1HGt2CVlvicjDASIifVZaZpbTPt/FDEmhWWg41DSMduLRQNE fnIqVq6nAoQEkUhNF3s7p7MCBnA5tKLedptSGsbKejaAi9hBgxebOlqJMeHyAtrBfSAkuyf76ED W X-Received: by 2002:a62:7c47:: with SMTP id x68mr1001480pfc.209.1544163409415; Thu, 06 Dec 2018 22:16:49 -0800 (PST) X-Received: by 2002:a62:7c47:: with SMTP id x68mr1001434pfc.209.1544163408209; Thu, 06 Dec 2018 22:16:48 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1544163408; cv=none; d=google.com; s=arc-20160816; b=MPRTnQ5Bs/JjSdgZSf3uPx73m7IgtcrZvkXLxLN3TItHzzpiOKpQMet5yALQqmJbJM gdUdSpHodnLRKa4H1hMFoEsZ/dpGh0mgRwUfmsht1KDqg7IhMYUsIosjJRRKr7+KJpYs V9gJPUCvq1LATu4KdA5VcxjFL2XTJRnwP1QtLq8iK4YN6osAiLd6CzBhtAFumz6qpOqD O5z89NjWxNf39l3LLBdiDGHignCyhN66ZDFLxhcZDJH5RmtRm4/W6DznBEZyPrFER6QW k+QX2KsQdodknwO0xyx3Xf61qJ9sfrqhQe76kvAYminO/IrnH6WWalfL5/OLXhdPuoI2 r9wg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:dkim-signature; bh=NzK+v8SD4i+VFYwyauHYt8uHII5JYwJbxV/psygL2NI=; b=mEB+1Qb7s9ul8hya+E1dyfcoP76FoaQ/gn5FYAceL65tzfeIUgMvr6eYDGqwev30nO AJAlIMR0bbiQz7vVlpT8TEKUvvqmG/is3zKxj/aFBybD3CsFngpDXvAYBQgXMyfldhIY ZbrEOZijLKnlw1HqcOlvS8y+LY4lMC56fIJJ7yWIgKCB00fw04bvr4/Txz5pvmY7fp1C Q9DwUwU+v9luK7wbKJeyysGgkKo8abvsvd2fcuQJUd77RbUzFBfmbm0/anxTgkWkw5ZI NJkH5CvKgzbz9+EbrkowvnyBMxcy3Xk+tZIuhSJUKz1DZhkwZjhgj1/KCDtfWb79zorX Kalg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@chromium.org header.s=google header.b=GCAm1IN+; spf=pass (google.com: domain of drinkcat@chromium.org designates 209.85.220.65 as permitted sender) smtp.mailfrom=drinkcat@chromium.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=chromium.org Received: from mail-sor-f65.google.com (mail-sor-f65.google.com. [209.85.220.65]) by mx.google.com with SMTPS id d82sor4208737pfm.4.2018.12.06.22.16.48 for (Google Transport Security); Thu, 06 Dec 2018 22:16:48 -0800 (PST) Received-SPF: pass (google.com: domain of drinkcat@chromium.org designates 209.85.220.65 as permitted sender) client-ip=209.85.220.65; Authentication-Results: mx.google.com; dkim=pass header.i=@chromium.org header.s=google header.b=GCAm1IN+; spf=pass (google.com: domain of drinkcat@chromium.org designates 209.85.220.65 as permitted sender) smtp.mailfrom=drinkcat@chromium.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=chromium.org DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=chromium.org; s=google; h=from:to:cc:subject:date:message-id:in-reply-to:references :mime-version:content-transfer-encoding; bh=NzK+v8SD4i+VFYwyauHYt8uHII5JYwJbxV/psygL2NI=; b=GCAm1IN+ZRxbOc7GN7uNAXJJ8+9asqSko3ZUYgKImfZU63d/DjtitlkotDu3aXwDj5 Iv1yHuvKraDNMyDD480Kx7aNGM25sdgHZaG9Wn7gXCpdWregqeUEyJQiQWwRHOr5rxZN LPaXQhG3devZMIz45D0m+r/ow6oBxqByOhh60= X-Google-Smtp-Source: AFSGD/Xhr/g/OFHfKU7UGrWKwjX5Riip3DtlLhjvwMxfgqPTmR5dk2b/UOS+VKUsgAbbsuT+kNuVPw== X-Received: by 2002:a62:4d81:: with SMTP id a123mr1066512pfb.122.1544163407669; Thu, 06 Dec 2018 22:16:47 -0800 (PST) Received: from drinkcat2.tpe.corp.google.com ([2401:fa00:1:b:f659:7f17:ea11:4e8e]) by smtp.gmail.com with ESMTPSA id z30-v6sm2995944pfg.41.2018.12.06.22.16.43 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Thu, 06 Dec 2018 22:16:46 -0800 (PST) From: Nicolas Boichat To: Will Deacon Cc: Robin Murphy , Joerg Roedel , Christoph Lameter , Pekka Enberg , David Rientjes , Joonsoo Kim , Andrew Morton , Vlastimil Babka , Michal Hocko , Mel Gorman , Levin Alexander , Huaisheng Ye , Mike Rapoport , linux-arm-kernel@lists.infradead.org, iommu@lists.linux-foundation.org, linux-kernel@vger.kernel.org, linux-mm@kvack.org, Yong Wu , Matthias Brugger , Tomasz Figa , yingjoe.chen@mediatek.com, hch@infradead.org, Matthew Wilcox Subject: [PATCH v5 1/3] mm: Add support for kmem caches in DMA32 zone Date: Fri, 7 Dec 2018 14:16:18 +0800 Message-Id: <20181207061620.107881-2-drinkcat@chromium.org> X-Mailer: git-send-email 2.20.0.rc2.403.gdbc3b29805-goog In-Reply-To: <20181207061620.107881-1-drinkcat@chromium.org> References: <20181207061620.107881-1-drinkcat@chromium.org> MIME-Version: 1.0 X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: X-Virus-Scanned: ClamAV using ClamSMTP IOMMUs using ARMv7 short-descriptor format require page tables to be allocated within the first 4GB of RAM, even on 64-bit systems. On arm64, this is done by passing GFP_DMA32 flag to memory allocation functions. For IOMMU L2 tables that only take 1KB, it would be a waste to allocate a full page using get_free_pages, so we considered 3 approaches: 1. This patch, adding support for GFP_DMA32 slab caches. 2. genalloc, which requires pre-allocating the maximum number of L2 page tables (4096, so 4MB of memory). 3. page_frag, which is not very memory-efficient as it is unable to reuse freed fragments until the whole page is freed. This change makes it possible to create a custom cache in DMA32 zone using kmem_cache_create, then allocate memory using kmem_cache_alloc. We do not create a DMA32 kmalloc cache array, as there are currently no users of kmalloc(..., GFP_DMA32). These calls will continue to trigger a warning, as we keep GFP_DMA32 in GFP_SLAB_BUG_MASK. This implies that calls to kmem_cache_*alloc on a SLAB_CACHE_DMA32 kmem_cache must _not_ use GFP_DMA32 (it is anyway redundant and unnecessary). Signed-off-by: Nicolas Boichat Acked-by: Vlastimil Babka --- Changes since v2: - Clarified commit message - Add entry in sysfs-kernel-slab to document the new sysfs file (v3 used the page_frag approach) Changes since v4: - Added details to commit message - Dropped change that removed GFP_DMA32 from GFP_SLAB_BUG_MASK: instead we can just call kmem_cache_*alloc without GFP_DMA32 parameter. This also means that we can drop PATCH 1/3, as we do not make any changes in GFP flag verification. - Dropped hunks that added cache_dma32 sysfs file, and moved the hunks to PATCH 3/3, so that maintainer can decide whether to pick the change independently. include/linux/slab.h | 2 ++ mm/slab.c | 2 ++ mm/slab.h | 3 ++- mm/slab_common.c | 2 +- mm/slub.c | 5 +++++ 5 files changed, 12 insertions(+), 2 deletions(-) diff --git a/include/linux/slab.h b/include/linux/slab.h index 11b45f7ae4057c..9449b19c5f107a 100644 --- a/include/linux/slab.h +++ b/include/linux/slab.h @@ -32,6 +32,8 @@ #define SLAB_HWCACHE_ALIGN ((slab_flags_t __force)0x00002000U) /* Use GFP_DMA memory */ #define SLAB_CACHE_DMA ((slab_flags_t __force)0x00004000U) +/* Use GFP_DMA32 memory */ +#define SLAB_CACHE_DMA32 ((slab_flags_t __force)0x00008000U) /* DEBUG: Store the last owner for bug hunting */ #define SLAB_STORE_USER ((slab_flags_t __force)0x00010000U) /* Panic if kmem_cache_create() fails */ diff --git a/mm/slab.c b/mm/slab.c index 73fe23e649c91a..124f8c556d27fb 100644 --- a/mm/slab.c +++ b/mm/slab.c @@ -2109,6 +2109,8 @@ int __kmem_cache_create(struct kmem_cache *cachep, slab_flags_t flags) cachep->allocflags = __GFP_COMP; if (flags & SLAB_CACHE_DMA) cachep->allocflags |= GFP_DMA; + if (flags & SLAB_CACHE_DMA32) + cachep->allocflags |= GFP_DMA32; if (flags & SLAB_RECLAIM_ACCOUNT) cachep->allocflags |= __GFP_RECLAIMABLE; cachep->size = size; diff --git a/mm/slab.h b/mm/slab.h index 4190c24ef0e9df..fcf717e12f0a86 100644 --- a/mm/slab.h +++ b/mm/slab.h @@ -127,7 +127,8 @@ static inline slab_flags_t kmem_cache_flags(unsigned int object_size, /* Legal flag mask for kmem_cache_create(), for various configurations */ -#define SLAB_CORE_FLAGS (SLAB_HWCACHE_ALIGN | SLAB_CACHE_DMA | SLAB_PANIC | \ +#define SLAB_CORE_FLAGS (SLAB_HWCACHE_ALIGN | SLAB_CACHE_DMA | \ + SLAB_CACHE_DMA32 | SLAB_PANIC | \ SLAB_TYPESAFE_BY_RCU | SLAB_DEBUG_OBJECTS ) #if defined(CONFIG_DEBUG_SLAB) diff --git a/mm/slab_common.c b/mm/slab_common.c index 70b0cc85db67f8..18b7b809c8d064 100644 --- a/mm/slab_common.c +++ b/mm/slab_common.c @@ -53,7 +53,7 @@ static DECLARE_WORK(slab_caches_to_rcu_destroy_work, SLAB_FAILSLAB | SLAB_KASAN) #define SLAB_MERGE_SAME (SLAB_RECLAIM_ACCOUNT | SLAB_CACHE_DMA | \ - SLAB_ACCOUNT) + SLAB_CACHE_DMA32 | SLAB_ACCOUNT) /* * Merge control. If this is set then no merging of slab caches will occur. diff --git a/mm/slub.c b/mm/slub.c index c229a9b7dd5448..4caadb926838ef 100644 --- a/mm/slub.c +++ b/mm/slub.c @@ -3583,6 +3583,9 @@ static int calculate_sizes(struct kmem_cache *s, int forced_order) if (s->flags & SLAB_CACHE_DMA) s->allocflags |= GFP_DMA; + if (s->flags & SLAB_CACHE_DMA32) + s->allocflags |= GFP_DMA32; + if (s->flags & SLAB_RECLAIM_ACCOUNT) s->allocflags |= __GFP_RECLAIMABLE; @@ -5671,6 +5674,8 @@ static char *create_unique_id(struct kmem_cache *s) */ if (s->flags & SLAB_CACHE_DMA) *p++ = 'd'; + if (s->flags & SLAB_CACHE_DMA32) + *p++ = 'D'; if (s->flags & SLAB_RECLAIM_ACCOUNT) *p++ = 'a'; if (s->flags & SLAB_CONSISTENCY_CHECKS)