From patchwork Thu Nov 7 16:10:07 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Patchwork-Submitter: Alexander Lobakin X-Patchwork-Id: 13866693 Received: from mgamail.intel.com (mgamail.intel.com [198.175.65.13]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 897A8216E01; Thu, 7 Nov 2024 16:13:45 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=198.175.65.13 ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1730996027; cv=none; b=SQljbLt0+An862TeCkvlfo6z3MpiWgYDJ9FO8Eyhg1BHm6GSKZR+v4T3HecCgR+dERSy/FrIwu2ckXyjb9hdg7OyXjm9rTDNFjJj9lRk2m3i9bOhxkRpwDhyliNEQeXF5BNvMmLdqd3mR4oMzDxArtTvYrwM+Uk2MdqTVtwPg7E= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1730996027; c=relaxed/simple; bh=2KVewH6gbI7QLh7PVS2aZtluQpiu+xwJYuj2THd+jEs=; h=From:To:Cc:Subject:Date:Message-ID:MIME-Version:Content-Type; b=SgqTlqXujj156i9L8rBwuPH1JqbhrXs8EqEFGtcKSiYnKoh9il6az22eY3OGyH2gLJmtThl8OQV4sbrGxh1zapiXB7piqF9e4lA8R15oddI0HES0P+TMYFOgorS6GlGK9h1jPoC/AHVlYtoEREHMuDXPNoLufKuEIubM3YMsEqQ= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=intel.com; spf=pass smtp.mailfrom=intel.com; dkim=pass (2048-bit key) header.d=intel.com header.i=@intel.com header.b=U0UHJf1F; arc=none smtp.client-ip=198.175.65.13 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=intel.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=intel.com Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=intel.com header.i=@intel.com header.b="U0UHJf1F" DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1730996026; x=1762532026; h=from:to:cc:subject:date:message-id:mime-version: content-transfer-encoding; bh=2KVewH6gbI7QLh7PVS2aZtluQpiu+xwJYuj2THd+jEs=; b=U0UHJf1FehyvNyR4baVv3C2xtrQl88Wk828y0eRt1eYlEpIgxf9pJ/yM w+XMQNK29M/45rY/CqnNoDupPQTwqJuix2+O/9tKfHYB7JsoZigysM7/+ hY07y82R5DRdq1kMx483lJNMBc+aQ2weHGhvsNoMyfvoc+lDhxO8nF8Xa IPocP1V5SPUfCXJ4fR7yet3fNhb9rSd7acFbU8LpCvHNkX9il5Uz8dgFM qPaUzlwW7owZxPTbBO3yHhqj79wnE649FND7n6/E2EV3+f2Yx1mhlKwZZ BDcEf474hgLbMY/Zgeh02h0h2uIyS1bU76mAi0G82ttGvk0gkj4+plOLO g==; X-CSE-ConnectionGUID: mxvHTS4iTk2q69c4t3Uhuw== X-CSE-MsgGUID: Q9axvU3rQDuUyTM0Ikh18A== X-IronPort-AV: E=McAfee;i="6700,10204,11222"; a="41955674" X-IronPort-AV: E=Sophos;i="6.11,199,1725346800"; d="scan'208";a="41955674" Received: from orviesa004.jf.intel.com ([10.64.159.144]) by orvoesa105.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 07 Nov 2024 08:13:45 -0800 X-CSE-ConnectionGUID: Hskh+WlhRZWk5VW1fdVZYA== X-CSE-MsgGUID: zYG0p20DTfOrFToP1OQp0A== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="6.12,135,1728975600"; d="scan'208";a="90258131" Received: from newjersey.igk.intel.com ([10.102.20.203]) by orviesa004.jf.intel.com with ESMTP; 07 Nov 2024 08:13:41 -0800 From: Alexander Lobakin To: "David S. Miller" , Eric Dumazet , Jakub Kicinski , Paolo Abeni Cc: Alexander Lobakin , =?utf-8?q?Toke_H=C3=B8?= =?utf-8?q?iland-J=C3=B8rgensen?= , Alexei Starovoitov , Daniel Borkmann , John Fastabend , Andrii Nakryiko , Maciej Fijalkowski , Stanislav Fomichev , Magnus Karlsson , nex.sw.ncis.osdt.itp.upstreaming@intel.com, bpf@vger.kernel.org, netdev@vger.kernel.org, linux-kernel@vger.kernel.org Subject: [PATCH net-next v4 00/19] xdp: a fistful of generic changes (+libeth_xdp) Date: Thu, 7 Nov 2024 17:10:07 +0100 Message-ID: <20241107161026.2903044-1-aleksander.lobakin@intel.com> X-Mailer: git-send-email 2.47.0 Precedence: bulk X-Mailing-List: bpf@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 X-Patchwork-Delegate: kuba@kernel.org XDP for idpf is currently 5 chapters: * convert Rx to libeth; * convert Tx and stats to libeth; * generic XDP and XSk code changes (this); * actual XDP for idpf via libeth_xdp; * XSk for idpf (^). Part III does the following: * does some cleanups with marking read-only bpf_prog and xdp_buff arguments const for some generic functions; * allows attaching already registered XDP memory model to Rxq info; * allows mixing pages from several Page Pools within one XDP frame; * optimizes &xdp_frame structure and removes no-more-used field; * adds generic functions to build skbs from xdp_buffs (regular and XSk) and attach frags to xdp_buffs (regular and XSk); * adds helper to optimize XSk xmit in drivers; * extends libeth Rx to support XDP requirements (headroom etc.) on Rx; * adds libeth_xdp -- libeth module with common XDP and XSk routines. They are implemented mostly as inlines with inline callback arguments. They will be then uninlined in the drivers with sane function sizes, but without any indirect calls. All those inlines and macros really removes tons of driver code, which is mostly the same across the drivers minus HW-specific part. You just basically need functions which read Rx descriptors and fill Tx descriptors, call a couple macros and that's it. The rest is written once in libeth_xdp. All exception and cold code are external. Error handling etc, anything that don't happen at line rates, are external. Only the hottest things are inlined ensuring driver code doesn't bloat for no gain and that cold code won't push hot code into more cachelines than wanted. Note on diffstat: don't be scared, almost 1500 lines are documentation explaining everything in details. The actual new code is around 2500. Alexander Lobakin (18): jump_label: export static_key_slow_{inc,dec}_cpuslocked() skbuff: allow 2-4-argument skb_frag_dma_map() unroll: add generic loop unroll helpers bpf, xdp: constify some bpf_prog * function arguments xdp, xsk: constify read-only arguments of some static inline helpers xdp: allow attaching already registered memory model to xdp_rxq_info page_pool: make page_pool_put_page_bulk() actually handle array of pages page_pool: allow mixing PPs within one bulk xdp: get rid of xdp_frame::mem.id xdp: add generic xdp_buff_add_frag() xdp: add generic xdp_build_skb_from_buff() xsk: align &xdp_buff_xsk harder xsk: allow attaching XSk pool via xdp_rxq_info_reg_mem_model() xsk: make xsk_buff_add_frag really add a frag via __xdp_buff_add_frag() xsk: add generic XSk &xdp_buff -> skb conversion xsk: add helper to get &xdp_desc's DMA and meta pointer in one go libeth: support native XDP and register memory model libeth: add a couple of XDP helpers (libeth_xdp) Toke Høiland-Jørgensen (1): xdp: register system page pool as an XDP memory model drivers/net/ethernet/intel/libeth/Kconfig | 6 + drivers/net/ethernet/intel/libeth/Makefile | 6 + include/net/libeth/types.h | 102 +- include/net/page_pool/types.h | 6 +- drivers/net/ethernet/intel/libeth/priv.h | 37 + include/linux/bpf.h | 12 +- include/linux/filter.h | 9 +- include/linux/netdevice.h | 7 +- include/linux/skbuff.h | 49 +- include/linux/unroll.h | 43 + include/net/libeth/rx.h | 6 +- include/net/libeth/tx.h | 34 +- include/net/libeth/xdp.h | 1864 +++++++++++++++++ include/net/libeth/xsk.h | 684 ++++++ include/net/xdp.h | 185 +- include/net/xdp_sock_drv.h | 52 +- include/net/xsk_buff_pool.h | 12 +- .../net/ethernet/freescale/dpaa/dpaa_eth.c | 2 +- drivers/net/ethernet/intel/i40e/i40e_xsk.c | 30 +- drivers/net/ethernet/intel/ice/ice_xsk.c | 32 +- drivers/net/ethernet/intel/libeth/rx.c | 22 +- drivers/net/ethernet/intel/libeth/tx.c | 39 + drivers/net/ethernet/intel/libeth/xdp.c | 444 ++++ drivers/net/ethernet/intel/libeth/xsk.c | 264 +++ drivers/net/veth.c | 4 +- kernel/bpf/cpumap.c | 2 +- kernel/bpf/devmap.c | 8 +- kernel/jump_label.c | 2 + net/bpf/test_run.c | 4 +- net/core/dev.c | 20 +- net/core/filter.c | 41 +- net/core/page_pool.c | 60 +- net/core/skbuff.c | 2 +- net/core/xdp.c | 311 ++- net/xdp/xsk_buff_pool.c | 40 + 35 files changed, 4220 insertions(+), 221 deletions(-) create mode 100644 drivers/net/ethernet/intel/libeth/priv.h create mode 100644 include/net/libeth/xdp.h create mode 100644 include/net/libeth/xsk.h create mode 100644 drivers/net/ethernet/intel/libeth/tx.c create mode 100644 drivers/net/ethernet/intel/libeth/xdp.c create mode 100644 drivers/net/ethernet/intel/libeth/xsk.c --- From v3[0]: * rebase on top of the latest net-next to solve conflict (Jakub); * 09: use iterative approach instead of recursive to not blow the stack (Toke); * 12: rephrase the commitmsg since the functionality changed, so that it's not actual anymore (Toke); * align &xdp_buff_xsk a bit harder since its alignment degraded recently; * pick RBs from Toke. From v2[1]: * cover: rename the series; * collect RBs and Acks from Maciej; * 007: reword the commitmsg; * 011: fix typos in the commitmsg (M); * 012: 'ts' -> 'tsize' (M; not 'truesize' to fit into 80 cols =\); * 016: fix the intro sentence (M); * no functional changes. From v1[2]: * rebase on top of the latest net-next; * no other changes. [0] https://lore.kernel.org/netdev/20241030165201.442301-1-aleksander.lobakin@intel.com [1] https://lore.kernel.org/netdev/20241015145350.4077765-1-aleksander.lobakin@intel.com [2] https://lore.kernel.org/netdev/20241009152756.3113697-1-aleksander.lobakin@intel.com