[v3] net/packet: support mergeable feature of virtio

From: Jianfeng Tan <henry.tjf@antgroup.com>

From: Jianfeng Tan <henry.tjf@antgroup.com>

Packet sockets, like tap, can be used as the backend for kernel vhost.
In packet sockets, virtio net header size is currently hardcoded to be
the size of struct virtio_net_hdr, which is 10 bytes; however, it is not
always the case: some virtio features, such as mrg_rxbuf, need virtio
net header to be 12-byte long.

Mergeable buffers, as a virtio feature, is worthy of supporting: packets
that are larger than one-mbuf size will be dropped in vhost worker's
handle_rx if mrg_rxbuf feature is not used, but large packets
cannot be avoided and increasing mbuf's size is not economical.

With this mergeable feature enabled by virtio-user, packet sockets with
hardcoded 10-byte virtio net header will parse mac head incorrectly in
packet_snd by taking the last two bytes of virtio net header as part of
mac header.
This incorrect mac header parsing will cause packet to be dropped due to
invalid ether head checking in later under-layer device packet receiving.

By adding extra field vnet_hdr_sz with utilizing holes in struct
packet_sock to record currently used virtio net header size and supporting
extra sockopt PACKET_VNET_HDR_SZ to set specified vnet_hdr_sz, packet
sockets can know the exact length of virtio net header that virtio user
gives.
In packet_snd, tpacket_snd and packet_recvmsg, instead of using
hardcoded virtio net header size, it can get the exact vnet_hdr_sz from
corresponding packet_sock, and parse mac header correctly based on this
information to avoid the packets being mistakenly dropped.

Besides, has_vnet_hdr field in struct packet_sock is removed since all 
the information it provides is covered by vnet_hdr_sz field: a packet
socket has a vnet header if and only if its vnet_hdr_sz is not zero.

Signed-off-by: Jianfeng Tan <henry.tjf@antgroup.com>
Co-developed-by: Anqi Shen <amy.saq@antgroup.com>
Signed-off-by: Anqi Shen <amy.saq@antgroup.com>
---

V2 -> V3:
* remove has_vnet_hdr field and use vnet_hdr_sz to indicate whether
there is a vnet header;
* refactor PACKET_VNET_HDR and PACKET_VNET_HDR_SZ sockopt to remove
redundant code.

 include/uapi/linux/if_packet.h |  1 +
 net/packet/af_packet.c         | 82 ++++++++++++++++++++++++++++--------------
 net/packet/diag.c              |  2 +-
 net/packet/internal.h          |  4 +--
 4 files changed, 60 insertions(+), 29 deletions(-)

Message ID	1678168911-337042-1-git-send-email-amy.saq@antgroup.com (mailing list archive)
State	Superseded
Delegated to:	Netdev Maintainers
Headers	show Return-Path: <netdev-owner@vger.kernel.org> X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id C0B74C678D5 for <netdev@archiver.kernel.org>; Tue, 7 Mar 2023 06:02:04 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230366AbjCGGCD (ORCPT <rfc822;netdev@archiver.kernel.org>); Tue, 7 Mar 2023 01:02:03 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:37662 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230321AbjCGGCA (ORCPT <rfc822;netdev@vger.kernel.org>); Tue, 7 Mar 2023 01:02:00 -0500 Received: from out0-219.mail.aliyun.com (out0-219.mail.aliyun.com [140.205.0.219]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 0C3DE75854 for <netdev@vger.kernel.org>; Mon, 6 Mar 2023 22:01:54 -0800 (PST) X-Alimail-AntiSpam: AC=PASS;BC=-1\|-1;BR=01201311R191e4;CH=green;DM=\|\|false\|;DS=\|\|;FP=0\|-1\|-1\|-1\|0\|-1\|-1\|-1;HT=ay29a033018047194;MF=amy.saq@antgroup.com;NM=1;PH=DS;RN=7;SR=0;TI=SMTPD_---.RfuWwi1_1678168912; Received: from localhost(mailfrom:amy.saq@antgroup.com fp:SMTPD_---.RfuWwi1_1678168912) by smtp.aliyun-inc.com; Tue, 07 Mar 2023 14:01:52 +0800 From: " =?utf-8?b?5rKI5a6J55CqKOWHm+eOpSk=?= " <amy.saq@antgroup.com> To: netdev@vger.kernel.org Cc: <willemdebruijn.kernel@gmail.com>, <mst@redhat.com>, <davem@davemloft.net>, <jasowang@redhat.com>, " =?utf-8?b?6LCI6Ym06ZSL?= " <henry.tjf@antgroup.com>, " =?utf-8?b?5rKI5a6J?= =?utf-8?b?55CqKOWHm+eOpSk=?= " <amy.saq@antgroup.com> Subject: [PATCH v3] net/packet: support mergeable feature of virtio Date: Tue, 07 Mar 2023 14:01:51 +0800 Message-Id: <1678168911-337042-1-git-send-email-amy.saq@antgroup.com> X-Mailer: git-send-email 1.8.3.1 Precedence: bulk List-ID: <netdev.vger.kernel.org> X-Mailing-List: netdev@vger.kernel.org X-Patchwork-Delegate: kuba@kernel.org
Series	[v3] net/packet: support mergeable feature of virtio \| expand [v3] net/packet: support mergeable feature of virtio

Context	Check	Description
netdev/series_format	warning	Single patches do not need cover letters; Target tree name not specified in the subject
netdev/tree_selection	success	Guessed tree name to be net-next
netdev/fixes_present	success	Fixes tag not required for -next series
netdev/header_inline	success	No static functions without inline keyword in header files
netdev/build_32bit	success	Errors and warnings before: 5332 this patch: 5332
netdev/cc_maintainers	warning	3 maintainers not CCed: kuba@kernel.org edumazet@google.com pabeni@redhat.com
netdev/build_clang	success	Errors and warnings before: 1074 this patch: 1074
netdev/verify_signedoff	success	Signed-off-by tag matches author and committer
netdev/deprecated_api	success	None detected
netdev/check_selftest	success	No net selftest shell script
netdev/verify_fixes	success	No Fixes tag
netdev/build_allmodconfig_warn	success	Errors and warnings before: 5540 this patch: 5540
netdev/checkpatch	warning	WARNING: line length of 82 exceeds 80 columns WARNING: line length of 94 exceeds 80 columns WARNING: line length of 96 exceeds 80 columns
netdev/kdoc	success	Errors and warnings before: 0 this patch: 0
netdev/source_inline	success	Was 0 now: 0

[v3] net/packet: support mergeable feature of virtio

Checks

Commit Message

Comments

Patch