
[net-next] net/smc: introduce shadow sockets for fallback connections

Message ID 20230321071959.87786-1-KaiShen@linux.alibaba.com (mailing list archive)
State Handled Elsewhere
Series [net-next] net/smc: introduce shadow sockets for fallback connections

Commit Message

Kai March 21, 2023, 7:19 a.m. UTC
SMC-R currently performs poorly in fallback situations, especially
in short-link server fallback scenarios. We are planning to make
SMC-R widely used, and handling this fallback performance issue is
crucial to us. Here we introduce a shadow socket method to relieve
this problem.

Basically, we use two extra accept queues to hold incoming
connections, one for fallback connections and the other for SMC-R
connections. We implement this with two extra 'shadow' sockets and
make the connection path of fallback connections almost the same as
that of normal TCP connections.

The current SMC-R accept path is:
  1. incoming connection
  2. schedule work to allocate an smc sock, do the tcp accept and
     push the connection to the smc acceptq
  3. wake up the user to accept
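
In rough pseudo-C, steps 2 and 3 correspond to the simplified view
below of smc_clcsock_data_ready() and smc_tcp_listen_work() before
this patch (locking, error handling and the CLC handshake are
omitted; the *_simplified names are only for illustration):

	/* step 1: TCP completed the 3WHS; the request sits on the
	 * listening clcsock's accept queue
	 */
	static void smc_data_ready_simplified(struct sock *listen_clcsock)
	{
		struct smc_sock *lsmc = smc_clcsock_user_data(listen_clcsock);

		/* step 2 is deferred to a worker for every incoming
		 * connection, fallback or not
		 */
		sock_hold(&lsmc->sk);
		if (!queue_work(smc_tcp_ls_wq, &lsmc->tcp_listen_work))
			sock_put(&lsmc->sk);
	}

	static void smc_listen_work_simplified(struct smc_sock *lsmc)
	{
		struct smc_sock *new_smc;

		/* step 2: allocate an smc sock, tcp-accept the new
		 * connection onto it and push it to the smc acceptq
		 */
		while (smc_clcsock_accept(lsmc, &new_smc) == 0)
			/* step 3: wake up the user blocked in accept() */
			lsmc->sk.sk_data_ready(&lsmc->sk);
	}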

When fallback happens on the server, the accept path is the same,
which costs more than the normal TCP accept path. In fallback
situations, step 2 above is not necessary and the smc sock is not
needed either. So we create two shadow sockets when an smc socket
starts listening. When a new connection comes in, we pop the req to
the fallback socket acceptq or the non-fallback socket acceptq
according to its syn_smc flag. As a result, when fallback happens we
can graft the user socket with a normal tcp sock instead of an smc
sock and get rid of the cost of step 2 and of releasing the smc
sock.

               +-----> non-fallback socket acceptq
               |
incoming req --+
               |
               +-----> fallback socket acceptq
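
In pseudo-C the dispatch amounts to the snippet below; the complete
helper, smc_sock_pop_to_another_acceptq(), is in the diff:

	/* pop the ready request from the listening clcsock's accept
	 * queue and requeue it on the shadow socket that matches its
	 * syn_smc flag
	 */
	req = reqsk_queue_remove(&inet_csk(lsk)->icsk_accept_queue, lsk);
	if (tcp_sk(req->sk)->syn_smc || lsmc->sockopt_defer_accept)
		dst_sock = lsmc->actsock->sk;	/* SMC-R handshake path */
	else
		dst_sock = lsmc->fbsock->sk;	/* plain TCP fast path */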

With the help of shadow sockets, we get performance similar to TCP
connections in short-link nginx server fallback scenarios, as
illustrated below.

Cases are of the form "./wrk http://x.x.x.x:x/
	-H 'Connection: Close' -c 1600 -t 32 -d 20 --latency"

TCP:
    Requests/sec: 145438.65
    Transfer/sec:     21.64MB

Server fallback on original SMC-R:
    Requests/sec: 114192.82
    Transfer/sec:     16.99MB

Server fallback on SMC-R with shadow sockets:
    Requests/sec: 143528.11
    Transfer/sec:     21.35MB

On the other hand, since the request now sits on a shadow socket's
accept queue, that queue's fastopenq lock is not the right lock to
take when accepting. So we need to look up the original listener's
fastopenq lock in inet_csk_accept().

Signed-off-by: Kai Shen <KaiShen@linux.alibaba.com>
---
 net/ipv4/inet_connection_sock.c |  13 ++-
 net/smc/af_smc.c                | 143 ++++++++++++++++++++++++++++++--
 net/smc/smc.h                   |   2 +
 3 files changed, 150 insertions(+), 8 deletions(-)

Comments

Paolo Abeni March 22, 2023, 1:08 p.m. UTC | #1
On Tue, 2023-03-21 at 07:19 +0000, Kai Shen wrote:
> SMC-R currently performs poorly in fallback situations, especially
> in short-link server fallback scenarios. We are planning to make
> SMC-R widely used, and handling this fallback performance issue is
> crucial to us. Here we introduce a shadow socket method to relieve
> this problem.
> 
> Basically, we use two extra accept queues to hold incoming
> connections, one for fallback connections and the other for SMC-R
> connections. We implement this with two extra 'shadow' sockets and
> make the connection path of fallback connections almost the same as
> that of normal TCP connections.
> 
> The current SMC-R accept path is:
>   1. incoming connection
>   2. schedule work to allocate an smc sock, do the tcp accept and
>      push the connection to the smc acceptq
>   3. wake up the user to accept
> 
> When fallback happens on the server, the accept path is the same,
> which costs more than the normal TCP accept path. In fallback
> situations, step 2 above is not necessary and the smc sock is not
> needed either. So we create two shadow sockets when an smc socket
> starts listening. When a new connection comes in, we pop the req to
> the fallback socket acceptq or the non-fallback socket acceptq
> according to its syn_smc flag. As a result, when fallback happens we
> can graft the user socket with a normal tcp sock instead of an smc
> sock and get rid of the cost of step 2 and of releasing the smc
> sock.
> 
>                +-----> non-fallback socket acceptq
>                |
> incoming req --+
>                |
>                +-----> fallback socket acceptq
> 
> With the help of shadow sockets, we get performance similar to TCP
> connections in short-link nginx server fallback scenarios, as
> illustrated below.

It looks like only the shadow sockets' receive queue is needed/used.

Have you considered instead adding 2 receive queues to smc_sock, and
implementing a custom accept() variant that fetches the accepted
sockets from there?

That would better encapsulate the changes in the smc code and avoid
creating those two non-listening but almost-listening sockets, which
look quite strange.
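
Something along these lines, completely untested and with made-up
names (smc_accepted_sock, smc_pop_accepted() and the queues they
imply do not exist today):

	/* accepted tcp socks queued directly on smc_sock; one list
	 * for fallback connections, one for SMC-R connections
	 */
	struct smc_accepted_sock {
		struct list_head	list;
		struct sock		*tcp_sk;
	};

	static struct sock *smc_pop_accepted(struct list_head *queue,
					     spinlock_t *lock)
	{
		struct smc_accepted_sock *e;
		struct sock *sk = NULL;

		spin_lock_bh(lock);
		e = list_first_entry_or_null(queue,
					     struct smc_accepted_sock,
					     list);
		if (e) {
			list_del(&e->list);
			sk = e->tcp_sk;
			kfree(e);
		}
		spin_unlock_bh(lock);
		return sk;
	}

A custom accept() would then try the fallback list first and only
fall through to the normal smc accept queue when it is empty.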

Cheers,

Paolo
Wenjia Zhang March 22, 2023, 5:09 p.m. UTC | #2
On 21.03.23 08:19, Kai Shen wrote:
> SMC-R currently performs poorly in fallback situations, especially
> in short-link server fallback scenarios. We are planning to make
> SMC-R widely used, and handling this fallback performance issue is
> crucial to us. Here we introduce a shadow socket method to relieve
> this problem.
> 
Could you please elaborate on the problem?
> Basically, we use two extra accept queues to hold incoming
> connections, one for fallback connections and the other for SMC-R
> connections. We implement this with two extra 'shadow' sockets and
> make the connection path of fallback connections almost the same as
> that of normal TCP connections.
> 
> The current SMC-R accept path is:
>    1. incoming connection
>    2. schedule work to allocate an smc sock, do the tcp accept and
>       push the connection to the smc acceptq
>    3. wake up the user to accept
> 
> When fallback happens on the server, the accept path is the same,
> which costs more than the normal TCP accept path. In fallback
> situations, step 2 above is not necessary and the smc sock is not
> needed either. So we create two shadow sockets when an smc socket
> starts listening. When a new connection comes in, we pop the req to
> the fallback socket acceptq or the non-fallback socket acceptq
> according to its syn_smc flag. As a result, when fallback happens we
> can graft the user socket with a normal tcp sock instead of an smc
> sock and get rid of the cost of step 2 and of releasing the smc
> sock.
> 
>                 +-----> non-fallback socket acceptq
>                 |
> incoming req --+
>                 |
>                 +-----> fallback socket acceptq
> 
> With the help of shadow sockets, we get performance similar to TCP
> connections in short-link nginx server fallback scenarios, as
> illustrated below.
> 
> Cases are of the form "./wrk http://x.x.x.x:x/
> 	-H 'Connection: Close' -c 1600 -t 32 -d 20 --latency"
> 
> TCP:
>      Requests/sec: 145438.65
>      Transfer/sec:     21.64MB
> 
> Server fallback on original SMC-R:
>      Requests/sec: 114192.82
>      Transfer/sec:     16.99MB
> 
> Server fallback on SMC-R with shadow sockets:
>      Requests/sec: 143528.11
>      Transfer/sec:     21.35MB
> 

Generally, I don't have a good feeling about the two non-listening
sockets, and I cannot see why it is necessary to introduce the
actsock socket instead of using the clcsock itself. Maybe you can
convince me with a good reason.

> On the other hand, since the request now sits on a shadow socket's
> accept queue, that queue's fastopenq lock is not the right lock to
> take when accepting. So we need to look up the original listener's
> fastopenq lock in inet_csk_accept().
> 
> Signed-off-by: Kai Shen <KaiShen@linux.alibaba.com>
> ---
>   net/ipv4/inet_connection_sock.c |  13 ++-
>   net/smc/af_smc.c                | 143 ++++++++++++++++++++++++++++++--
>   net/smc/smc.h                   |   2 +
>   3 files changed, 150 insertions(+), 8 deletions(-)
> 
> diff --git a/net/ipv4/inet_connection_sock.c b/net/ipv4/inet_connection_sock.c
> index 65ad4251f6fd..ba2ec5ad4c04 100644
> --- a/net/ipv4/inet_connection_sock.c
> +++ b/net/ipv4/inet_connection_sock.c
> @@ -658,6 +658,7 @@ struct sock *inet_csk_accept(struct sock *sk, int flags, int *err, bool kern)
>   {
>   	struct inet_connection_sock *icsk = inet_csk(sk);
>   	struct request_sock_queue *queue = &icsk->icsk_accept_queue;
> +	spinlock_t *fastopenq_lock = &queue->fastopenq.lock;
>   	struct request_sock *req;
>   	struct sock *newsk;
>   	int error;
> @@ -689,7 +690,15 @@ struct sock *inet_csk_accept(struct sock *sk, int flags, int *err, bool kern)
>   
>   	if (sk->sk_protocol == IPPROTO_TCP &&
>   	    tcp_rsk(req)->tfo_listener) {
> -		spin_lock_bh(&queue->fastopenq.lock);
> +#if IS_ENABLED(CONFIG_SMC)
> +		if (tcp_sk(sk)->syn_smc) {
> +			struct request_sock_queue *orig_queue;
> +
> +			orig_queue = &inet_csk(req->rsk_listener)->icsk_accept_queue;
> +			fastopenq_lock = &orig_queue->fastopenq.lock;
> +		}
> +#endif
> +		spin_lock_bh(fastopenq_lock);
>   		if (tcp_rsk(req)->tfo_listener) {
>   			/* We are still waiting for the final ACK from 3WHS
>   			 * so can't free req now. Instead, we set req->sk to
> @@ -700,7 +709,7 @@ struct sock *inet_csk_accept(struct sock *sk, int flags, int *err, bool kern)
>   			req->sk = NULL;
>   			req = NULL;
>   		}
> -		spin_unlock_bh(&queue->fastopenq.lock);
> +		spin_unlock_bh(fastopenq_lock);
>   	}
>   
>   out:
> diff --git a/net/smc/af_smc.c b/net/smc/af_smc.c
> index a4cccdfdc00a..ad6c3b9ec9a6 100644
> --- a/net/smc/af_smc.c
> +++ b/net/smc/af_smc.c
> @@ -126,7 +126,9 @@ static struct sock *smc_tcp_syn_recv_sock(const struct sock *sk,
>   
>   	smc = smc_clcsock_user_data(sk);
>   
> -	if (READ_ONCE(sk->sk_ack_backlog) + atomic_read(&smc->queued_smc_hs) >
> +	if (READ_ONCE(sk->sk_ack_backlog) + atomic_read(&smc->queued_smc_hs)
> +			+ READ_ONCE(smc->actsock->sk->sk_ack_backlog)
> +			+ READ_ONCE(smc->fbsock->sk->sk_ack_backlog) >
>   				sk->sk_max_ack_backlog)
>   		goto drop;
>   
> @@ -286,6 +288,10 @@ static int __smc_release(struct smc_sock *smc)
>   				/* wake up clcsock accept */
>   				rc = kernel_sock_shutdown(smc->clcsock,
>   							  SHUT_RDWR);
> +				if (smc->fbsock)
> +					sock_release(smc->fbsock);
> +				if (smc->actsock)
> +					sock_release(smc->actsock);
>   			}
>   			sk->sk_state = SMC_CLOSED;
>   			sk->sk_state_change(sk);
> @@ -1681,7 +1687,7 @@ static int smc_clcsock_accept(struct smc_sock *lsmc, struct smc_sock **new_smc)
>   
>   	mutex_lock(&lsmc->clcsock_release_lock);
>   	if (lsmc->clcsock)
> -		rc = kernel_accept(lsmc->clcsock, &new_clcsock, SOCK_NONBLOCK);
> +		rc = kernel_accept(lsmc->actsock, &new_clcsock, SOCK_NONBLOCK);
>   	mutex_unlock(&lsmc->clcsock_release_lock);
>   	lock_sock(lsk);
>   	if  (rc < 0 && rc != -EAGAIN)
> @@ -2486,9 +2492,46 @@ static void smc_tcp_listen_work(struct work_struct *work)
>   	sock_put(&lsmc->sk); /* sock_hold in smc_clcsock_data_ready() */
>   }
>   
> +#define SMC_LINK 1
> +#define FALLBACK_LINK 2
> +static inline int smc_sock_pop_to_another_acceptq(struct smc_sock *lsmc)
> +{
> +	struct sock *lsk = lsmc->clcsock->sk;
> +	struct inet_connection_sock *icsk = inet_csk(lsk);
> +	struct inet_connection_sock *dest_icsk;
> +	struct request_sock_queue *queue = &icsk->icsk_accept_queue;
> +	struct request_sock_queue *dest_queue;
> +	struct request_sock *req;
> +	struct sock *dst_sock;
> +	int ret;
> +
> +	req = reqsk_queue_remove(queue, lsk);
> +	if (!req)
> +		return -EINVAL;
> +
> +	if (tcp_sk(req->sk)->syn_smc || lsmc->sockopt_defer_accept) {
> +		dst_sock = lsmc->actsock->sk;
> +		ret = SMC_LINK;
> +	} else {
> +		dst_sock = lsmc->fbsock->sk;
> +		ret = FALLBACK_LINK;
> +	}
> +
> +	dest_icsk = inet_csk(dst_sock);
> +	dest_queue = &dest_icsk->icsk_accept_queue;
> +
> +	spin_lock_bh(&dest_queue->rskq_lock);
> +	WRITE_ONCE(req->dl_next, dest_queue->rskq_accept_head);
> +	sk_acceptq_added(dst_sock);
> +	dest_queue->rskq_accept_head = req;
> +	spin_unlock_bh(&dest_queue->rskq_lock);
> +	return ret;
> +}
> +
>   static void smc_clcsock_data_ready(struct sock *listen_clcsock)
>   {
>   	struct smc_sock *lsmc;
> +	int ret;
>   
>   	read_lock_bh(&listen_clcsock->sk_callback_lock);
>   	lsmc = smc_clcsock_user_data(listen_clcsock);
> @@ -2496,14 +2539,41 @@ static void smc_clcsock_data_ready(struct sock *listen_clcsock)
>   		goto out;
>   	lsmc->clcsk_data_ready(listen_clcsock);
>   	if (lsmc->sk.sk_state == SMC_LISTEN) {
> -		sock_hold(&lsmc->sk); /* sock_put in smc_tcp_listen_work() */
> -		if (!queue_work(smc_tcp_ls_wq, &lsmc->tcp_listen_work))
> -			sock_put(&lsmc->sk);
> +		ret = smc_sock_pop_to_another_acceptq(lsmc);
> +		if (ret == SMC_LINK) {
> +			sock_hold(&lsmc->sk); /* sock_put in smc_tcp_listen_work() */
> +			if (!queue_work(smc_tcp_ls_wq, &lsmc->tcp_listen_work))
> +				sock_put(&lsmc->sk);
> +		} else if (ret == FALLBACK_LINK) {
> +			lsmc->sk.sk_data_ready(&lsmc->sk);
> +		}
>   	}
>   out:
>   	read_unlock_bh(&listen_clcsock->sk_callback_lock);
>   }
>   
> +static void smc_shadow_socket_init(struct socket *sock)
> +{
> +	struct inet_connection_sock *icsk = inet_csk(sock->sk);
> +	struct request_sock_queue *queue = &icsk->icsk_accept_queue;
> +
> +	tcp_set_state(sock->sk, TCP_LISTEN);
> +	sock->sk->sk_ack_backlog = 0;
> +
> +	inet_csk_delack_init(sock->sk);
> +
> +	spin_lock_init(&queue->rskq_lock);
> +
> +	spin_lock_init(&queue->fastopenq.lock);
> +	queue->fastopenq.rskq_rst_head = NULL;
> +	queue->fastopenq.rskq_rst_tail = NULL;
> +	queue->fastopenq.qlen = 0;
> +
> +	queue->rskq_accept_head = NULL;
> +
> +	tcp_sk(sock->sk)->syn_smc = 1;
> +}
> +
>   static int smc_listen(struct socket *sock, int backlog)
>   {
>   	struct sock *sk = sock->sk;
> @@ -2551,6 +2621,18 @@ static int smc_listen(struct socket *sock, int backlog)
>   	if (smc->limit_smc_hs)
>   		tcp_sk(smc->clcsock->sk)->smc_hs_congested = smc_hs_congested;
>   
> +	rc = sock_create_kern(sock_net(sk), PF_INET, SOCK_STREAM, IPPROTO_TCP,
> +			      &smc->fbsock);
> +	if (rc)
> +		goto out;
> +	smc_shadow_socket_init(smc->fbsock);
> +
> +	rc = sock_create_kern(sock_net(sk), PF_INET, SOCK_STREAM, IPPROTO_TCP,
> +			      &smc->actsock);
> +	if (rc)
> +		goto out;
> +	smc_shadow_socket_init(smc->actsock);
> +
>   	rc = kernel_listen(smc->clcsock, backlog);
>   	if (rc) {
>   		write_lock_bh(&smc->clcsock->sk->sk_callback_lock);
> @@ -2569,6 +2651,30 @@ static int smc_listen(struct socket *sock, int backlog)
>   	return rc;
>   }
>   
> +static inline bool tcp_reqsk_queue_empty(struct sock *sk)
> +{
> +	struct inet_connection_sock *icsk = inet_csk(sk);
> +	struct request_sock_queue *queue = &icsk->icsk_accept_queue;
> +
> +	return reqsk_queue_empty(queue);
> +}
> +
Since this is only used by smc, I'd suggest using
smc_tcp_reqsk_queue_empty instead of tcp_reqsk_queue_empty.
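
i.e. just the rename, with the two call sites in smc_accept()
updated accordingly:

	static inline bool smc_tcp_reqsk_queue_empty(struct sock *sk)
	{
		struct inet_connection_sock *icsk = inet_csk(sk);

		return reqsk_queue_empty(&icsk->icsk_accept_queue);
	}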

> +static inline void
> +smc_restore_fbsock_protocol_family(struct socket *new_sock, struct socket *sock)
> +{
> +	struct smc_sock *lsmc = smc_sk(sock->sk);
> +
> +	new_sock->sk->sk_data_ready = lsmc->fbsock->sk->sk_data_ready;
> +	new_sock->ops = lsmc->fbsock->ops;
> +	new_sock->type = lsmc->fbsock->type;
> +
> +	module_put(sock->ops->owner);
> +	__module_get(new_sock->ops->owner);
> +
> +	if (tcp_sk(new_sock->sk)->syn_smc)
> +		pr_err("new sock is not fallback.\n");
> +}
> +
>   static int smc_accept(struct socket *sock, struct socket *new_sock,
>   		      int flags, bool kern)
>   {
> @@ -2579,6 +2685,18 @@ static int smc_accept(struct socket *sock, struct socket *new_sock,
>   	int rc = 0;
>   
>   	lsmc = smc_sk(sk);
> +	/* There is a lock in inet_csk_accept, so to make a fast path we do not lock_sock here */
> +	if (lsmc->sk.sk_state == SMC_LISTEN && !tcp_reqsk_queue_empty(lsmc->fbsock->sk)) {
> +		rc = lsmc->clcsock->ops->accept(lsmc->fbsock, new_sock, O_NONBLOCK, true);
> +		if (rc == -EAGAIN)
> +			goto normal_path;
> +		if (rc < 0)
> +			return rc;
> +		smc_restore_fbsock_protocol_family(new_sock, sock);
> +		return rc;
> +	}
> +
> +normal_path:
>   	sock_hold(sk); /* sock_put below */
>   	lock_sock(sk);
>   
> @@ -2593,6 +2711,18 @@ static int smc_accept(struct socket *sock, struct socket *new_sock,
>   	add_wait_queue_exclusive(sk_sleep(sk), &wait);
>   	while (!(nsk = smc_accept_dequeue(sk, new_sock))) {
>   		set_current_state(TASK_INTERRUPTIBLE);
> +		if (!tcp_reqsk_queue_empty(lsmc->fbsock->sk)) {
> +			rc = lsmc->clcsock->ops->accept(lsmc->fbsock, new_sock, O_NONBLOCK, true);
> +			if (rc == -EAGAIN)
> +				goto next_round;
> +			if (rc < 0)
> +				break;
> +
> +			smc_restore_fbsock_protocol_family(new_sock, sock);
> +			nsk = new_sock->sk;
> +			break;
> +		}
> +next_round:
>   		if (!timeo) {
>   			rc = -EAGAIN;
>   			break;
> @@ -2731,7 +2861,8 @@ static __poll_t smc_accept_poll(struct sock *parent)
>   	__poll_t mask = 0;
>   
>   	spin_lock(&isk->accept_q_lock);
> -	if (!list_empty(&isk->accept_q))
> +	if (!list_empty(&isk->accept_q) ||
> +	    !reqsk_queue_empty(&inet_csk(isk->fbsock->sk)->icsk_accept_queue))
>   		mask = EPOLLIN | EPOLLRDNORM;
>   	spin_unlock(&isk->accept_q_lock);
>   
> diff --git a/net/smc/smc.h b/net/smc/smc.h
> index 5ed765ea0c73..9a62c8f37e26 100644
> --- a/net/smc/smc.h
> +++ b/net/smc/smc.h
> @@ -241,6 +241,8 @@ struct smc_connection {
>   struct smc_sock {				/* smc sock container */
>   	struct sock		sk;
>   	struct socket		*clcsock;	/* internal tcp socket */
> +	struct socket		*fbsock;	/* socket for fallback connection */
> +	struct socket		*actsock;	/* socket for non-fallback connection */
>   	void			(*clcsk_state_change)(struct sock *sk);
>   						/* original stat_change fct. */
>   	void			(*clcsk_data_ready)(struct sock *sk);
Kai March 24, 2023, 7:26 a.m. UTC | #3
On 3/23/23 1:09 AM, Wenjia Zhang wrote:
> 
> 
> On 21.03.23 08:19, Kai Shen wrote:
>> SMC-R currently performs poorly in fallback situations, especially
>> in short-link server fallback scenarios. We are planning to make
>> SMC-R widely used, and handling this fallback performance issue is
>> crucial to us. Here we introduce a shadow socket method to relieve
>> this problem.
>>
> Could you please elaborate on the problem?

Here is the background. We are using SMC-R to accelerate
server-client applications by deploying SMC-R on the server side,
but not all clients use SMC-R. In these scenarios we want the
clients using SMC-R to get the acceleration, while the clients that
fall back to TCP get performance no worse than plain TCP.
What's more, in short-link scenarios we may use fallback on purpose,
because SMC-R performs badly there due to its costly connection
establishment path. So it is very important that SMC-R performs
similarly to TCP in fallback situations: we use SMC-R as an
acceleration method, and performance should not be compromised when
a user connects to an SMC-R server with a TCP client.
In our tests, the server-side SMC-R fallback accept path contributes
a lot to the performance gap compared to TCP, as mentioned in the
patch, and we are trying to solve this problem.

> 
> Generally, I don't have a good feeling about the two non-listening
> sockets, and I cannot see why it is necessary to introduce the
> actsock socket instead of using the clcsock itself. Maybe you can
> convince me with a good reason.
>
First let me explain why we use two sockets here.
We want the fallback accept path to be similar to TCP's, so all
fallback connection requests should go to the fallback sock (accept
queue) and take a shorter path (bypassing tcp_listen_work), while
the clcsock holds both requests with syn_smc and fallback requests.
So we steer requests with syn_smc to actsock and fallback requests
to fbsock.
I think having two queues for the two types of incoming requests is
the right strategy (and it leads to good performance too).
On the other hand, the implementation of this strategy is worth
discussing. As Paolo said, in this implementation only the shadow
sockets' receive queue is needed. I use these two non-listening
sockets for the following reasons:
1. If we implement a custom accept, some symbols are not accessible
because they are not exported (like mem_cgroup_charge_skmem).
2. Here we reuse the TCP accept path, so future updates of TCP are
less likely to cause problems due to divergence between a custom
accept and the future TCP accept.
3. SMC-R is trying to behave like TCP, and a custom accept may mean
duplicated code, which does not look good.

Well, I think two queues is the right strategy, but I am not so sure
which implementation is better, and we really want to solve this
problem. Please give advice.

>> +static inline bool tcp_reqsk_queue_empty(struct sock *sk)
>> +{
>> +    struct inet_connection_sock *icsk = inet_csk(sk);
>> +    struct request_sock_queue *queue = &icsk->icsk_accept_queue;
>> +
>> +    return reqsk_queue_empty(queue);
>> +}
>> +
> Since this is only used by smc, I'd suggest using
> smc_tcp_reqsk_queue_empty instead of tcp_reqsk_queue_empty.
>
Will do.

Thanks

Kai
Kai March 24, 2023, 8:21 a.m. UTC | #4
On 3/22/23 9:08 PM, Paolo Abeni wrote:
> 
> It looks like only the shadow sockets' receive queue is needed/used.
> 
> Have you considered instead adding 2 receive queues to smc_sock, and
> implementing a custom accept() variant that fetches the accepted
> sockets from there?
> 
> That would better encapsulate the changes in the smc code and avoid
> creating those two non-listening but almost-listening sockets, which
> look quite strange.
> 
> Cheers,
> 
> Paolo

I am not so sure about this two-socket implementation, but here are
my concerns:
1. When I tried to implement a custom accept, I found that the
function mem_cgroup_charge_skmem is not exported and SMC-R couldn't
access it as a module. If there are more functions like this in
future updates, this could be a problem.
2. A custom accept would have to be kept in sync with future updates
of the TCP accept path.
3. SMC-R is trying to behave like TCP, and a custom accept may mean
duplicated code, which does not look good.

Thanks,

Kai
Wenjia Zhang March 29, 2023, 9:41 a.m. UTC | #5
On 24.03.23 08:26, Kai wrote:
> 
> 
> On 3/23/23 1:09 AM, Wenjia Zhang wrote:
>>
>>
>> On 21.03.23 08:19, Kai Shen wrote:
>>> SMC-R currently performs poorly in fallback situations, especially
>>> in short-link server fallback scenarios. We are planning to make
>>> SMC-R widely used, and handling this fallback performance issue is
>>> crucial to us. Here we introduce a shadow socket method to relieve
>>> this problem.
>>>
>> Could you please elaborate on the problem?
> 
> Here is the background. We are using SMC-R to accelerate
> server-client applications by deploying SMC-R on the server side,
> but not all clients use SMC-R. In these scenarios we want the
> clients using SMC-R to get the acceleration, while the clients that
> fall back to TCP get performance no worse than plain TCP.

I'm wondering how the use case works. How do the server-client
applications get accelerated by using SMC-R? If your case relies on
the fallback, why not use TCP/IP directly?

> What's more, in short-link scenarios we may use fallback on purpose,
> because SMC-R performs badly there due to its costly connection
> establishment path. So it is very important that SMC-R performs
> similarly to TCP in fallback situations: we use SMC-R as an
> acceleration method, and performance should not be compromised when
> a user connects to an SMC-R server with a TCP client.
> In our tests, the server-side SMC-R fallback accept path contributes
> a lot to the performance gap compared to TCP, as mentioned in the
> patch, and we are trying to solve this problem.
> 
>>
>> Generally, I don't have a good feeling about the two non-listening
>> sockets, and I cannot see why it is necessary to introduce the
>> actsock socket instead of using the clcsock itself. Maybe you can
>> convince me with a good reason.
>>
> First let me explain why we use two sockets here.
> We want the fallback accept path to be similar to TCP's, so all
> fallback connection requests should go to the fallback sock (accept
> queue) and take a shorter path (bypassing tcp_listen_work), while
> the clcsock holds both requests with syn_smc and fallback requests.
> So we steer requests with syn_smc to actsock and fallback requests
> to fbsock.
> I think having two queues for the two types of incoming requests is
> the right strategy (and it leads to good performance too).
> On the other hand, the implementation of this strategy is worth
> discussing. As Paolo said, in this implementation only the shadow
> sockets' receive queue is needed. I use these two non-listening
> sockets for the following reasons:
> 1. If we implement a custom accept, some symbols are not accessible
> because they are not exported (like mem_cgroup_charge_skmem).
> 2. Here we reuse the TCP accept path, so future updates of TCP are
> less likely to cause problems due to divergence between a custom
> accept and the future TCP accept.
> 3. SMC-R is trying to behave like TCP, and a custom accept may mean
> duplicated code, which does not look good.
>
> Well, I think two queues is the right strategy, but I am not so sure
> which implementation is better, and we really want to solve this
> problem. Please give advice.
> 
>>> +static inline bool tcp_reqsk_queue_empty(struct sock *sk)
>>> +{
>>> +    struct inet_connection_sock *icsk = inet_csk(sk);
>>> +    struct request_sock_queue *queue = &icsk->icsk_accept_queue;
>>> +
>>> +    return reqsk_queue_empty(queue);
>>> +}
>>> +
>> Since this is only used by smc, I'd suggest using
>> smc_tcp_reqsk_queue_empty instead of tcp_reqsk_queue_empty.
>>
> Will do.
> 
> Thanks
> 
> Kai
Kai April 3, 2023, 10:18 a.m. UTC | #6
On 3/29/23 5:41 PM, Wenjia Zhang wrote:
> 
> 
> 
> On 24.03.23 08:26, Kai wrote:
>>
>>
>> On 3/23/23 1:09 AM, Wenjia Zhang wrote:
>>>
>>>
>>> On 21.03.23 08:19, Kai Shen wrote:
>>>> SMC-R currently performs poorly in fallback situations, especially
>>>> in short-link server fallback scenarios. We are planning to make
>>>> SMC-R widely used, and handling this fallback performance issue is
>>>> crucial to us. Here we introduce a shadow socket method to relieve
>>>> this problem.
>>>>
>>> Could you please elaborate on the problem?
>>
>> Here is the background. We are using SMC-R to accelerate
>> server-client applications by deploying SMC-R on the server side,
>> but not all clients use SMC-R. In these scenarios we want the
>> clients using SMC-R to get the acceleration, while the clients that
>> fall back to TCP get performance no worse than plain TCP.
> 
> I'm wondering how the use case works. How do the server-client
> applications get accelerated by using SMC-R? If your case relies on
> the fallback, why not use TCP/IP directly?
> 

Our goal is to replace TCP with SMC-R in the cloud as much as
possible. Many applications will use SMC-R by default, but not all
(e.g. those not running the latest OS). So a server using SMC-R must
be ready to serve SMC-R clients and TCP clients at the same time. As
a result, fallback will happen.

In these cases we want clients using SMC-R to get accelerated and
clients using TCP to see no performance loss. The server using SMC-R
can't tell whether the next client uses SMC-R or TCP until its TCP
SYN arrives, and this leads to fallback when a client uses TCP. But
the current SMC-R server fallback path, which handles incoming TCP
connection requests, compromises the performance of TCP clients. So
we want to optimize the SMC-R server fallback path.

Thanks.

Patch

diff --git a/net/ipv4/inet_connection_sock.c b/net/ipv4/inet_connection_sock.c
index 65ad4251f6fd..ba2ec5ad4c04 100644
--- a/net/ipv4/inet_connection_sock.c
+++ b/net/ipv4/inet_connection_sock.c
@@ -658,6 +658,7 @@  struct sock *inet_csk_accept(struct sock *sk, int flags, int *err, bool kern)
 {
 	struct inet_connection_sock *icsk = inet_csk(sk);
 	struct request_sock_queue *queue = &icsk->icsk_accept_queue;
+	spinlock_t *fastopenq_lock = &queue->fastopenq.lock;
 	struct request_sock *req;
 	struct sock *newsk;
 	int error;
@@ -689,7 +690,15 @@  struct sock *inet_csk_accept(struct sock *sk, int flags, int *err, bool kern)
 
 	if (sk->sk_protocol == IPPROTO_TCP &&
 	    tcp_rsk(req)->tfo_listener) {
-		spin_lock_bh(&queue->fastopenq.lock);
+#if IS_ENABLED(CONFIG_SMC)
+		if (tcp_sk(sk)->syn_smc) {
+			struct request_sock_queue *orig_queue;
+
+			orig_queue = &inet_csk(req->rsk_listener)->icsk_accept_queue;
+			fastopenq_lock = &orig_queue->fastopenq.lock;
+		}
+#endif
+		spin_lock_bh(fastopenq_lock);
 		if (tcp_rsk(req)->tfo_listener) {
 			/* We are still waiting for the final ACK from 3WHS
 			 * so can't free req now. Instead, we set req->sk to
@@ -700,7 +709,7 @@  struct sock *inet_csk_accept(struct sock *sk, int flags, int *err, bool kern)
 			req->sk = NULL;
 			req = NULL;
 		}
-		spin_unlock_bh(&queue->fastopenq.lock);
+		spin_unlock_bh(fastopenq_lock);
 	}
 
 out:
diff --git a/net/smc/af_smc.c b/net/smc/af_smc.c
index a4cccdfdc00a..ad6c3b9ec9a6 100644
--- a/net/smc/af_smc.c
+++ b/net/smc/af_smc.c
@@ -126,7 +126,9 @@  static struct sock *smc_tcp_syn_recv_sock(const struct sock *sk,
 
 	smc = smc_clcsock_user_data(sk);
 
-	if (READ_ONCE(sk->sk_ack_backlog) + atomic_read(&smc->queued_smc_hs) >
+	if (READ_ONCE(sk->sk_ack_backlog) + atomic_read(&smc->queued_smc_hs)
+			+ READ_ONCE(smc->actsock->sk->sk_ack_backlog)
+			+ READ_ONCE(smc->fbsock->sk->sk_ack_backlog) >
 				sk->sk_max_ack_backlog)
 		goto drop;
 
@@ -286,6 +288,10 @@  static int __smc_release(struct smc_sock *smc)
 				/* wake up clcsock accept */
 				rc = kernel_sock_shutdown(smc->clcsock,
 							  SHUT_RDWR);
+				if (smc->fbsock)
+					sock_release(smc->fbsock);
+				if (smc->actsock)
+					sock_release(smc->actsock);
 			}
 			sk->sk_state = SMC_CLOSED;
 			sk->sk_state_change(sk);
@@ -1681,7 +1687,7 @@  static int smc_clcsock_accept(struct smc_sock *lsmc, struct smc_sock **new_smc)
 
 	mutex_lock(&lsmc->clcsock_release_lock);
 	if (lsmc->clcsock)
-		rc = kernel_accept(lsmc->clcsock, &new_clcsock, SOCK_NONBLOCK);
+		rc = kernel_accept(lsmc->actsock, &new_clcsock, SOCK_NONBLOCK);
 	mutex_unlock(&lsmc->clcsock_release_lock);
 	lock_sock(lsk);
 	if  (rc < 0 && rc != -EAGAIN)
@@ -2486,9 +2492,46 @@  static void smc_tcp_listen_work(struct work_struct *work)
 	sock_put(&lsmc->sk); /* sock_hold in smc_clcsock_data_ready() */
 }
 
+#define SMC_LINK 1
+#define FALLBACK_LINK 2
+static inline int smc_sock_pop_to_another_acceptq(struct smc_sock *lsmc)
+{
+	struct sock *lsk = lsmc->clcsock->sk;
+	struct inet_connection_sock *icsk = inet_csk(lsk);
+	struct inet_connection_sock *dest_icsk;
+	struct request_sock_queue *queue = &icsk->icsk_accept_queue;
+	struct request_sock_queue *dest_queue;
+	struct request_sock *req;
+	struct sock *dst_sock;
+	int ret;
+
+	req = reqsk_queue_remove(queue, lsk);
+	if (!req)
+		return -EINVAL;
+
+	if (tcp_sk(req->sk)->syn_smc || lsmc->sockopt_defer_accept) {
+		dst_sock = lsmc->actsock->sk;
+		ret = SMC_LINK;
+	} else {
+		dst_sock = lsmc->fbsock->sk;
+		ret = FALLBACK_LINK;
+	}
+
+	dest_icsk = inet_csk(dst_sock);
+	dest_queue = &dest_icsk->icsk_accept_queue;
+
+	spin_lock_bh(&dest_queue->rskq_lock);
+	WRITE_ONCE(req->dl_next, dest_queue->rskq_accept_head);
+	sk_acceptq_added(dst_sock);
+	dest_queue->rskq_accept_head = req;
+	spin_unlock_bh(&dest_queue->rskq_lock);
+	return ret;
+}
+
 static void smc_clcsock_data_ready(struct sock *listen_clcsock)
 {
 	struct smc_sock *lsmc;
+	int ret;
 
 	read_lock_bh(&listen_clcsock->sk_callback_lock);
 	lsmc = smc_clcsock_user_data(listen_clcsock);
@@ -2496,14 +2539,41 @@  static void smc_clcsock_data_ready(struct sock *listen_clcsock)
 		goto out;
 	lsmc->clcsk_data_ready(listen_clcsock);
 	if (lsmc->sk.sk_state == SMC_LISTEN) {
-		sock_hold(&lsmc->sk); /* sock_put in smc_tcp_listen_work() */
-		if (!queue_work(smc_tcp_ls_wq, &lsmc->tcp_listen_work))
-			sock_put(&lsmc->sk);
+		ret = smc_sock_pop_to_another_acceptq(lsmc);
+		if (ret == SMC_LINK) {
+			sock_hold(&lsmc->sk); /* sock_put in smc_tcp_listen_work() */
+			if (!queue_work(smc_tcp_ls_wq, &lsmc->tcp_listen_work))
+				sock_put(&lsmc->sk);
+		} else if (ret == FALLBACK_LINK) {
+			lsmc->sk.sk_data_ready(&lsmc->sk);
+		}
 	}
 out:
 	read_unlock_bh(&listen_clcsock->sk_callback_lock);
 }
 
+static void smc_shadow_socket_init(struct socket *sock)
+{
+	struct inet_connection_sock *icsk = inet_csk(sock->sk);
+	struct request_sock_queue *queue = &icsk->icsk_accept_queue;
+
+	tcp_set_state(sock->sk, TCP_LISTEN);
+	sock->sk->sk_ack_backlog = 0;
+
+	inet_csk_delack_init(sock->sk);
+
+	spin_lock_init(&queue->rskq_lock);
+
+	spin_lock_init(&queue->fastopenq.lock);
+	queue->fastopenq.rskq_rst_head = NULL;
+	queue->fastopenq.rskq_rst_tail = NULL;
+	queue->fastopenq.qlen = 0;
+
+	queue->rskq_accept_head = NULL;
+
+	tcp_sk(sock->sk)->syn_smc = 1;
+}
+
 static int smc_listen(struct socket *sock, int backlog)
 {
 	struct sock *sk = sock->sk;
@@ -2551,6 +2621,18 @@  static int smc_listen(struct socket *sock, int backlog)
 	if (smc->limit_smc_hs)
 		tcp_sk(smc->clcsock->sk)->smc_hs_congested = smc_hs_congested;
 
+	rc = sock_create_kern(sock_net(sk), PF_INET, SOCK_STREAM, IPPROTO_TCP,
+			      &smc->fbsock);
+	if (rc)
+		goto out;
+	smc_shadow_socket_init(smc->fbsock);
+
+	rc = sock_create_kern(sock_net(sk), PF_INET, SOCK_STREAM, IPPROTO_TCP,
+			      &smc->actsock);
+	if (rc)
+		goto out;
+	smc_shadow_socket_init(smc->actsock);
+
 	rc = kernel_listen(smc->clcsock, backlog);
 	if (rc) {
 		write_lock_bh(&smc->clcsock->sk->sk_callback_lock);
@@ -2569,6 +2651,30 @@  static int smc_listen(struct socket *sock, int backlog)
 	return rc;
 }
 
+static inline bool tcp_reqsk_queue_empty(struct sock *sk)
+{
+	struct inet_connection_sock *icsk = inet_csk(sk);
+	struct request_sock_queue *queue = &icsk->icsk_accept_queue;
+
+	return reqsk_queue_empty(queue);
+}
+
+static inline void
+smc_restore_fbsock_protocol_family(struct socket *new_sock, struct socket *sock)
+{
+	struct smc_sock *lsmc = smc_sk(sock->sk);
+
+	new_sock->sk->sk_data_ready = lsmc->fbsock->sk->sk_data_ready;
+	new_sock->ops = lsmc->fbsock->ops;
+	new_sock->type = lsmc->fbsock->type;
+
+	module_put(sock->ops->owner);
+	__module_get(new_sock->ops->owner);
+
+	if (tcp_sk(new_sock->sk)->syn_smc)
+		pr_err("new sock is not fallback.\n");
+}
+
 static int smc_accept(struct socket *sock, struct socket *new_sock,
 		      int flags, bool kern)
 {
@@ -2579,6 +2685,18 @@  static int smc_accept(struct socket *sock, struct socket *new_sock,
 	int rc = 0;
 
 	lsmc = smc_sk(sk);
+	/* There is a lock in inet_csk_accept, so to make a fast path we do not lock_sock here */
+	if (lsmc->sk.sk_state == SMC_LISTEN && !tcp_reqsk_queue_empty(lsmc->fbsock->sk)) {
+		rc = lsmc->clcsock->ops->accept(lsmc->fbsock, new_sock, O_NONBLOCK, true);
+		if (rc == -EAGAIN)
+			goto normal_path;
+		if (rc < 0)
+			return rc;
+		smc_restore_fbsock_protocol_family(new_sock, sock);
+		return rc;
+	}
+
+normal_path:
 	sock_hold(sk); /* sock_put below */
 	lock_sock(sk);
 
@@ -2593,6 +2711,18 @@  static int smc_accept(struct socket *sock, struct socket *new_sock,
 	add_wait_queue_exclusive(sk_sleep(sk), &wait);
 	while (!(nsk = smc_accept_dequeue(sk, new_sock))) {
 		set_current_state(TASK_INTERRUPTIBLE);
+		if (!tcp_reqsk_queue_empty(lsmc->fbsock->sk)) {
+			rc = lsmc->clcsock->ops->accept(lsmc->fbsock, new_sock, O_NONBLOCK, true);
+			if (rc == -EAGAIN)
+				goto next_round;
+			if (rc < 0)
+				break;
+
+			smc_restore_fbsock_protocol_family(new_sock, sock);
+			nsk = new_sock->sk;
+			break;
+		}
+next_round:
 		if (!timeo) {
 			rc = -EAGAIN;
 			break;
@@ -2731,7 +2861,8 @@  static __poll_t smc_accept_poll(struct sock *parent)
 	__poll_t mask = 0;
 
 	spin_lock(&isk->accept_q_lock);
-	if (!list_empty(&isk->accept_q))
+	if (!list_empty(&isk->accept_q) ||
+	    !reqsk_queue_empty(&inet_csk(isk->fbsock->sk)->icsk_accept_queue))
 		mask = EPOLLIN | EPOLLRDNORM;
 	spin_unlock(&isk->accept_q_lock);
 
diff --git a/net/smc/smc.h b/net/smc/smc.h
index 5ed765ea0c73..9a62c8f37e26 100644
--- a/net/smc/smc.h
+++ b/net/smc/smc.h
@@ -241,6 +241,8 @@  struct smc_connection {
 struct smc_sock {				/* smc sock container */
 	struct sock		sk;
 	struct socket		*clcsock;	/* internal tcp socket */
+	struct socket		*fbsock;	/* socket for fallback connection */
+	struct socket		*actsock;	/* socket for non-fallback connection */
 	void			(*clcsk_state_change)(struct sock *sk);
 						/* original stat_change fct. */
 	void			(*clcsk_data_ready)(struct sock *sk);