[V2] {NET,IB}/mlx4: 64 byte CQE/EQE support

file at ~ogerlitz/tmp-patches/0001-NET-IB-mlx4-64-byte-CQE-EQE-support.patch

Jack, I'd like you to review the part of this patch that relates
to SRIOV. I've now tested it, applied on Roland's for-next, and it
works OK on a system with a VF probed on the host and doing an ipoib
ping. Both the VF and the PF noticed that they should use 64B CQEs/EQEs.

The interface to user space part needs some more shaping, I'll get
there once I know that SRIOV wise I'm OK.

Basically, we had some comments on a related matter from Ben H.,
see http://marc.info/?t=133923325000003&r=1&w=2. Please let me know
if I can get away without doing the change you and he discussed,
and if not, I'd love it if you could create a pre-patch that does it.

I'd like to try and push this for 3.7, so I need your feedback
as soon as you can.

Or.

----------------------------------

CX3 devices can work with 64 or 32 byte CQEs/EQEs. Using 64 byte
EQEs/CQEs allows better utilization of new chipsets and higher
performance. This patch queries the HCA's capabilities, and if it
supports BOTH 64 byte CQEs and EQEs, configures the HW to work
in 64 byte mode. Note that the 32B vs 64B working mode is global
per HCA, not per CQ or EQ.
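
The "both or none" decision can be sketched roughly as below. This is
only an illustration and not the code in the patch: the capability bit
names and values and the struct are made up for the example.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical capability bits; the real flag names and values differ. */
#define SKETCH_CAP_64B_EQE (1u << 0)
#define SKETCH_CAP_64B_CQE (1u << 1)

/* Entry sizes, in bytes, that apply to the whole HCA. */
struct sketch_hca_mode {
	unsigned int cqe_size;
	unsigned int eqe_size;
};

/*
 * Pick the global working mode: 64 byte entries only when the device
 * reports support for BOTH 64 byte CQEs and 64 byte EQEs, otherwise
 * stay with 32 byte entries for both.
 */
static struct sketch_hca_mode sketch_pick_mode(uint32_t dev_caps)
{
	const uint32_t both = SKETCH_CAP_64B_EQE | SKETCH_CAP_64B_CQE;
	bool use_64b = (dev_caps & both) == both;
	struct sketch_hca_mode mode = {
		.cqe_size = use_64b ? 64 : 32,
		.eqe_size = use_64b ? 64 : 32,
	};

	return mode;
}
```

A device that reports only one of the two bits ends up in 32 byte mode
for both entry types, which matches the global, per HCA nature of the
setting.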

Since this mode is global, userspace (libmlx4) must be updated to
work with the configured CQE size, and similarly under SRIOV, guests
that use ConnectX virtual functions need to know both EQE and CQE size.

The patch makes sure that older guest drivers that follow the
QUERY_DEV_FUNC command (e.g. as done in mlx4_core of Linux 3.3/3.4)
will notice that they need an update to be able to work with the
PPF, since the returned pf_context_behaviour will no longer be zero.
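
The guest side check can be pictured as below. This is a sketch under
assumptions, not the mlx4_core code: the bit names and values and the
function are hypothetical. The idea is that a guest refuses to work when
the PF reports a behaviour bit it does not know about, and an older
guest knows about none.

```c
#include <stdint.h>

/* Hypothetical behaviour bits as reported by the PF. */
#define SKETCH_BEHAVIOUR_64B_EQE (1u << 0)
#define SKETCH_BEHAVIOUR_64B_CQE (1u << 1)

/*
 * Return 0 when the guest can work with the PF, -1 when the PF reports
 * a behaviour that the guest driver does not understand. An older guest
 * passes known_mask == 0, so any non-zero pf_context_behaviour makes
 * it bail out.
 */
static int sketch_guest_check(uint32_t pf_context_behaviour,
			      uint32_t known_mask)
{
	if (pf_context_behaviour & ~known_mask)
		return -1;

	return 0;
}
```

With this shape, an updated guest only has to extend known_mask with the
two new bits, while an unmodified guest fails early instead of
misparsing 64 byte entries.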

User space notification is done through a new field introduced
in struct mlx4_ib_ucontext which holds device capabilities for
which user space must take action. This changes the binary interface, so
the ABI is bumped from 3 to 4, but only when **needed**, i.e. only when the
driver does use 64B CQEs or future device capabilities with which user
space must stay in sync. This allows unmodified libmlx4 to keep working
on older devices (e.g. A0, B0) which don't support 64 byte CQEs.
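
The conditional ABI bump amounts to something like the following
sketch. The constant and function names are invented for the example;
only the version numbers 3 and 4 come from the description above.

```c
#include <stdint.h>

/* ABI versions from the description above; the names are hypothetical. */
#define SKETCH_ABI_VERSION_BASE 3
#define SKETCH_ABI_VERSION_CAPS 4

/* Hypothetical bit for a capability that user space must act on. */
#define SKETCH_USER_CAP_64B_CQE (1u << 0)

/*
 * Report the newer ABI version only when at least one capability that
 * user space must know about is actually in use; otherwise keep the
 * old version so that an unmodified library continues to work.
 */
static int sketch_abi_version(uint32_t user_visible_caps)
{
	return user_visible_caps ? SKETCH_ABI_VERSION_CAPS
				 : SKETCH_ABI_VERSION_BASE;
}
```

On a device that stays in 32 byte mode the capability word is zero and
the reported version remains 3.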

---

V0 pointer
	mlx4 http://marc.info/?l=linux-rdma&m=131805712306677&w=2
	libmlx4 http://marc.info/?l=linux-rdma&m=131805712306678&w=2

V1 changes from V0

 - unified the 64B CQE and EQE patches into one patch which takes an approach
   of apply both or none, on the assumption that 99% of FW will support both
   or none.

 - bump the ABI version towards user-space/libmlx4 only when needed and not always

 - added support for SRIOV, using the PF_CONTEXT_BEHAVIOUR_MASK mechanism of
   the query func capabilities command, and modified "sizeof (struct mlx4_eqe)"
   to be 32 or 64 in mlx4_multi_func_init() and slave_event(). Jack/Liran,
   just to make sure: is that the correct thing to do?
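
To illustrate why a compile time sizeof no longer works once the entry
size is chosen at runtime, here is a small sketch of indexing into an
EQ ring with a runtime stride. The function and its parameters are
hypothetical and are not taken from eq.c.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Return a pointer to entry 'index' of a ring holding 'nent' entries of
 * 'eqe_size' bytes each (32 or 64, decided at runtime). The index wraps
 * around the ring. The caller must pass nent > 0.
 */
static uint8_t *sketch_get_eqe(uint8_t *ring, size_t nent,
			       size_t eqe_size, size_t index)
{
	return ring + (index % nent) * eqe_size;
}
```

Any place that copies or steps over entries has to use the same runtime
size, which is why the copy sizes in the two functions mentioned above
had to change.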

V2 changes

  rebased to kernel 3.6+ (i.e. pre 3.7-rc1) that includes the SRIOV/IB patches

 drivers/infiniband/hw/mlx4/cq.c                |   33 ++++++++++++++++++-----
 drivers/infiniband/hw/mlx4/main.c              |   26 +++++++++++++++---
 drivers/infiniband/hw/mlx4/mlx4_ib.h           |    1 +
 drivers/infiniband/hw/mlx4/user.h              |   15 ++++++++++-
 drivers/net/ethernet/mellanox/mlx4/cmd.c       |    2 +-
 drivers/net/ethernet/mellanox/mlx4/en_cq.c     |    2 +-
 drivers/net/ethernet/mellanox/mlx4/en_netdev.c |    1 +
 drivers/net/ethernet/mellanox/mlx4/en_rx.c     |    5 ++-
 drivers/net/ethernet/mellanox/mlx4/en_tx.c     |    5 ++-
 drivers/net/ethernet/mellanox/mlx4/eq.c        |   25 +++++++++++------
 drivers/net/ethernet/mellanox/mlx4/fw.c        |   11 +++++++-
 drivers/net/ethernet/mellanox/mlx4/main.c      |   20 +++++++++++++-
 drivers/net/ethernet/mellanox/mlx4/mlx4_en.h   |    1 +
 include/linux/mlx4/device.h                    |   16 +++++++++++
 14 files changed, 133 insertions(+), 30 deletions(-)

Message ID	1349355955-12831-1-git-send-email-ogerlitz@mellanox.com (mailing list archive)
State	Superseded
Delegated to:	Roland Dreier
From:	Or Gerlitz <ogerlitz@mellanox.com>
To:	linux-rdma@vger.kernel.org, jackm@mellanox.com
Cc:	Or Gerlitz <ogerlitz@mellanox.com>
Subject:	[PATCH V2] {NET,IB}/mlx4: 64 byte CQE/EQE support
Date:	Thu, 4 Oct 2012 15:05:55 +0200
