[rdma-next,0/8] Register infiniband class as net namespace aware class

Message ID 20190213172310.1681-1-leon@kernel.org (mailing list archive)

Leon Romanovsky Feb. 13, 2019, 5:23 p.m. UTC
From: Leon Romanovsky <leonro@mellanox.com>

From Parav,

Currently the 'infiniband' class is registered as a net namespace
agnostic class, so all rdma devices are visible in all net namespaces.
As a result, a net namespace filter has to be applied per sysfs entry,
such as the GID or GID attribute for RoCE. This is fine as long as there
is one rdma device shared among multiple net namespaces.

However, when there are multiple rdma devices, it is desirable to see
only some of them (one or more) in each net namespace. With different
link layer types, various use cases and modes exist. At minimum there
are two use cases.

(a) a shared rdma device among multiple net namespaces
(b) rdma device bound to a particular net namespace
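
The two use cases above can be pictured with a small userspace model.
This is only a sketch for illustration, not code from this series: the
model_* names are hypothetical and none of them are kernel APIs. It
assumes that a device with no bound namespace is shared (case a) and
that a bound device is visible only in its own namespace (case b).

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for a net namespace; not the kernel's struct net. */
struct model_net {
	int id;
};

/* Hypothetical stand-in for an rdma device. */
struct model_rdma_dev {
	const char *name;
	/*
	 * NULL: shared among all net namespaces (case a).
	 * non-NULL: bound to exactly this net namespace (case b).
	 */
	const struct model_net *bound_net;
};

/* Return true if @dev should be visible from @net in this model. */
static bool model_dev_visible(const struct model_rdma_dev *dev,
			      const struct model_net *net)
{
	assert(dev && net);

	if (!dev->bound_net)
		return true;		/* shared mode */

	return dev->bound_net == net;	/* bound mode */
}
```

In this model a shared device is reported as visible for every
namespace passed in, while a bound device is reported as visible only
for the namespace it is bound to.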

In preparation for keeping backward compatibility with existing use
cases while also supporting the future one (rdma device bound to a net
namespace), this series does the following:

1. Prepare the rdma infiniband class to be net namespace aware, so that
when an rdma device is bound to a net namespace in the future, it can be
restricted to a single net namespace. This requires the class to be net
namespace aware. By doing so, the standard kernel sysfs framework can be
used to isolate devices in net namespaces.
This is similar to how the net class is net namespace aware, following
standard kernel architecture.

2. Replicate the sysfs tree in non init_net namespaces for backward
compatibility, so that existing applications continue to operate in
shared mode.

This functionality is achieved by ib_core implementing a compat
ib_core_device, which replicates the device and sysfs entries in non
init_net namespaces. Creating a full ib_device there is not desired, so
an internal ib_core_device object is created that represents only the
needed device tree and sysfs entries.
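
The relationship between a full device and its compat entries can be
sketched in userspace as follows. This is an assumption-laden model and
not the implementation in this series: all model_* names, the fixed
array size and the return codes are hypothetical. It only illustrates
the idea that init_net holds the real entry while every other net
namespace gets a lightweight entry pointing back to the same device.

```c
#include <assert.h>
#include <stddef.h>

#define MODEL_MAX_COMPAT 4	/* arbitrary limit for this sketch */

/* Hypothetical stand-in for a net namespace. */
struct model_net {
	int id;
};

struct model_ib_device;

/* Lightweight object: just enough to expose a device in one namespace. */
struct model_core_device {
	const struct model_net *net;
	struct model_ib_device *owner;
};

/* Full device: one real core device plus compat copies for other nets. */
struct model_ib_device {
	const char *name;
	struct model_core_device coredev;	/* lives in init_net */
	struct model_core_device compat[MODEL_MAX_COMPAT];
	size_t n_compat;
};

/*
 * Make @dev visible in @net. Returns 0 on success (including when an
 * entry already exists) and -1 when the model runs out of slots.
 */
static int model_add_compat(struct model_ib_device *dev,
			    const struct model_net *net)
{
	size_t i;

	assert(dev && net);

	/* The namespace of the real core device needs no compat copy. */
	if (dev->coredev.net == net)
		return 0;

	/* Do not create a second compat entry for the same namespace. */
	for (i = 0; i < dev->n_compat; i++)
		if (dev->compat[i].net == net)
			return 0;

	if (dev->n_compat == MODEL_MAX_COMPAT)
		return -1;

	dev->compat[dev->n_compat].net = net;
	dev->compat[dev->n_compat].owner = dev;
	dev->n_compat++;
	return 0;
}
```

The point of the model is that the compat entries carry no device
state of their own; they only refer back to the single full device.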

A diagram, details and its objectives are captured in
Documentation/infiniband/core_devices.txt.

Thanks

BTW, it has the same extra space between the for_each iterator and the
bracket, as was pointed out by Bart.

Parav Pandit (8):
  RDMA/core: Use simpler device_del() instead of device_unregister()
  RDMA/core: Introduce and use ib_setup_port_attrs()
  RDMA/core: Introduce ib_core_device to hold device
  RDMA/core: Move device addition deletion to device.c
  RDMA/core: Restrict sysfs entries view to init_net
  RDMA/core: Implement compat device/sysfs tree in net namespace
  RDMA/core: Add Documentation for ib_core_device
  RDMA/core: Support core port attributes in non init_net

 Documentation/infiniband/core_devices.txt | 146 ++++++++++
 drivers/infiniband/core/core_priv.h       |   9 +
 drivers/infiniband/core/device.c          | 333 +++++++++++++++++++++-
 drivers/infiniband/core/sysfs.c           |  81 +++---
 include/rdma/ib_verbs.h                   |  31 +-
 5 files changed, 547 insertions(+), 53 deletions(-)
 create mode 100644 Documentation/infiniband/core_devices.txt

--
2.19.1