mbox series

[rdma-next,v3,0/7] Support RDMA events monitoring through

Message ID 20240909173025.30422-1-michaelgur@nvidia.com (mailing list archive)
Headers show
Series Support RDMA events monitoring through | expand

Message

Michael Guralnik Sept. 9, 2024, 5:30 p.m. UTC
This series consists of multiple parts that collectively offer a method
to monitor RDMA events from userspace.
Using netlink, users will be able to monitor their IB device events and
changes such as device register, device unregister and netdev
attachment.

The first 2 patches contain fixes in mlx5 lag code that are required for
accurate event reporting in case of a lag bond.

Patch #3 initializes phys_port_cnt early in device probe to allow the
IB-to-netdev mapping API to work properly.

Patches #4,#5 modify and export IB-to-netdev mapping API, making it accessible
to all vendors who wish to rely on it for associating their IB device with
a netdevice.

Patches #6,#7 add the netlink support for reporting IB device events to
userspace.

Changes in v3:
	- Fix lockdep warning in ib_device_get_netdev by dropping
	  optimization part from it
	- Extend event info to include device names
	- Instead of disregarding unknown events, report them to
	  userspace
	- Remove fill_mon_register and replace with fill_nldev_handle to
	  fill netlink messages for register events

Changes in v2:
	- Fix compilation issues with forward declaration of ib_device
	- Add missing setting of return code in error flow

Chiara Meiohas (5):
  RDMA/mlx5: Initialize phys_port_cnt earlier in RDMA device creation
  RDMA/device: Remove optimization in ib_device_get_netdev()
  RDMA/mlx5: Use IB set_netdev and get_netdev functions
  RDMA/nldev: Add support for RDMA monitoring
  RDMA/nldev: Expose whether RDMA monitoring is supported

Mark Bloch (2):
  RDMA/mlx5: Check RoCE LAG status before getting netdev
  RDMA/mlx5: Obtain upper net device only when needed

 drivers/infiniband/core/device.c              |  51 ++++-
 drivers/infiniband/core/netlink.c             |   1 +
 drivers/infiniband/core/nldev.c               | 130 ++++++++++++
 drivers/infiniband/hw/mlx5/ib_rep.c           |  22 +-
 drivers/infiniband/hw/mlx5/main.c             | 197 +++++++++++++-----
 drivers/infiniband/hw/mlx5/mlx5_ib.h          |   3 +-
 .../net/ethernet/mellanox/mlx5/core/lag/lag.c |  76 +++----
 include/linux/mlx5/device.h                   |   1 +
 include/linux/mlx5/driver.h                   |   2 +-
 include/rdma/ib_verbs.h                       |   2 +
 include/rdma/rdma_netlink.h                   |  12 ++
 include/uapi/rdma/rdma_netlink.h              |  16 ++
 12 files changed, 399 insertions(+), 114 deletions(-)

Comments

Leon Romanovsky Sept. 11, 2024, 1:30 p.m. UTC | #1
On Mon, 09 Sep 2024 20:30:18 +0300, Michael Guralnik wrote:
> This series consists of multiple parts that collectively offer a method
> to monitor RDMA events from userspace.
> Using netlink, users will be able to monitor their IB device events and
> changes such as device register, device unregister and netdev
> attachment.
> 
> The first 2 patches contain fixes in mlx5 lag code that are required for
> accurate event reporting in case of a lag bond.
> 
> [...]

Applied, thanks!

[1/7] RDMA/mlx5: Check RoCE LAG status before getting netdev
      https://git.kernel.org/rdma/rdma/c/e67266dc429670
[2/7] RDMA/mlx5: Obtain upper net device only when needed
      https://git.kernel.org/rdma/rdma/c/eb66fcc43fde8f
[3/7] RDMA/mlx5: Initialize phys_port_cnt earlier in RDMA device creation
      https://git.kernel.org/rdma/rdma/c/41068c95b0bf6e
[4/7] RDMA/device: Remove optimization in ib_device_get_netdev()
      https://git.kernel.org/rdma/rdma/c/95ae29d023a4ba
[5/7] RDMA/mlx5: Use IB set_netdev and get_netdev functions
      https://git.kernel.org/rdma/rdma/c/425b36d3b2cb81
[6/7] RDMA/nldev: Add support for RDMA monitoring
      https://git.kernel.org/rdma/rdma/c/9a13c8cffcf6e8
[7/7] RDMA/nldev: Expose whether RDMA monitoring is supported
      https://git.kernel.org/rdma/rdma/c/61eb1e03c16f38

Best regards,