Message ID | 20220927055337.22630-1-lizhijian@fujitsu.com (mailing list archive) |
---|---|
Headers | show |
Series | RDMA/rxe: Add RDMA FLUSH operation | expand |
On Tue, Sep 27, 2022 at 01:53:26PM +0800, Li Zhijian wrote: > Hey folks, > > Firstly i want to say thank you to all you guys, especially Bob, who in the > past 1+ month, gave me a lots of idea and inspiration. > > With the your help, some changes are make in 5th version, such as: > - new names and new patch split schemem, suggested by Bob > - bugfix: set is_pmem true only if the whole MR is pmem. it's possible the > one MR container both PMEM and DRAM. > - introduce feth structure, instead of u32 > - new bugfix to rxe_lookup_mw() and lookup_mr(), see (RDMA/rxe: make sure requested access is a subset of {mr,mw}->access), > with this fix, we remove check_placement_type(), lookup_mr() has done the such check. > - Enable QP attr flushable > These change logs also appear in the patch it belongs to. > > These patches are going to implement a *NEW* RDMA opcode "RDMA FLUSH". > In IB SPEC 1.5[1], 2 new opcodes, ATOMIC WRITE and RDMA FLUSH were > added in the MEMORY PLACEMENT EXTENSIONS section. This doesn't apply anymore, I did try to fix it, but it ended up not compiling, so it is better if you handle it and repost. Thanks, Jason
On Tue, Sep 27, 2022 at 01:53:26PM +0800, Li Zhijian wrote: > Hey folks, > > Firstly i want to say thank you to all you guys, especially Bob, who in the > past 1+ month, gave me a lots of idea and inspiration. I would like it if someone familiar with rxe could reviewed-by the protocol parts. Jason
在 2022/10/29 1:57, Jason Gunthorpe 写道: > On Tue, Sep 27, 2022 at 01:53:26PM +0800, Li Zhijian wrote: >> Hey folks, >> >> Firstly i want to say thank you to all you guys, especially Bob, who in the >> past 1+ month, gave me a lots of idea and inspiration. > > I would like it if someone familiar with rxe could reviewed-by the > protocol parts. Hi, Jason I reviewed these patches. I am fine with these patches. Hi, Zhijian I noticed the followings: " $ ./rdma_flush_server -s [server_address] -p [port_number] client: $ ./rdma_flush_client -s [server_address] -p [port_number] " Can you merge the server and the client to rdma-core? Thanks, Zhu Yanjun > > Jason
On 11/11/2022 10:49, Yanjun Zhu wrote: > 在 2022/10/29 1:57, Jason Gunthorpe 写道: >> On Tue, Sep 27, 2022 at 01:53:26PM +0800, Li Zhijian wrote: >>> Hey folks, >>> >>> Firstly i want to say thank you to all you guys, especially Bob, who >>> in the >>> past 1+ month, gave me a lots of idea and inspiration. >> >> I would like it if someone familiar with rxe could reviewed-by the >> protocol parts. > > Hi, Jason > > I reviewed these patches. I am fine with these patches. > > Hi, Zhijian > > I noticed the followings: > " > $ ./rdma_flush_server -s [server_address] -p [port_number] > client: > $ ./rdma_flush_client -s [server_address] -p [port_number] > " > Can you merge the server and the client to rdma-core? Yanjun, Yes, there was already a draft PR here https://github.com/linux-rdma/rdma-core/pull/1181, but it cannot go ahead until the kernel's patches are merged. and i will post a new version these days, would you mind if i add your "Reviewed-by" in next version ? > > Thanks, > Zhu Yanjun > >> >> Jason >
在 2022/11/11 13:10, lizhijian@fujitsu.com 写道: > > > On 11/11/2022 10:49, Yanjun Zhu wrote: >> 在 2022/10/29 1:57, Jason Gunthorpe 写道: >>> On Tue, Sep 27, 2022 at 01:53:26PM +0800, Li Zhijian wrote: >>>> Hey folks, >>>> >>>> Firstly i want to say thank you to all you guys, especially Bob, who >>>> in the >>>> past 1+ month, gave me a lots of idea and inspiration. >>> >>> I would like it if someone familiar with rxe could reviewed-by the >>> protocol parts. >> >> Hi, Jason >> >> I reviewed these patches. I am fine with these patches. >> >> Hi, Zhijian >> >> I noticed the followings: >> " >> $ ./rdma_flush_server -s [server_address] -p [port_number] >> client: >> $ ./rdma_flush_client -s [server_address] -p [port_number] >> " >> Can you merge the server and the client to rdma-core? > > Yanjun, > > Yes, there was already a draft PR here > https://github.com/linux-rdma/rdma-core/pull/1181, but it cannot go > ahead until the kernel's patches are merged. > > and i will post a new version these days, would you mind if i add your > "Reviewed-by" in next version ? Reviewed-by: Zhu Yanjun <yanjun.zhu@linux.dev> Thanks. Another problem, normally rxe should connect to physical ib devices, such as mlx ib device. That is, one host is rxe, the other host is mlx ib device. The rdma connection should be created between the 2 hosts. Do you connect to mlx ib device with this RDMA FLUSH operation? And what is the test result? Thanks a lot. Zhu Yanjun > > > >> >> Thanks, >> Zhu Yanjun >> >>> >>> Jason
On 11/11/2022 13:52, Yanjun Zhu wrote: > 在 2022/11/11 13:10, lizhijian@fujitsu.com 写道: >> >> >> On 11/11/2022 10:49, Yanjun Zhu wrote: >>> 在 2022/10/29 1:57, Jason Gunthorpe 写道: >>>> On Tue, Sep 27, 2022 at 01:53:26PM +0800, Li Zhijian wrote: >>>>> Hey folks, >>>>> >>>>> Firstly i want to say thank you to all you guys, especially Bob, who >>>>> in the >>>>> past 1+ month, gave me a lots of idea and inspiration. >>>> >>>> I would like it if someone familiar with rxe could reviewed-by the >>>> protocol parts. >>> >>> Hi, Jason >>> >>> I reviewed these patches. I am fine with these patches. >>> >>> Hi, Zhijian >>> >>> I noticed the followings: >>> " >>> $ ./rdma_flush_server -s [server_address] -p [port_number] >>> client: >>> $ ./rdma_flush_client -s [server_address] -p [port_number] >>> " >>> Can you merge the server and the client to rdma-core? >> >> Yanjun, >> >> Yes, there was already a draft PR here >> https://github.com/linux-rdma/rdma-core/pull/1181, but it cannot go >> ahead until the kernel's patches are merged. >> >> and i will post a new version these days, would you mind if i add your >> "Reviewed-by" in next version ? > > Reviewed-by: Zhu Yanjun <yanjun.zhu@linux.dev> > Thanks. > > Another problem, normally rxe should connect to physical ib devices, > such as mlx ib device. That is, one host is rxe, the other host is mlx > ib device. The rdma connection should be created between the 2 hosts. it's fully compatible with old operation. > > Do you connect to mlx ib device with this RDMA FLUSH operation? > And what is the test result? Yes, i tested it. After these patches, only RXE device can register *FLUSHABLE* MRs successfully. If mlx try that, EOPNOSUPP will be returned. Similarly, Since other hardwares(MLX for example) have not supported FLUSH operation, EOPNOSUPP will be returned if users try to to that. In short, for RXE requester, MLX responder will return error for the request. MLX requester is not able to request a FLUSH operation. Thanks Zhijian > > Thanks a lot. > Zhu Yanjun > >> >> >> >>> >>> Thanks, >>> Zhu Yanjun >>> >>>> >>>> Jason >
在 2022/11/11 14:10, lizhijian@fujitsu.com 写道: > > On 11/11/2022 13:52, Yanjun Zhu wrote: >> 在 2022/11/11 13:10, lizhijian@fujitsu.com 写道: >>> >>> On 11/11/2022 10:49, Yanjun Zhu wrote: >>>> 在 2022/10/29 1:57, Jason Gunthorpe 写道: >>>>> On Tue, Sep 27, 2022 at 01:53:26PM +0800, Li Zhijian wrote: >>>>>> Hey folks, >>>>>> >>>>>> Firstly i want to say thank you to all you guys, especially Bob, who >>>>>> in the >>>>>> past 1+ month, gave me a lots of idea and inspiration. >>>>> I would like it if someone familiar with rxe could reviewed-by the >>>>> protocol parts. >>>> Hi, Jason >>>> >>>> I reviewed these patches. I am fine with these patches. >>>> >>>> Hi, Zhijian >>>> >>>> I noticed the followings: >>>> " >>>> $ ./rdma_flush_server -s [server_address] -p [port_number] >>>> client: >>>> $ ./rdma_flush_client -s [server_address] -p [port_number] >>>> " >>>> Can you merge the server and the client to rdma-core? >>> Yanjun, >>> >>> Yes, there was already a draft PR here >>> https://github.com/linux-rdma/rdma-core/pull/1181, but it cannot go >>> ahead until the kernel's patches are merged. >>> >>> and i will post a new version these days, would you mind if i add your >>> "Reviewed-by" in next version ? >> Reviewed-by: Zhu Yanjun <yanjun.zhu@linux.dev> >> Thanks. >> >> Another problem, normally rxe should connect to physical ib devices, >> such as mlx ib device. That is, one host is rxe, the other host is mlx >> ib device. The rdma connection should be created between the 2 hosts. > it's fully compatible with old operation. > > >> Do you connect to mlx ib device with this RDMA FLUSH operation? >> And what is the test result? > Yes, i tested it. > > After these patches, only RXE device can register *FLUSHABLE* MRs > successfully. If mlx try that, EOPNOSUPP will be returned. > > Similarly, Since other hardwares(MLX for example) have not supported > FLUSH operation, EOPNOSUPP will be returned if users try to to that. > > In short, for RXE requester, MLX responder will return error for the > request. MLX requester is not able to request a FLUSH operation. Thanks. Do you mean that FLUSH operation is only supported in RXE? ^_^ And MLX does not support FLUSH operation currently? Zhu Yanjun > > Thanks > Zhijian > > >> Thanks a lot. >> Zhu Yanjun >> >>> >>> >>>> Thanks, >>>> Zhu Yanjun >>>> >>>>> Jason
On 11/11/2022 14:30, Yanjun Zhu wrote: >> >> After these patches, only RXE device can register *FLUSHABLE* MRs >> successfully. If mlx try that, EOPNOSUPP will be returned. >> >> Similarly, Since other hardwares(MLX for example) have not supported >> FLUSH operation, EOPNOSUPP will be returned if users try to to that. >> >> In short, for RXE requester, MLX responder will return error for the >> request. MLX requester is not able to request a FLUSH operation. > > Thanks. Do you mean that FLUSH operation is only supported in RXE? ^_^ > > And MLX does not support FLUSH operation currently? IMO, FLUSH and Atomic Write are newly introduced by IBA spec 1.5 published in 2021. So hardware/drivers(MLX) should do something to support it.
在 2022/11/11 14:38, lizhijian@fujitsu.com 写道: > > On 11/11/2022 14:30, Yanjun Zhu wrote: >>> After these patches, only RXE device can register *FLUSHABLE* MRs >>> successfully. If mlx try that, EOPNOSUPP will be returned. >>> >>> Similarly, Since other hardwares(MLX for example) have not supported >>> FLUSH operation, EOPNOSUPP will be returned if users try to to that. >>> >>> In short, for RXE requester, MLX responder will return error for the >>> request. MLX requester is not able to request a FLUSH operation. >> Thanks. Do you mean that FLUSH operation is only supported in RXE? ^_^ >> >> And MLX does not support FLUSH operation currently? > IMO, FLUSH and Atomic Write are newly introduced by IBA spec 1.5 > published in 2021. So hardware/drivers(MLX) should do something to > support it. Thanks. If I got you correctly, FLUSH and Atomic Write is a new feature. And from the test result, it is not supported by MLX driver currently. Wait for MLX Engineer for updates about FLUSH and Atomic Write. IMO, it had better make rxe successfully connect to one physical ib device with FLUSH and Atomic Write, such as MLX or others. Zhu Yanjun