[RFC,0/4] rasdaemon: Add support for the CXL error events

Message ID 20230119171809.1406-1-shiju.jose@huawei.com (mailing list archive)

Shiju Jose Jan. 19, 2023, 5:18 p.m. UTC
From: Shiju Jose <shiju.jose@huawei.com>

Log and record the following CXL errors reported through the kernel
trace events: CXL poison errors, CXL AER uncorrectable errors, and
CXL AER correctable errors.

Note: The default poll method in rasdaemon for receiving trace events
      did not work in QEMU, so the pthread method was used instead for
      testing the CXL error events. To do so, make the following change
      in ras-events.c:

      <change start ...>
      /* rc = read_ras_event_all_cpus(data, cpus); */
      rc = -255;
      < ...change end >
      /* Poll doesn't work on this kernel. Fallback to pthread way */
      if (rc == -255) {
      ...
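
For readers unfamiliar with the fallback, the control flow forced by the
test change above can be sketched roughly as follows. This is only an
illustrative, standalone sketch and not the rasdaemon code: the names
read_events_poll(), per_cpu_reader(), read_events_with_fallback(),
struct cpu_ctx, POLL_UNSUPPORTED and MAX_CPUS are all hypothetical, and
the per-CPU reader merely marks itself as having run instead of reading
a trace pipe. The only detail taken from the note is the -255 return
value that triggers the pthread path.

```c
#include <pthread.h>

#define POLL_UNSUPPORTED (-255) /* return value checked in the note above */
#define MAX_CPUS 8              /* hypothetical upper bound for this sketch */

/* Hypothetical per-CPU context; not a rasdaemon structure. */
struct cpu_ctx {
	int cpu;
	int handled;
};

/*
 * Hypothetical stand-in for the poll-based reader. It always reports
 * "poll unsupported", mirroring the forced "rc = -255;" test change.
 */
static int read_events_poll(int cpus)
{
	(void)cpus;
	return POLL_UNSUPPORTED;
}

/* Hypothetical per-CPU worker: a real reader would loop on trace data. */
static void *per_cpu_reader(void *arg)
{
	struct cpu_ctx *ctx = arg;

	ctx->handled = 1;
	return NULL;
}

/*
 * Try the poll path first; if it reports POLL_UNSUPPORTED, start one
 * reader thread per CPU. Returns 0 on success and a negative value on
 * error. *fallback_readers receives the number of threads that ran
 * (0 when the poll path was used).
 */
static int read_events_with_fallback(int cpus, int *fallback_readers)
{
	pthread_t threads[MAX_CPUS];
	struct cpu_ctx ctx[MAX_CPUS];
	int started = 0, handled = 0, failed = 0;
	int rc, i;

	*fallback_readers = 0;
	if (cpus < 1 || cpus > MAX_CPUS)
		return -1;

	rc = read_events_poll(cpus);
	if (rc != POLL_UNSUPPORTED)
		return rc;

	/* Poll is unavailable: fall back to one thread per CPU. */
	for (i = 0; i < cpus; i++) {
		ctx[i].cpu = i;
		ctx[i].handled = 0;
		if (pthread_create(&threads[i], NULL, per_cpu_reader, &ctx[i]) != 0) {
			failed = 1;
			break;
		}
		started++;
	}

	/* Wait only for the threads that were actually started. */
	for (i = 0; i < started; i++) {
		pthread_join(threads[i], NULL);
		handled += ctx[i].handled;
	}

	*fallback_readers = handled;
	return failed ? -1 : 0;
}
```

Calling read_events_with_fallback(4, &n) in this sketch takes the thread
path and leaves n set to 4. Build with -pthread where the toolchain
requires it.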
	 
Shiju Jose (4):
  rasdaemon: Move definition for BIT and BIT_ULL to a common file
  rasdaemon: Add support for the CXL poison events
  rasdaemon: Add support for the CXL AER uncorrectable errors
  rasdaemon: Add support for the CXL AER correctable errors

 Makefile.am                |   8 +-
 configure.ac               |  11 ++
 ras-cxl-handler.c          | 351 +++++++++++++++++++++++++++++++++++++
 ras-cxl-handler.h          |  32 ++++
 ras-events.c               |  33 ++++
 ras-events.h               |   3 +
 ras-non-standard-handler.h |   3 -
 ras-record.c               | 203 +++++++++++++++++++++
 ras-record.h               |  49 ++++++
 ras-report.c               | 219 +++++++++++++++++++++++
 ras-report.h               |   6 +
 11 files changed, 914 insertions(+), 4 deletions(-)
 create mode 100644 ras-cxl-handler.c
 create mode 100644 ras-cxl-handler.h