[V6,0/4] rasdaemon: Add support for the CXL error events

Message ID 20230331134404.1315-1-shiju.jose@huawei.com (mailing list archive)

Message

Shiju Jose March 31, 2023, 1:44 p.m. UTC
From: Shiju Jose <shiju.jose@huawei.com>

Log and record the following CXL errors reported through kernel trace
events: CXL poison errors, CXL AER uncorrectable errors, and CXL AER
correctable errors.

Shiju Jose (4):
  rasdaemon: Move definition for BIT and BIT_ULL to a common file
  rasdaemon: Add support for the CXL poison events
  rasdaemon: Add support for the CXL AER uncorrectable errors
  rasdaemon: Add support for the CXL AER correctable errors
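
For readers unfamiliar with the helpers named in the first patch, the following is a minimal sketch of how BIT and BIT_ULL are conventionally defined in C. The exact definitions in the rasdaemon tree may differ, and count_set_bits() is a hypothetical helper added only for illustration.

```c
#include <stdint.h>

/* Conventional single-bit mask macros (assumed form, not copied from rasdaemon). */
#define BIT(nr)     (1UL << (nr))
#define BIT_ULL(nr) (1ULL << (nr))

/* Hypothetical helper: count how many bits are set in a 64-bit status mask. */
static unsigned int count_set_bits(uint64_t mask)
{
    unsigned int n = 0;

    for (unsigned int i = 0; i < 64; i++)
        if (mask & BIT_ULL(i))
            n++;
    return n;
}
```

BIT_ULL is the safer choice for bit positions of 32 and above, because unsigned long is only 32 bits wide on some platforms.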

Changes:
V5 -> V6
1. Updated the CXL poison handling to follow changes in the
   kernel patches for cxl_poison:
  - length -> dpa_length
  - add field trace_type
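
As a rough illustration of the V6 field changes, the sketch below shows a record carrying dpa_length and trace_type and a helper that formats it for logging. The struct name, the dpa field, the field types, and format_poison() are assumptions made for this example and are not taken from the patches.

```c
#include <inttypes.h>
#include <stdint.h>
#include <stdio.h>

/* Hypothetical record; only dpa_length and trace_type come from the changelog. */
struct poison_record {
    uint64_t dpa;          /* assumed: device physical address */
    uint32_t dpa_length;   /* renamed from "length" in V6 */
    uint8_t  trace_type;   /* field added in V6; width is assumed */
};

/* Format the record into buf; returns the snprintf() result. */
static int format_poison(const struct poison_record *r, char *buf, size_t len)
{
    return snprintf(buf, len,
                    "dpa=0x%" PRIx64 " dpa_length=0x%" PRIx32 " trace_type=%u",
                    r->dpa, r->dpa_length, (unsigned int)r->trace_type);
}
```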
  
V4 -> V5
1. Add logging of the device serial number for CXL AER uncorrectable and
   CXL AER correctable errors.
2. Rebased.
3. Updated the patch descriptions.

V3 -> V4
1. Modified to follow the changes in the kernel patches:
   a) https://lore.kernel.org/lkml/cover.1675983077.git.alison.schofield@intel.com/
   b) https://lore.kernel.org/linux-cxl/63e5ed38d77d9_138fbc2947a@iweiny-mobl.notmuch/T/#t

V2 -> V3
1. Addressed the review comments from Dave Jiang.

RFC V1 -> V2
1. Rename uuid to region_uuid in the log and SQLite DB.
2. Rebase to the latest rasdaemon code.
3. Updated to match the renamed interface structures and functions
   in the latest libtraceevent-dev, as used in rasdaemon.

 Makefile.am                |   7 +-
 configure.ac               |  11 +
 ras-cxl-handler.c          | 426 +++++++++++++++++++++++++++++++++++++
 ras-cxl-handler.h          |  32 +++
 ras-events.c               |  33 +++
 ras-events.h               |   3 +
 ras-non-standard-handler.h |   3 -
 ras-record.c               | 215 +++++++++++++++++++
 ras-record.h               |  55 +++++
 ras-report.c               | 231 ++++++++++++++++++++
 ras-report.h               |   6 +
 11 files changed, 1018 insertions(+), 4 deletions(-)
 create mode 100644 ras-cxl-handler.c
 create mode 100644 ras-cxl-handler.h

Comments

Shiju Jose April 28, 2023, 5:59 p.m. UTC | #1
Hi All,

A pull request has been submitted for this series:
https://github.com/mchehab/rasdaemon/pull/96

Thanks,
Shiju

-----Original Message-----
From: Shiju Jose <shiju.jose@huawei.com> 
Sent: 31 March 2023 14:44
To: mchehab@kernel.org; linux-cxl@vger.kernel.org; linux-edac@vger.kernel.org
Cc: Jonathan Cameron <jonathan.cameron@huawei.com>; Linuxarm <linuxarm@huawei.com>; Shiju Jose <shiju.jose@huawei.com>
Subject: [PATCH V6 0/4] rasdaemon: Add support for the CXL error events


--
2.25.1