mbox series

[0/7] EDAC/i10nm: Add RRL support for Intel Granite Rapids server

Message ID 20250417150724.1170168-1-qiuxu.zhuo@intel.com (mailing list archive)
Headers show
Series EDAC/i10nm: Add RRL support for Intel Granite Rapids server | expand

Message

Zhuo, Qiuxu April 17, 2025, 3:07 p.m. UTC
The RRL (Retry Read error Log) registers provide additional error information
during DDR memory correction. This can assist memory DIMM vendors in isolating
problematic cells within the DIMM and help CSPs predict uncorrectable errors.
This patch series is to add RRL support for Intel Granite Rapids servers and
is structured as follows:

Patch 1-3: Bug fixes and cleanup.

Patch 4-6: Refactor the RRL (Retry Read error Log) code of previous generations
           and prepare for Patch 7.

Patch 7  : Add RRL support for Intel Granite Rapids server.

Qiuxu Zhuo (7):
  EDAC/skx_common: Fix general protection fault
  EDAC/{skx_common,i10nm}: Fix the loss of saved RRL for HBM pseudo channel 0
  EDAC/i10nm: Explicitly set the modes of the RRL register sets
  EDAC/{skx_common,i10nm}: Structure the per-channel RRL registers
  EDAC/{skx_common,i10nm}: Refactor enable_retry_rd_err_log()
  EDAC/{skx_common,i10nm}: Refactor show_retry_rd_err_log()
  EDAC/{skx_common,i10nm}: Add RRL support for Intel Granite Rapids server

 drivers/edac/i10nm_base.c | 477 ++++++++++++++++++++++----------------
 drivers/edac/skx_common.c |   1 +
 drivers/edac/skx_common.h |  61 ++++-
 3 files changed, 330 insertions(+), 209 deletions(-)


base-commit: 8ffd015db85fea3e15a77027fda6c02ced4d2444

Comments

Luck, Tony April 18, 2025, 5:24 p.m. UTC | #1
> The RRL (Retry Read error Log) registers provide additional error information
> during DDR memory correction. This can assist memory DIMM vendors in isolating
> problematic cells within the DIMM and help CSPs predict uncorrectable errors.
> This patch series is to add RRL support for Intel Granite Rapids servers and
> is structured as follows:
>
> Patch 1-3: Bug fixes and cleanup.
>
> Patch 4-6: Refactor the RRL (Retry Read error Log) code of previous generations
>            and prepare for Patch 7.
>
> Patch 7  : Add RRL support for Intel Granite Rapids server.

Applied to edac-drivers branch of RAS tree for next merge window.

Thanks

-Tony