mbox series

[ndctl,0/2] papr: Implement initial support for injecting smart errors

Message ID 20210712173132.1205192-1-vaibhav@linux.ibm.com (mailing list archive)
Headers show
Series papr: Implement initial support for injecting smart errors | expand


Vaibhav Jain July 12, 2021, 5:31 p.m. UTC
The patch series implements limited support for injecting smart errors for PAPR
NVDIMMs via ndctl-inject-smart(1) command. SMART errors are emulating in
papr_scm module as presently PAPR doesn't support injecting smart errors on an
NVDIMM. Currently support for injecting 'fatal' health state and 'dirty'
shutdown state is implemented. With the proposed ndctl patched and with
corresponding kernel patch [1] following command flow is expected:

$ sudo ndctl list -DH -d nmem0
 # inject unsafe shutdown and fatal health error
$ sudo ndctl inject-smart nmem0 -Uf
 # uninject all errors
$ sudo ndctl inject-smart nmem0 -N

Structure of the patch series

* First patch updates 'inject-smart' code to not always assume support for
  injecting all smart-errors. It also updates 'intel.c' to explicitly indicate
  the type of smart-inject errors supported.

* Update 'papr.c' to add support for injecting smart 'fatal' health and
  'dirty-shutdown' errors.

Vaibhav Jain (2):
  libndctl, intel: Indicate supported smart-inject types
  libndctl/papr: Add limited support for inject-smart

 ndctl/inject-smart.c  | 33 ++++++++++++++++++-----
 ndctl/lib/intel.c     |  7 ++++-
 ndctl/lib/papr.c      | 61 +++++++++++++++++++++++++++++++++++++++++++
 ndctl/lib/papr_pdsm.h | 17 ++++++++++++
 ndctl/libndctl.h      |  8 ++++++
 5 files changed, 118 insertions(+), 8 deletions(-)