[v4,0/7] Poisoned memory recovery on reboot

Message ID	20241214134555.440097-1-william.roche@oracle.com (mailing list archive)
Headers	show Return-Path: <qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org> From: =?utf-8?q?=E2=80=9CWilliam_Roche?= <william.roche@oracle.com> To: david@redhat.com, kvm@vger.kernel.org, qemu-devel@nongnu.org, qemu-arm@nongnu.org Cc: william.roche@oracle.com, peterx@redhat.com, pbonzini@redhat.com, richard.henderson@linaro.org, philmd@linaro.org, peter.maydell@linaro.org, mtosatti@redhat.com, imammedo@redhat.com, eduardo@habkost.net, marcel.apfelbaum@gmail.com, wangyanan55@huawei.com, zhao1.liu@intel.com, joao.m.martins@oracle.com Subject: [PATCH v4 0/7] Poisoned memory recovery on reboot Date: Sat, 14 Dec 2024 13:45:48 +0000 Message-ID: <20241214134555.440097-1-william.roche@oracle.com> In-Reply-To: <cf587c8b-3894-4589-bfea-be5db70e81f3@redhat.com> References: <cf587c8b-3894-4589-bfea-be5db70e81f3@redhat.com> Content-Transfer-Encoding: 8bit Content-Type: text/plain MIME-Version: 1.0 Received-SPF: pass client-ip=205.220.177.32; envelope-from=william.roche@oracle.com; helo=mx0b-00069f02.pphosted.com X-Spam_score_int: -27 X-Spam_score: -2.8 X-Spam_bar: -- X-Spam_report: (-2.8 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=0.001, RCVD_IN_MSPIKE_WL=0.001, RCVD_IN_VALIDITY_CERTIFIED_BLOCKED=0.001, RCVD_IN_VALIDITY_RPBL_BLOCKED=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action Precedence: list Errors-To: qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org Sender: qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org
Series	Poisoned memory recovery on reboot \| expand [v4,0/7] Poisoned memory recovery on reboot [v4,1/7] hwpoison_page_list and qemu_ram_remap are based on pages [v4,2/7] system/physmem: poisoned memory discard on reboot [v4,3/7] accel/kvm: Report the loss of a large memory page [v4,4/7] numa: Introduce and use ram_block_notify_remap() [v4,5/7] hostmem: Factor out applying settings [v4,6/7] hostmem: Handle remapping of RAM [v4,7/7] system/physmem: Memory settings applied on remap notification

Message ID

20241214134555.440097-1-william.roche@oracle.com (mailing list archive)

Headers

From: =?utf-8?q?=E2=80=9CWilliam_Roche?= <william.roche@oracle.com>
To: david@redhat.com, kvm@vger.kernel.org, qemu-devel@nongnu.org,
 qemu-arm@nongnu.org
Cc: william.roche@oracle.com, peterx@redhat.com, pbonzini@redhat.com,
 richard.henderson@linaro.org, philmd@linaro.org,
 peter.maydell@linaro.org, mtosatti@redhat.com, imammedo@redhat.com,
 eduardo@habkost.net, marcel.apfelbaum@gmail.com,
 wangyanan55@huawei.com, zhao1.liu@intel.com, joao.m.martins@oracle.com
Subject: [PATCH v4 0/7] Poisoned memory recovery on reboot
Date: Sat, 14 Dec 2024 13:45:48 +0000
Message-ID: <20241214134555.440097-1-william.roche@oracle.com>
In-Reply-To: <cf587c8b-3894-4589-bfea-be5db70e81f3@redhat.com>
References: <cf587c8b-3894-4589-bfea-be5db70e81f3@redhat.com>
Content-Transfer-Encoding: 8bit
Content-Type: text/plain
MIME-Version: 1.0
X-MS-Exchange-AntiSpam-MessageData-ChunkCount: 1
X-MS-Exchange-AntiSpam-MessageData-0: 
 i1yIo3iSX/TXAmhV4jYKswoPxRubZBfNtNw/AbTsCI+WSzgv06ovsCKKd4QJ2nhlWyq83d+THht/qeeNPfxboLJd9Lnd0J7v/51/IUyUL+noXbWHfXnMQW+SRFPn2WYOLGHxScD3BoiU+DpUKi5vvee6SXaXh0P26t29GyDKxP9FnnGcndVpvwkFGDvT+6AwSdjw1kz0KvjI4h+PF+QPLQU1t3sVp2ZCQvPV3xvwo9g1VJSVPyqbIYBlnjoVQK6VH1/kxqv7Bb6iKEtsxGnnXqCX0lrJyIcspJVxE7GY3ZHt3sNqGzwJc2pJ0WfK/knILeM8rVXkPao+fR6PbCvakL2zOw24nFQs+EbZ3Zmc0gonh7d71VLjW4GgVfu6Bje5Mu4ZGPzvZKNoMS5gAYco80Z4TF5YdiSFiotnMvrzhiVkcOUW0j0dZ8pb0i1rLgb3excZw2kGKL+V/z0RTXVsMp0Rumu9gMbXWa1eWrCbpUkH7mifmqcVknqkE85b40eGFDx60wjZo/MuJBFllEjiLAWSrewQeb54zkM+d4h3TtVL7XEV8B4gR++jSPbJkT3Iq4XdPTWxvPVKIwdLtI5FoAVLKEN8ngt8BbjKH/QW3S1ts7da1BBF3owk3/Gx11y+Z8cdaV3Usjng1LGhfTbd6J7Dwo2Cq7V40Lizs9hS5sbOyTMXFFzjDbXirQ61Ydu51rmDIdZHhIXl2jKqnurFrWxgtBLRzdodJU2f/EcjxFnK3R/1YtOMSF4b+22xWB/q0YC3XPUpP5srORqsnJzviSVr3t0tmb+KrfbzLh2NzfneqxrPyLtD9qOfJW63CHqwtf+yRHQpxBBqFeEFTReN2SOxW6hx1zZcTlNP/c2xj6elGFSZ30cdenU2wUnn8gs5tIWB5w8og0W4mt7TGBK4FFYRv29zSRxIFQQntrYoCc74XjpAUrM1I70VRIvFx3uwu/GtqVDGWh+3oTwUXQ4Kgzoni4b0c4RXGbnFgbhJnK+YpabbujVmPHqgP+rxQfMUcMxYM1yBDIxLHGe1AMyG/mBt6SyEY+HD9yJ+BaxcoTLHwWWStIzZRJCWEDy0R0TuDt2NqHHPDbzRv/ZjyL0cKMA22RmFXonT1pgUUo4MXLlwmkRPsLfF7e+cfszSumuFQWNxH2FsZtp2BQ4TBm780pakgKAZeN5ZftVp/O5FBkbGwWEYtUx0JKzsGrZ2NQk4w7i+npZBm2XNgh/ef28leX+nOi5UKwgm4VKf634ckwsGSWWvl0Jml2kBRiDFQCggBdokQ+e4A7Bow+cVpB2RB0xsVbGWsaKszibAMuSxR1hWHUVzTSHfy2zj8Urv3nmoSu0w+se2RXq6E0R7hvU7zyT0M4oObQx9tfkcq4Bvv1l4OcjtXWxAasFaPqZ+6fyzxSD/DUynNx5XD8we2fO4V3cn2zgS2snenO+U8psV5EVkqfkjFRuEgVGXvFNeLbomFUKQh5dauVqRPmILFDyReO1VLKmnVFgMqMrTAhi2R1YKyujy9p31869hXbbwVw9jbPdzjIL7Ueh1XcSq7DUOfCJQZJ2tT+bjT/NZFo2wI2ewN2vodFNnt2VZToM5NyqhW5CH53nHBZQ+9Fd6NpC87Q==
X-MS-Exchange-AntiSpam-ExternalHop-MessageData-ChunkCount: 1
X-MS-Exchange-AntiSpam-ExternalHop-MessageData-0: 
 FYv9Gbjs5fOGzC3bWGoCUnYTJq0TXf0cIpbDrLO9iw32nRAsEBsOC4+TZI8O06c87Ns2C0hByhbtwwI+tF0/4Oe3s/RTVWT7H1TzNaZ6ZBHm1APfBGbr7I/nvKhDiLhOiH2PdPGYa0klSao7KdCXVr78wLlN5gdTnj+8zdNWJmL1n9RuJFU7TXjyESPiW9l4mF6OPWPYsVbD6TE6ORDM6yn/O4cObguN04LgBLEk1/QXMTNfuk4X2s6hshFkXWlVniOSiVQFZ4uLkg8hBCbQNTZ8/2/8EiIp+HLEB5TwQn1HRay/X8r4gpupzBpCukx/PQLtrRm/O2o7+xDd8PzvgoIq5uUjnvGdWryeGM7QwvQx0SNX8vZ2340up/YIJz/kUh5CU1qDjVlSGRaiRxCLlTZQh+3J1rLzAZs/YYhIxOYAb/ERPtJ+YZx8lp3Xqtvey29KAh31wTdJDdOvYKoXODvd+owRlSWCAspidcQwK7J7u0ISU1cDnMjONCB91MzgGpTjhbwg6v98ZRdCvQF7oOY5BsCzVO/O/B4Jb5Gl1hasJtRoIAkXIjQbEfYK8hq3ksCagl30uSRlmiwdzX6ke54G25vfLK4v3jKzBfpU8ZY=
X-OriginatorOrg: oracle.com
X-MS-Exchange-CrossTenant-Network-Message-Id: 
 e1303c40-2f75-4fd4-59ac-08dd1c45a326
X-MS-Exchange-CrossTenant-AuthSource: CH3PR10MB7329.namprd10.prod.outlook.com
X-MS-Exchange-CrossTenant-AuthAs: Internal
X-MS-Exchange-CrossTenant-OriginalArrivalTime: 14 Dec 2024 13:45:57.3954 (UTC)
X-MS-Exchange-CrossTenant-FromEntityHeader: Hosted
X-MS-Exchange-CrossTenant-Id: 4e2c6054-71cb-48f1-bd6c-3a9705aca71b
X-MS-Exchange-CrossTenant-MailboxType: HOSTED
X-MS-Exchange-CrossTenant-UserPrincipalName: 
 V6c2Ef7p/kvp79ux16fs8rrcz5DXIRdd+BBBssvH8MI3VgjJ4oF8Fet7t+5X2pGAxCeGE4SpqERNRdzS85tesjV+UQxSHfsF49G6FZfMPD4=
X-MS-Exchange-Transport-CrossTenantHeadersStamped: SN4PR10MB5624
X-Proofpoint-Virus-Version: vendor=baseguard
 engine=ICAP:2.0.293,Aquarius:18.0.1057,Hydra:6.0.680,FMLib:17.12.68.34
 definitions=2024-12-14_05,2024-12-13_01,2024-11-22_01
X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 phishscore=0
 spamscore=0 suspectscore=0
 adultscore=0 bulkscore=0 mlxscore=0 mlxlogscore=999 malwarescore=0
 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-2411120000
 definitions=main-2412140112
X-Proofpoint-GUID: xZFIhNOhDP9HyK3JinEdLUdYCu9j8L4p
X-Proofpoint-ORIG-GUID: xZFIhNOhDP9HyK3JinEdLUdYCu9j8L4p
Received-SPF: pass client-ip=205.220.177.32;
 envelope-from=william.roche@oracle.com; helo=mx0b-00069f02.pphosted.com
X-Spam_score_int: -27
X-Spam_score: -2.8
X-Spam_bar: --
X-Spam_report: (-2.8 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1,
 DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1,
 RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=0.001, RCVD_IN_MSPIKE_WL=0.001,
 RCVD_IN_VALIDITY_CERTIFIED_BLOCKED=0.001,
 RCVD_IN_VALIDITY_RPBL_BLOCKED=0.001,
 SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no
X-Spam_action: no action
X-BeenThere: qemu-devel@nongnu.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <qemu-devel.nongnu.org>
List-Unsubscribe: <https://lists.nongnu.org/mailman/options/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <https://lists.nongnu.org/archive/html/qemu-devel>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <https://lists.nongnu.org/mailman/listinfo/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=subscribe>
Errors-To: qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org
Sender: qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org

Series

Poisoned memory recovery on reboot | expand

Message

“William Roche Dec. 14, 2024, 1:45 p.m. UTC

From: William Roche <willia.roche@oracle.com>

Hello David,

Here is an new version of our code and an updated description of the
patch set:

 ---
This set of patches fixes several problems with hardware memory errors
impacting hugetlbfs memory backed VMs and the generic memory recovery
on VM reset.
When using hugetlbfs large pages, any large page location being impacted
by an HW memory error results in poisoning the entire page, suddenly
making a large chunk of the VM memory unusable.

The main problem that currently exists in Qemu is the lack of backend
file repair before resetting the VM memory, resulting in the impacted
memory to be silently unusable even after a VM reboot.

In order to fix this issue, we take into account the page size of the
impacted memory block when dealing with the associated poisoned page
location.

Using the page size information we also try to regenerate the memory
calling ram_block_discard_range() on VM reset when running
qemu_ram_remap(). So that a poisoned memory backed by a hugetlbfs
file is regenerated with a hole punched in this file. A new page is
loaded when the location is first touched.

In case of a discard failure we fall back to remapping the memory
location. We also have to reset the memory settings and honor the
'prealloc' attribute.

This memory setting is performed by a new remap notification mechanism
calling host_memory_backend_ram_remapped() function when a region of
a memory block is remapped.

We also enrich the messages used to report a memory error relayed to
the VM, providing an identification of memory page and its size in
case of a large page impacted.
 ----
 
 About patch 3/7, I still think that generating an additional message
 in the kvm_hwpoison_page_add() function creates a cleaner code without
 the need to repeat it for x86 and ARM. The message would be displayed
 before all the injection messages issued because of the large page
 failure. But we could go with this version if you prefer the existing
 message to be enriched.
 
 About patch 7/7, I could merge it with your patch 6/7 if you agree.
 
 
v3 -> v4:
. Fixed some commit messages typos
. Enhanced some code comments
. Changed the discard fall back conditions to consider only anonymous
  memory
. Fixed missing some variable name changes in intermediary patches.
. Modify the error message given when an error is injected to report
  the case of a large page
. use snprintf() to generate this message
. Adding this same type of message in the ARM case too


This code is scripts/checkpatch.pl clean
'make check' runs fine on both x86 and Arm.


David Hildenbrand (3):
  numa: Introduce and use ram_block_notify_remap()
  hostmem: Factor out applying settings
  hostmem: Handle remapping of RAM

William Roche (4):
  hwpoison_page_list and qemu_ram_remap are based on pages
  system/physmem: poisoned memory discard on reboot
  accel/kvm: Report the loss of a large memory page
  system/physmem: Memory settings applied on remap notification

 accel/kvm/kvm-all.c       |   2 +-
 backends/hostmem.c        | 189 +++++++++++++++++++++++---------------
 hw/core/numa.c            |  11 +++
 include/exec/cpu-common.h |   3 +-
 include/exec/ramlist.h    |   3 +
 include/sysemu/hostmem.h  |   1 +
 system/physmem.c          |  88 +++++++++++++-----
 target/arm/kvm.c          |  13 +++
 target/i386/kvm/kvm.c     |  18 +++-
 9 files changed, 225 insertions(+), 103 deletions(-)