Message ID | 20200107130950.2983-3-Tianyu.Lan@microsoft.com (mailing list archive) |
---|---|
State | New, archived |
Headers | show |
Series | x86/Hyper-V: Add Dynamic memory hot-remove function | expand |
On Tue 07-01-20 21:09:42, lantianyu1986@gmail.com wrote: > From: Tianyu Lan <Tianyu.Lan@microsoft.com> > > Hyper-V balloon driver will use is_mem_section_removable() to > check whether memory block is removable or not when receive > memory hot remove msg. Expose it. I do not think this is a good idea. The check is inherently racy. Why cannot the balloon driver simply hotremove the region when asked?
On 07.01.20 14:36, Michal Hocko wrote: > On Tue 07-01-20 21:09:42, lantianyu1986@gmail.com wrote: >> From: Tianyu Lan <Tianyu.Lan@microsoft.com> >> >> Hyper-V balloon driver will use is_mem_section_removable() to >> check whether memory block is removable or not when receive >> memory hot remove msg. Expose it. > > I do not think this is a good idea. The check is inherently racy. Why > cannot the balloon driver simply hotremove the region when asked? > It's not only racy, it also gives no guarantees. False postives and false negatives are possible. If you want to avoid having to loop forever trying to offline when calling offline_and_remove_memory(), you could try to alloc_contig_range() the memory first and then play the PG_offline+notifier game like virtio-mem. I don't remember clearly, but I think pinned pages can make offlining loop for a long time. And I remember there were other scenarios as well (including out of memory conditions and similar). I sent an RFC [1] for powerpc/memtrace that does the same (just error handling is more complicated as it wants to offline and remove multiple consecutive memory blocks) - if you want to try to go down that path. [1] https://lkml.kernel.org/r/20191217123851.8854-1-david@redhat.com
> From: David Hildenbrand <david@redhat.com> > Sent: Friday, January 10, 2020 9:42 PM > To: Michal Hocko <mhocko@kernel.org>; lantianyu1986@gmail.com > Cc: KY Srinivasan <kys@microsoft.com>; Haiyang Zhang > <haiyangz@microsoft.com>; Stephen Hemminger <sthemmin@microsoft.com>; > sashal@kernel.org; akpm@linux-foundation.org; Michael Kelley > <mikelley@microsoft.com>; Tianyu Lan <Tianyu.Lan@microsoft.com>; linux- > hyperv@vger.kernel.org; linux-kernel@vger.kernel.org; linux-mm@kvack.org; > vkuznets <vkuznets@redhat.com>; eric.devolder@oracle.com; vbabka@suse.cz; > osalvador@suse.de; Pasha Tatashin <Pavel.Tatashin@microsoft.com>; > rppt@linux.ibm.com > Subject: [EXTERNAL] Re: [RFC PATCH V2 2/10] mm: expose > is_mem_section_removable() symbol > > On 07.01.20 14:36, Michal Hocko wrote: > > On Tue 07-01-20 21:09:42, lantianyu1986@gmail.com wrote: > >> From: Tianyu Lan <Tianyu.Lan@microsoft.com> > >> > >> Hyper-V balloon driver will use is_mem_section_removable() to check > >> whether memory block is removable or not when receive memory hot > >> remove msg. Expose it. > > > > I do not think this is a good idea. The check is inherently racy. Why > > cannot the balloon driver simply hotremove the region when asked? > > > > It's not only racy, it also gives no guarantees. False postives and false negatives > are possible. > > If you want to avoid having to loop forever trying to offline when calling > offline_and_remove_memory(), you could try to > alloc_contig_range() the memory first and then play the PG_offline+notifier > game like virtio-mem. > > I don't remember clearly, but I think pinned pages can make offlining loop for a > long time. And I remember there were other scenarios as well (including out of > memory conditions and similar). > > I sent an RFC [1] for powerpc/memtrace that does the same (just error > handling is more complicated as it wants to offline and remove multiple > consecutive memory blocks) - if you want to try to go down that path. > Hi David & Michal: Thanks for your review. Some memory blocks are not suitable for hot-plug. If not check memory block's removable, offline_pages() will report some failure error e.g, "failed due to memory holes" and "failure to isolate range". I think the check maybe added into offline_and_remove_memory()? This may help to not create/expose a new interface to do such check in module.
On 13.01.20 15:49, Tianyu Lan wrote: >> From: David Hildenbrand <david@redhat.com> >> Sent: Friday, January 10, 2020 9:42 PM >> To: Michal Hocko <mhocko@kernel.org>; lantianyu1986@gmail.com >> Cc: KY Srinivasan <kys@microsoft.com>; Haiyang Zhang >> <haiyangz@microsoft.com>; Stephen Hemminger <sthemmin@microsoft.com>; >> sashal@kernel.org; akpm@linux-foundation.org; Michael Kelley >> <mikelley@microsoft.com>; Tianyu Lan <Tianyu.Lan@microsoft.com>; linux- >> hyperv@vger.kernel.org; linux-kernel@vger.kernel.org; linux-mm@kvack.org; >> vkuznets <vkuznets@redhat.com>; eric.devolder@oracle.com; vbabka@suse.cz; >> osalvador@suse.de; Pasha Tatashin <Pavel.Tatashin@microsoft.com>; >> rppt@linux.ibm.com >> Subject: [EXTERNAL] Re: [RFC PATCH V2 2/10] mm: expose >> is_mem_section_removable() symbol >> >> On 07.01.20 14:36, Michal Hocko wrote: >>> On Tue 07-01-20 21:09:42, lantianyu1986@gmail.com wrote: >>>> From: Tianyu Lan <Tianyu.Lan@microsoft.com> >>>> >>>> Hyper-V balloon driver will use is_mem_section_removable() to check >>>> whether memory block is removable or not when receive memory hot >>>> remove msg. Expose it. >>> >>> I do not think this is a good idea. The check is inherently racy. Why >>> cannot the balloon driver simply hotremove the region when asked? >>> >> >> It's not only racy, it also gives no guarantees. False postives and false negatives >> are possible. >> >> If you want to avoid having to loop forever trying to offline when calling >> offline_and_remove_memory(), you could try to >> alloc_contig_range() the memory first and then play the PG_offline+notifier >> game like virtio-mem. >> >> I don't remember clearly, but I think pinned pages can make offlining loop for a >> long time. And I remember there were other scenarios as well (including out of >> memory conditions and similar). >> >> I sent an RFC [1] for powerpc/memtrace that does the same (just error >> handling is more complicated as it wants to offline and remove multiple >> consecutive memory blocks) - if you want to try to go down that path. >> > Hi David & Michal: > Thanks for your review. Some memory blocks are not suitable for hot-plug. > If not check memory block's removable, offline_pages() will report some failure error > e.g, "failed due to memory holes" and "failure to isolate range". I think the check maybe > added into offline_and_remove_memory()? This may help to not create/expose a new > interface to do such check in module. So it's all about the logging output. Duplicating these checks feels very wrong. And you will still get plenty of page dumps (read below), so that won't help. We have pr_debug() for these "failure ..." message. that should therefore not be an issue on production systems, right? However, you will see dump_page()s quite often, which logs via pr_warn(). Of course, we could add a mechanism to temporarily disable logging output for these call paths, but it might actually be helpful for debugging. We might just want to convert everything that is not actually a warning to pr_debug() - especially in dump_page().
On Mon 13-01-20 14:49:38, Tianyu Lan wrote: > Hi David & Michal: > Thanks for your review. Some memory blocks are not suitable for hot-plug. > If not check memory block's removable, offline_pages() will report some failure error > e.g, "failed due to memory holes" and "failure to isolate range". I think the check maybe > added into offline_and_remove_memory()? This may help to not create/expose a new > interface to do such check in module. Why is a log message a problem in the first place. The operation has failed afterall. Does the driver try to offline an arbitrary memory? Could you describe your usecase in more details please?
> From: Michal Hocko <mhocko@kernel.org> > Sent: Tuesday, January 14, 2020 5:51 PM > To: Tianyu Lan <Tianyu.Lan@microsoft.com> > Cc: David Hildenbrand <david@redhat.com>; lantianyu1986@gmail.com; KY > Srinivasan <kys@microsoft.com>; Haiyang Zhang <haiyangz@microsoft.com>; > Stephen Hemminger <sthemmin@microsoft.com>; sashal@kernel.org; > akpm@linux-foundation.org; Michael Kelley <mikelley@microsoft.com>; linux- > hyperv@vger.kernel.org; linux-kernel@vger.kernel.org; linux-mm@kvack.org; > vkuznets <vkuznets@redhat.com>; eric.devolder@oracle.com; vbabka@suse.cz; > osalvador@suse.de; Pasha Tatashin <Pavel.Tatashin@microsoft.com>; > rppt@linux.ibm.com > Subject: Re: [EXTERNAL] Re: [RFC PATCH V2 2/10] mm: expose > is_mem_section_removable() symbol > > On Mon 13-01-20 14:49:38, Tianyu Lan wrote: > > Hi David & Michal: > > Thanks for your review. Some memory blocks are not suitable for hot- > plug. > > If not check memory block's removable, offline_pages() will report > > some failure error e.g, "failed due to memory holes" and "failure to > > isolate range". I think the check maybe added into > > offline_and_remove_memory()? This may help to not create/expose a new > interface to do such check in module. > > Why is a log message a problem in the first place. The operation has failed > afterall. Does the driver try to offline an arbitrary memory? Yes. > Could you describe your usecase in more details please? Hyper-V sends hot-remove request message which just contains requested page number but not provide detail range. So Hyper-V driver needs to search suitable memory block in system memory to return back to host if there is no memory hot-add before. So I used the is_mem_section_removable() do such check.
On Fri 17-01-20 16:35:03, Tianyu Lan wrote: [...] > > Could you describe your usecase in more details please? > > Hyper-V sends hot-remove request message which just contains requested > page number but not provide detail range. So Hyper-V driver needs to search > suitable memory block in system memory to return back to host if there is no > memory hot-add before. So I used the is_mem_section_removable() do such check. As David described, you would be much better of by using alloc_contig_range to find a memory that would be suitable for hotremoving without any races.
diff --git a/mm/memory_hotplug.c b/mm/memory_hotplug.c index d04369e6d3cc..a4ebfc5c48b3 100644 --- a/mm/memory_hotplug.c +++ b/mm/memory_hotplug.c @@ -1179,6 +1179,7 @@ bool is_mem_section_removable(unsigned long start_pfn, unsigned long nr_pages) /* All pageblocks in the memory block are likely to be hot-removable */ return true; } +EXPORT_SYMBOL_GPL(is_mem_section_removable); /* * Confirm all pages in a range [start, end) belong to the same zone.