Message ID | 20250220052027.58847-1-byungchul@sk.com (mailing list archive) |
---|---|
Headers | show |
Series | LUF(Lazy Unmap Flush) reducing tlb numbers over 90% | expand |
On Thu, 20 Feb 2025 14:20:01 +0900 Byungchul Park <byungchul@sk.com> > To check luf's stability, I ran a heavy LLM inference workload consuming > 210GiB over 7 days on a machine with 140GiB memory, and decided it's > stable enough. > > I'm posting the latest version so that anyone can try luf mechanism if > wanted by any chance. However, I tagged RFC again because there are > still issues that should be resolved to merge to mainline: > > 1. Even though system wide total cpu time for TLB shootdown is > reduced over 95%, page allocation paths should take additional cpu > time shifted from page reclaim to perform TLB shootdown. > > 2. We need luf debug feature to detect when luf goes wrong by any > chance. I implemented just a draft version that checks the sanity > on mkwrite(), kmap(), and so on. I need to gather better ideas > to improve the debug feature. > > --- > > Hi everyone, > > While I'm working with a tiered memory system e.g. CXL memory, I have > been facing migration overhead esp. tlb shootdown on promotion or > demotion between different tiers. Yeah.. most tlb shootdowns on > migration through hinting fault can be avoided thanks to Huang Ying's > work, commit 4d4b6d66db ("mm,unmap: avoid flushing tlb in batch if PTE > is inaccessible"). > > However, it's only for migration through hinting fault. I thought it'd > be much better if we have a general mechanism to reduce all the tlb > numbers that we can apply to any unmap code, that we normally believe > tlb flush should be followed. > > I'm suggesting a new mechanism, LUF(Lazy Unmap Flush), that defers tlb > flush until folios that have been unmapped and freed, eventually get > allocated again. It's safe for folios that had been mapped read-only > and were unmapped, as long as the contents of the folios don't change > while staying in pcp or buddy so we can still read the data through the > stale tlb entries. > Given pcp or buddy, you are opening window for use after free which makes no sense in 99% cases.
On Thu, Feb 20, 2025 at 06:32:22PM +0800, Hillf Danton wrote: > On Thu, 20 Feb 2025 14:20:01 +0900 Byungchul Park <byungchul@sk.com> > > To check luf's stability, I ran a heavy LLM inference workload consuming > > 210GiB over 7 days on a machine with 140GiB memory, and decided it's > > stable enough. > > > > I'm posting the latest version so that anyone can try luf mechanism if > > wanted by any chance. However, I tagged RFC again because there are > > still issues that should be resolved to merge to mainline: > > > > 1. Even though system wide total cpu time for TLB shootdown is > > reduced over 95%, page allocation paths should take additional cpu > > time shifted from page reclaim to perform TLB shootdown. > > > > 2. We need luf debug feature to detect when luf goes wrong by any > > chance. I implemented just a draft version that checks the sanity > > on mkwrite(), kmap(), and so on. I need to gather better ideas > > to improve the debug feature. > > > > --- > > > > Hi everyone, > > > > While I'm working with a tiered memory system e.g. CXL memory, I have > > been facing migration overhead esp. tlb shootdown on promotion or > > demotion between different tiers. Yeah.. most tlb shootdowns on > > migration through hinting fault can be avoided thanks to Huang Ying's > > work, commit 4d4b6d66db ("mm,unmap: avoid flushing tlb in batch if PTE > > is inaccessible"). > > > > However, it's only for migration through hinting fault. I thought it'd > > be much better if we have a general mechanism to reduce all the tlb > > numbers that we can apply to any unmap code, that we normally believe > > tlb flush should be followed. > > > > I'm suggesting a new mechanism, LUF(Lazy Unmap Flush), that defers tlb > > flush until folios that have been unmapped and freed, eventually get > > allocated again. It's safe for folios that had been mapped read-only > > and were unmapped, as long as the contents of the folios don't change > > while staying in pcp or buddy so we can still read the data through the > > stale tlb entries. > > > Given pcp or buddy, you are opening window for use after free which makes > no sense in 99% cases. It's kinda 'use(= read only) after free' but luf ensures the data of the interesting pages doesn't change. That's what luf works on. Byungchul
On Thu, Feb 20, 2025 at 06:32:22PM +0800, Hillf Danton wrote: > On Thu, 20 Feb 2025 14:20:01 +0900 Byungchul Park <byungchul@sk.com> > > To check luf's stability, I ran a heavy LLM inference workload consuming > > 210GiB over 7 days on a machine with 140GiB memory, and decided it's > > stable enough. > > > > I'm posting the latest version so that anyone can try luf mechanism if > > wanted by any chance. However, I tagged RFC again because there are > > still issues that should be resolved to merge to mainline: > > > > 1. Even though system wide total cpu time for TLB shootdown is > > reduced over 95%, page allocation paths should take additional cpu > > time shifted from page reclaim to perform TLB shootdown. > > > > 2. We need luf debug feature to detect when luf goes wrong by any > > chance. I implemented just a draft version that checks the sanity > > on mkwrite(), kmap(), and so on. I need to gather better ideas > > to improve the debug feature. > > > > --- > > > > Hi everyone, > > > > While I'm working with a tiered memory system e.g. CXL memory, I have > > been facing migration overhead esp. tlb shootdown on promotion or > > demotion between different tiers. Yeah.. most tlb shootdowns on > > migration through hinting fault can be avoided thanks to Huang Ying's > > work, commit 4d4b6d66db ("mm,unmap: avoid flushing tlb in batch if PTE > > is inaccessible"). > > > > However, it's only for migration through hinting fault. I thought it'd > > be much better if we have a general mechanism to reduce all the tlb > > numbers that we can apply to any unmap code, that we normally believe > > tlb flush should be followed. > > > > I'm suggesting a new mechanism, LUF(Lazy Unmap Flush), that defers tlb > > flush until folios that have been unmapped and freed, eventually get > > allocated again. It's safe for folios that had been mapped read-only > > and were unmapped, as long as the contents of the folios don't change > > while staying in pcp or buddy so we can still read the data through the > > stale tlb entries. > > > Given pcp or buddy, you are opening window for use after free which makes > no sense in 99% cases. Just in case that I don't understand what you meant and for better understanding, can you provide a simple and problematic example from the u-a-f? Byungchul
On Thu, 20 Feb 2025 20:09:35 +0900 Byungchul Park wrote: > On Thu, Feb 20, 2025 at 06:32:22PM +0800, Hillf Danton wrote: > > On Thu, 20 Feb 2025 14:20:01 +0900 Byungchul Park <byungchul@sk.com> > > > To check luf's stability, I ran a heavy LLM inference workload consuming > > > 210GiB over 7 days on a machine with 140GiB memory, and decided it's > > > stable enough. > > > > > > I'm posting the latest version so that anyone can try luf mechanism if > > > wanted by any chance. However, I tagged RFC again because there are > > > still issues that should be resolved to merge to mainline: > > > > > > 1. Even though system wide total cpu time for TLB shootdown is > > > reduced over 95%, page allocation paths should take additional cpu > > > time shifted from page reclaim to perform TLB shootdown. > > > > > > 2. We need luf debug feature to detect when luf goes wrong by any > > > chance. I implemented just a draft version that checks the sanity > > > on mkwrite(), kmap(), and so on. I need to gather better ideas > > > to improve the debug feature. > > > > > > --- > > > > > > Hi everyone, > > > > > > While I'm working with a tiered memory system e.g. CXL memory, I have > > > been facing migration overhead esp. tlb shootdown on promotion or > > > demotion between different tiers. Yeah.. most tlb shootdowns on > > > migration through hinting fault can be avoided thanks to Huang Ying's > > > work, commit 4d4b6d66db ("mm,unmap: avoid flushing tlb in batch if PTE > > > is inaccessible"). > > > > > > However, it's only for migration through hinting fault. I thought it'd > > > be much better if we have a general mechanism to reduce all the tlb > > > numbers that we can apply to any unmap code, that we normally believe > > > tlb flush should be followed. > > > > > > I'm suggesting a new mechanism, LUF(Lazy Unmap Flush), that defers tlb > > > flush until folios that have been unmapped and freed, eventually get > > > allocated again. It's safe for folios that had been mapped read-only > > > and were unmapped, as long as the contents of the folios don't change > > > while staying in pcp or buddy so we can still read the data through the > > > stale tlb entries. > > > > > Given pcp or buddy, you are opening window for use after free which makes > > no sense in 99% cases. > > Just in case that I don't understand what you meant and for better > understanding, can you provide a simple and problematic example from > the u-a-f? > Tell us if it is illegal to commit rape without pregnancy in your home town? PS defering flushing tlb [1,2] is no go. Subject: Re: [PATCH v4 29/30] x86/mm, mm/vmalloc: Defer flush_tlb_kernel_range() targeting NOHZ_FULL CPUs [1] https://lore.kernel.org/lkml/20250127155146.GB25757@willie-the-truck/ [2] https://lore.kernel.org/lkml/xhsmhwmdwihte.mognet@vschneid-thinkpadt14sgen2i.remote.csb/
On Thu, Feb 20, 2025 at 07:49:19PM +0800, Hillf Danton wrote: > On Thu, 20 Feb 2025 20:09:35 +0900 Byungchul Park wrote: > > On Thu, Feb 20, 2025 at 06:32:22PM +0800, Hillf Danton wrote: > > > On Thu, 20 Feb 2025 14:20:01 +0900 Byungchul Park <byungchul@sk.com> > > > > To check luf's stability, I ran a heavy LLM inference workload consuming > > > > 210GiB over 7 days on a machine with 140GiB memory, and decided it's > > > > stable enough. > > > > > > > > I'm posting the latest version so that anyone can try luf mechanism if > > > > wanted by any chance. However, I tagged RFC again because there are > > > > still issues that should be resolved to merge to mainline: > > > > > > > > 1. Even though system wide total cpu time for TLB shootdown is > > > > reduced over 95%, page allocation paths should take additional cpu > > > > time shifted from page reclaim to perform TLB shootdown. > > > > > > > > 2. We need luf debug feature to detect when luf goes wrong by any > > > > chance. I implemented just a draft version that checks the sanity > > > > on mkwrite(), kmap(), and so on. I need to gather better ideas > > > > to improve the debug feature. > > > > > > > > --- > > > > > > > > Hi everyone, > > > > > > > > While I'm working with a tiered memory system e.g. CXL memory, I have > > > > been facing migration overhead esp. tlb shootdown on promotion or > > > > demotion between different tiers. Yeah.. most tlb shootdowns on > > > > migration through hinting fault can be avoided thanks to Huang Ying's > > > > work, commit 4d4b6d66db ("mm,unmap: avoid flushing tlb in batch if PTE > > > > is inaccessible"). > > > > > > > > However, it's only for migration through hinting fault. I thought it'd > > > > be much better if we have a general mechanism to reduce all the tlb > > > > numbers that we can apply to any unmap code, that we normally believe > > > > tlb flush should be followed. > > > > > > > > I'm suggesting a new mechanism, LUF(Lazy Unmap Flush), that defers tlb > > > > flush until folios that have been unmapped and freed, eventually get > > > > allocated again. It's safe for folios that had been mapped read-only > > > > and were unmapped, as long as the contents of the folios don't change > > > > while staying in pcp or buddy so we can still read the data through the > > > > stale tlb entries. > > > > > > > Given pcp or buddy, you are opening window for use after free which makes > > > no sense in 99% cases. > > > > Just in case that I don't understand what you meant and for better > > understanding, can you provide a simple and problematic example from > > the u-a-f? > > > Tell us if it is illegal to commit rape without pregnancy in your home town? Memory overcommit also looked cheating to someone like you. You definitely think it'd be totally non-sense that each task believes it can use its own full virtual space. We say uaf is illegal only when it can cause access the free area without *appropriate permission*. > PS defering flushing tlb [1,2] is no go. I will check this shortly. Byungchul > > Subject: Re: [PATCH v4 29/30] x86/mm, mm/vmalloc: Defer flush_tlb_kernel_range() targeting NOHZ_FULL CPUs > [1] https://lore.kernel.org/lkml/20250127155146.GB25757@willie-the-truck/ > [2] https://lore.kernel.org/lkml/xhsmhwmdwihte.mognet@vschneid-thinkpadt14sgen2i.remote.csb/
On Thu, Feb 20, 2025 at 07:49:19PM +0800, Hillf Danton wrote: > On Thu, 20 Feb 2025 20:09:35 +0900 Byungchul Park wrote: > > On Thu, Feb 20, 2025 at 06:32:22PM +0800, Hillf Danton wrote: > > > On Thu, 20 Feb 2025 14:20:01 +0900 Byungchul Park <byungchul@sk.com> > > > > To check luf's stability, I ran a heavy LLM inference workload consuming > > > > 210GiB over 7 days on a machine with 140GiB memory, and decided it's > > > > stable enough. > > > > > > > > I'm posting the latest version so that anyone can try luf mechanism if > > > > wanted by any chance. However, I tagged RFC again because there are > > > > still issues that should be resolved to merge to mainline: > > > > > > > > 1. Even though system wide total cpu time for TLB shootdown is > > > > reduced over 95%, page allocation paths should take additional cpu > > > > time shifted from page reclaim to perform TLB shootdown. > > > > > > > > 2. We need luf debug feature to detect when luf goes wrong by any > > > > chance. I implemented just a draft version that checks the sanity > > > > on mkwrite(), kmap(), and so on. I need to gather better ideas > > > > to improve the debug feature. > > > > > > > > --- > > > > > > > > Hi everyone, > > > > > > > > While I'm working with a tiered memory system e.g. CXL memory, I have > > > > been facing migration overhead esp. tlb shootdown on promotion or > > > > demotion between different tiers. Yeah.. most tlb shootdowns on > > > > migration through hinting fault can be avoided thanks to Huang Ying's > > > > work, commit 4d4b6d66db ("mm,unmap: avoid flushing tlb in batch if PTE > > > > is inaccessible"). > > > > > > > > However, it's only for migration through hinting fault. I thought it'd > > > > be much better if we have a general mechanism to reduce all the tlb > > > > numbers that we can apply to any unmap code, that we normally believe > > > > tlb flush should be followed. > > > > > > > > I'm suggesting a new mechanism, LUF(Lazy Unmap Flush), that defers tlb > > > > flush until folios that have been unmapped and freed, eventually get > > > > allocated again. It's safe for folios that had been mapped read-only > > > > and were unmapped, as long as the contents of the folios don't change > > > > while staying in pcp or buddy so we can still read the data through the > > > > stale tlb entries. > > > > > > > Given pcp or buddy, you are opening window for use after free which makes > > > no sense in 99% cases. > > > > Just in case that I don't understand what you meant and for better > > understanding, can you provide a simple and problematic example from > > the u-a-f? > > > Tell us if it is illegal to commit rape without pregnancy in your home town? +to Torvalds Logical blame is welcome but I don't want to see potty-mouthed busters like him in Linux community any more. Please, ban him for better community. Byungchul > PS defering flushing tlb [1,2] is no go. > > Subject: Re: [PATCH v4 29/30] x86/mm, mm/vmalloc: Defer flush_tlb_kernel_range() targeting NOHZ_FULL CPUs > [1] https://lore.kernel.org/lkml/20250127155146.GB25757@willie-the-truck/ > [2] https://lore.kernel.org/lkml/xhsmhwmdwihte.mognet@vschneid-thinkpadt14sgen2i.remote.csb/
On Thu, Feb 20, 2025 at 07:49:19PM +0800, Hillf Danton wrote: > On Thu, 20 Feb 2025 20:09:35 +0900 Byungchul Park wrote: > > Just in case that I don't understand what you meant and for better > > understanding, can you provide a simple and problematic example from > > the u-a-f? > > > Tell us if it is illegal to commit rape without pregnancy in your home town? Hillf, this is unacceptable language. You need to apologise.
On Thu, Feb 20, 2025 at 01:54:13PM +0000, Matthew Wilcox wrote: > On Thu, Feb 20, 2025 at 07:49:19PM +0800, Hillf Danton wrote: > > On Thu, 20 Feb 2025 20:09:35 +0900 Byungchul Park wrote: > > > Just in case that I don't understand what you meant and for better > > > understanding, can you provide a simple and problematic example from > > > the u-a-f? > > > > > Tell us if it is illegal to commit rape without pregnancy in your home town? > > Hillf, this is unacceptable language. You need to apologise. Agreed. WTF Hillf? Where did that come from? Is this how you talk to your co-workers? I'll tell you what would happen in my home town. If someone said that to a co-worker, they would likely be terminated. -- Steve
On 2/19/25 21:20, Byungchul Park wrote: > I'm posting the latest version so that anyone can try luf mechanism if > wanted by any chance. However, I tagged RFC again because there are > still issues that should be resolved to merge to mainline: I don't see anything fundamentally different here from the last 11 versions. I think the entire approach is dangerous and basically makes things impossible to debug. It's not clear that some of the failure scenarios that I've brought up in the past have actually been fixed. What I've said here still stands: > https://lore.kernel.org/all/fab1dd64-c652-4160-93b4-7b483a8874da@intel.com/ > I think tglx would call all of this "tinkering". The approach to this > series is to "fix" narrow, specific cases that reviewers point out, make > it compile, then send it out again, hoping someone will apply it. > > So, for me, until the approach to this series changes: NAK, for x86. > Andrew, please don't take this series. Or, if you do, please drop the > patch enabling it on x86. I think I'd also like to stop being cc'd on this. If LUF is merged into mainline and proven to work on arm64 or riscv for a year, I'd be happy to take another look at enabling it on x86. I think that's just about the only thing that would make me reconsider.
On 2/20/25 16:15, Dave Hansen wrote: > On 2/19/25 21:20, Byungchul Park wrote: >> I'm posting the latest version so that anyone can try luf mechanism if >> wanted by any chance. However, I tagged RFC again because there are >> still issues that should be resolved to merge to mainline: > > I don't see anything fundamentally different here from the last 11 > versions. I think the entire approach is dangerous and basically makes > things impossible to debug. It's not clear that some of the failure > scenarios that I've brought up in the past have actually been fixed. Yes, and it's still an invasive change to the buddy allocator. IIRC at Plumbers the opinion in the audience was that there might be ways to improve the batching on unmap to reduce the flushes without such an invasive and potentially dangerous change? Has that been investigated? Also "Rebase on akpm/mm.git mm-unstable(5a7056135b) as of Nov 22, 2024." is very outdated at this point? Thanks, Vlastimil > What I've said here still stands: > >> https://lore.kernel.org/all/fab1dd64-c652-4160-93b4-7b483a8874da@intel.com/ > >> I think tglx would call all of this "tinkering". The approach to this >> series is to "fix" narrow, specific cases that reviewers point out, make >> it compile, then send it out again, hoping someone will apply it. >> >> So, for me, until the approach to this series changes: NAK, for x86. >> Andrew, please don't take this series. Or, if you do, please drop the >> patch enabling it on x86. > > I think I'd also like to stop being cc'd on this. If LUF is merged into > mainline and proven to work on arm64 or riscv for a year, I'd be happy > to take another look at enabling it on x86. I think that's just about > the only thing that would make me reconsider. >
On Thu, Feb 20, 2025 at 10:09:45AM -0500, Steven Rostedt wrote: > On Thu, Feb 20, 2025 at 01:54:13PM +0000, Matthew Wilcox wrote: > > On Thu, Feb 20, 2025 at 07:49:19PM +0800, Hillf Danton wrote: > > > On Thu, 20 Feb 2025 20:09:35 +0900 Byungchul Park wrote: > > > > Just in case that I don't understand what you meant and for better > > > > understanding, can you provide a simple and problematic example from > > > > the u-a-f? > > > > > > > Tell us if it is illegal to commit rape without pregnancy in your home town? > > > > Hillf, this is unacceptable language. You need to apologise. > > Agreed. WTF Hillf? Where did that come from? Is this how you talk to your > co-workers? > > I'll tell you what would happen in my home town. If someone said > that to a co-worker, they would likely be terminated. I can't agree with the "this is a firing offence" approach. We're a community, no one is employed by anyone else here; we work together because we have to and we have to figure out how to get along. We work via consensus, not appeals to authority. However - language like that can _and has_ driven away valuable and long standing members of our community, so people will feel strongly about this. Hillf, I'm going to share a story off list.
On Thu, 20 Feb 2025 17:53:41 -0500 Kent Overstreet <kent.overstreet@linux.dev> wrote: > > I'll tell you what would happen in my home town. If someone said > > that to a co-worker, they would likely be terminated. > > I can't agree with the "this is a firing offence" approach. My point was, if this was in a company, it could very well be a firing offense. > > We're a community, no one is employed by anyone else here; we work > together because we have to and we have to figure out how to get along. > We work via consensus, not appeals to authority. As a community, yes, things are different. But we should not have to tolerate such language. -- Steve
On Thu, Feb 20, 2025 at 06:05:39PM -0500, Steven Rostedt wrote: > On Thu, 20 Feb 2025 17:53:41 -0500 > Kent Overstreet <kent.overstreet@linux.dev> wrote: > > > > I'll tell you what would happen in my home town. If someone said > > > that to a co-worker, they would likely be terminated. > > > > I can't agree with the "this is a firing offence" approach. > > My point was, if this was in a company, it could very well be a firing offense. Well, to a white color worker, yes. But to someone working a blue collor safety critical industry, that's going to come across as rather tame. (And I do get annoyed when people get overly focused on language and forget that _we_ are a safety critical industry. To a first approximation, all the critical infrastructure throughout the world runs on Linux, stuff that doesn't is a rounding error, and all the testing and validation that exists only provides a safety factor. We have to have our shit together, and that does need to come first). That aside - my point isn't about what should and shouldn't be allowed, it's just that norms are arbitrary and it's not the best argument if you want someone to change their behavior. > > We're a community, no one is employed by anyone else here; we work > > together because we have to and we have to figure out how to get along. > > We work via consensus, not appeals to authority. > > As a community, yes, things are different. But we should not have to > tolerate such language. Agreed. And I think we're all aware at this point at how that sort of thing does drive people away, so best not take it so far people start to consider you a liability - or one way or another there's going to be an "or else". This place functions by making people feel respected and valued for the work they do, so a degree of respect and consideration is required.
On Thu, Feb 20, 2025 at 07:15:44AM -0800, Dave Hansen wrote: > On 2/19/25 21:20, Byungchul Park wrote: > > I'm posting the latest version so that anyone can try luf mechanism if > > wanted by any chance. However, I tagged RFC again because there are > > still issues that should be resolved to merge to mainline: > > I don't see anything fundamentally different here from the last 11 > versions. I think the entire approach is dangerous and basically makes > things impossible to debug. It's not clear that some of the failure > scenarios that I've brought up in the past have actually been fixed. Respect your opinion. > What I've said here still stands: > > > https://lore.kernel.org/all/fab1dd64-c652-4160-93b4-7b483a8874da@intel.com/ > > > I think tglx would call all of this "tinkering". The approach to this > > series is to "fix" narrow, specific cases that reviewers point out, make > > it compile, then send it out again, hoping someone will apply it. > > > > So, for me, until the approach to this series changes: NAK, for x86. > > Andrew, please don't take this series. Or, if you do, please drop the > > patch enabling it on x86. > > I think I'd also like to stop being cc'd on this. If LUF is merged into I will un-cc you from the next spin. Byungchul > mainline and proven to work on arm64 or riscv for a year, I'd be happy > to take another look at enabling it on x86. I think that's just about > the only thing that would make me reconsider.
On Thu, 20 Feb 2025 10:09:45 -0500 Steven Rostedt <rostedt@goodmis.org> > On Thu, Feb 20, 2025 at 01:54:13PM +0000, Matthew Wilcox wrote: > > On Thu, Feb 20, 2025 at 07:49:19PM +0800, Hillf Danton wrote: > > > On Thu, 20 Feb 2025 20:09:35 +0900 Byungchul Park wrote: > > > > Just in case that I don't understand what you meant and for better > > > > understanding, can you provide a simple and problematic example from > > > > the u-a-f? > > > > > > > Tell us if it is illegal to commit rape without pregnancy in your home town? > > > > Hillf, this is unacceptable language. You need to apologise. > > Agreed. WTF Hillf? Where did that come from? Is this how you talk to your > co-workers? > > I'll tell you what would happen in my home town. If someone said > that to a co-worker, they would likely be terminated. > Interesting, I want to know if the three words, rape, pregnancy and WTK, could be used before judge in your hometown court by anyone like your lawyer.
On Thu, Feb 20, 2025 at 04:29:51PM +0100, Vlastimil Babka wrote: > On 2/20/25 16:15, Dave Hansen wrote: > > On 2/19/25 21:20, Byungchul Park wrote: > >> I'm posting the latest version so that anyone can try luf mechanism if > >> wanted by any chance. However, I tagged RFC again because there are > >> still issues that should be resolved to merge to mainline: > > > > I don't see anything fundamentally different here from the last 11 > > versions. I think the entire approach is dangerous and basically makes > > things impossible to debug. It's not clear that some of the failure > > scenarios that I've brought up in the past have actually been fixed. > > Yes, and it's still an invasive change to the buddy allocator. Didn't want.. but admit. > IIRC at Plumbers the opinion in the audience was that there might be ways to > improve the batching on unmap to reduce the flushes without such an invasive > and potentially dangerous change? Has that been investigated? Sure. I tried like, by holding those pages not freed until either no one accesses the interesting pages or memory pressure is high. However, unfortunately it was super hard to fix performance degradation by the number of page reclaim increased due to the unfreed pages. > Also "Rebase on akpm/mm.git mm-unstable(5a7056135b) as of Nov 22, 2024." is > very outdated at this point? Sorry for that. I will rebase and share. Byungchul > > Thanks, > Vlastimil > > > What I've said here still stands: > > > >> https://lore.kernel.org/all/fab1dd64-c652-4160-93b4-7b483a8874da@intel.com/ > > > >> I think tglx would call all of this "tinkering". The approach to this > >> series is to "fix" narrow, specific cases that reviewers point out, make > >> it compile, then send it out again, hoping someone will apply it. > >> > >> So, for me, until the approach to this series changes: NAK, for x86. > >> Andrew, please don't take this series. Or, if you do, please drop the > >> patch enabling it on x86. > > > > I think I'd also like to stop being cc'd on this. If LUF is merged into > > mainline and proven to work on arm64 or riscv for a year, I'd be happy > > to take another look at enabling it on x86. I think that's just about > > the only thing that would make me reconsider. > >
On Fri, 21 Feb 2025 07:25:02 +0800 Hillf Danton <hdanton@sina.com> wrote: > > I'll tell you what would happen in my home town. If someone said > > that to a co-worker, they would likely be terminated. > > > Interesting, I want to know if the three words, rape, pregnancy and WTK, > could be used before judge in your hometown court by anyone like your lawyer. Hillf, This isn't a court. And there's no reason to use the word "rape" in a technical conversation on the Linux kernel mailing list. Perhaps a person reading this was a victim of rape. How do you think that would make them feel? Welcomed to our community? Absolutely not. Which is why it's totally unacceptable. -- Steve
On Thu, Feb 20, 2025 at 04:29:51PM +0100, Vlastimil Babka wrote: > On 2/20/25 16:15, Dave Hansen wrote: > > On 2/19/25 21:20, Byungchul Park wrote: > >> I'm posting the latest version so that anyone can try luf mechanism if > >> wanted by any chance. However, I tagged RFC again because there are > >> still issues that should be resolved to merge to mainline: > > > > I don't see anything fundamentally different here from the last 11 > > versions. I think the entire approach is dangerous and basically makes > > things impossible to debug. It's not clear that some of the failure > > scenarios that I've brought up in the past have actually been fixed. > > Yes, and it's still an invasive change to the buddy allocator. > IIRC at Plumbers the opinion in the audience was that there might be ways to > improve the batching on unmap to reduce the flushes without such an invasive > and potentially dangerous change? Has that been investigated? > I know SJ (CCed) is working on making TLB flush batching work for process_madvise().
On Sat, Feb 22, 2025 at 07:05:26AM +0800, Hillf Danton wrote: > On Thu, 20 Feb 2025 18:44:12 -0500 Steven Rostedt <rostedt@goodmis.org> > > On Fri, 21 Feb 2025 07:25:02 +0800 Hillf Danton <hdanton@sina.com> wrote: > > > > I'll tell you what would happen in my home town. If someone said > > > > that to a co-worker, they would likely be terminated. > > > > > > > Interesting, I want to know if the three words, rape, pregnancy and WTK, > > > could be used before judge in your hometown court by anyone like your lawyer. > > > > This isn't a court. And there's no reason to use the word "rape" in a > > technical conversation on the Linux kernel mailing list. Perhaps a person > > reading this was a victim of rape. How do you think that would make them > > feel? Welcomed to our community? Absolutely not. Which is why it's totally > > unacceptable. > > > There are NAK victims. Did you nak more than twice a week, Steve? Hillf, This is not the way to work with your fellow developers in the community to express disagreements. I would recommend following up with an apology. thanks, greg k-h (On behalf of the Code of Conduct Committee)
On Sat, Feb 22, 2025 at 06:10:59PM +0800, Hillf Danton wrote: > On Sat, 22 Feb 2025 08:16:09 +0100 Greg KH <gregkh@linuxfoundation.org> > > On Sat, Feb 22, 2025 at 07:05:26AM +0800, Hillf Danton wrote: > > > On Thu, 20 Feb 2025 18:44:12 -0500 Steven Rostedt <rostedt@goodmis.org> > > > > On Fri, 21 Feb 2025 07:25:02 +0800 Hillf Danton <hdanton@sina.com> wrote: > > > > > > I'll tell you what would happen in my home town. If someone said > > > > > > that to a co-worker, they would likely be terminated. > > > > > > > > > > > Interesting, I want to know if the three words, rape, pregnancy and WTK, > > > > > could be used before judge in your hometown court by anyone like your lawyer. > > > > > > > > This isn't a court. And there's no reason to use the word "rape" in a > > > > technical conversation on the Linux kernel mailing list. Perhaps a person > > > > reading this was a victim of rape. How do you think that would make them > > > > feel? Welcomed to our community? Absolutely not. Which is why it's totally > > > > unacceptable. > > > > > > > There are NAK victims. Did you nak more than twice a week, Steve? > > > > This is not the way to work with your fellow developers in the community > > to express disagreements. > > > No comment because you are free to express disagreements. Disagreements are fine, but not with words like what you used, sorry, that was unacceptable and requires an apology. > > I would recommend following up with an apology. > > > It would take some time for me to opt to follow/ignore what you recommended. Please take the time to do so very soon. thanks, greg k-h (On behalf of the Code of Conduct Committee)