Message ID | 5241DA5B.8000909@gmail.com (mailing list archive) |
---|---|
State | Not Applicable, archived |
Headers | show |
Hello, On Wed, Sep 25, 2013 at 02:30:51AM +0800, Zhang Yanfei wrote: > +/** > + * memory_map_bottom_up - Map [map_start, map_end) bottom up > + * @map_start: start address of the target memory range > + * @map_end: end address of the target memory range > + * > + * This function will setup direct mapping for memory range > + * [map_start, map_end) in bottom-up. Ditto about the comment. > + */ > +static void __init memory_map_bottom_up(unsigned long map_start, > + unsigned long map_end) > +{ > + unsigned long next, new_mapped_ram_size, start; > + unsigned long mapped_ram_size = 0; > + /* step_size need to be small so pgt_buf from BRK could cover it */ > + unsigned long step_size = PMD_SIZE; > + > + start = map_start; > + min_pfn_mapped = start >> PAGE_SHIFT; > + > + /* > + * We start from the bottom (@map_start) and go to the top (@map_end). > + * The memblock_find_in_range() gets us a block of RAM from the > + * end of RAM in [min_pfn_mapped, max_pfn_mapped) used as new pages > + * for page table. > + */ > + while (start < map_end) { > + if (map_end - start > step_size) { > + next = round_up(start + 1, step_size); > + if (next > map_end) > + next = map_end; > + } else > + next = map_end; > + > + new_mapped_ram_size = init_range_memory_mapping(start, next); > + start = next; > + > + if (new_mapped_ram_size > mapped_ram_size) > + step_size <<= STEP_SIZE_SHIFT; > + mapped_ram_size += new_mapped_ram_size; > + } > +} As Yinghai pointed out in another thread, do we need to worry about falling back to top-down? Thanks.
Hello tejun, On 09/26/2013 10:48 PM, Tejun Heo wrote: > Hello, > > On Wed, Sep 25, 2013 at 02:30:51AM +0800, Zhang Yanfei wrote: >> +/** >> + * memory_map_bottom_up - Map [map_start, map_end) bottom up >> + * @map_start: start address of the target memory range >> + * @map_end: end address of the target memory range >> + * >> + * This function will setup direct mapping for memory range >> + * [map_start, map_end) in bottom-up. > > Ditto about the comment. OK, will do. > >> + */ >> +static void __init memory_map_bottom_up(unsigned long map_start, >> + unsigned long map_end) >> +{ >> + unsigned long next, new_mapped_ram_size, start; >> + unsigned long mapped_ram_size = 0; >> + /* step_size need to be small so pgt_buf from BRK could cover it */ >> + unsigned long step_size = PMD_SIZE; >> + >> + start = map_start; >> + min_pfn_mapped = start >> PAGE_SHIFT; >> + >> + /* >> + * We start from the bottom (@map_start) and go to the top (@map_end). >> + * The memblock_find_in_range() gets us a block of RAM from the >> + * end of RAM in [min_pfn_mapped, max_pfn_mapped) used as new pages >> + * for page table. >> + */ >> + while (start < map_end) { >> + if (map_end - start > step_size) { >> + next = round_up(start + 1, step_size); >> + if (next > map_end) >> + next = map_end; >> + } else >> + next = map_end; >> + >> + new_mapped_ram_size = init_range_memory_mapping(start, next); >> + start = next; >> + >> + if (new_mapped_ram_size > mapped_ram_size) >> + step_size <<= STEP_SIZE_SHIFT; >> + mapped_ram_size += new_mapped_ram_size; >> + } >> +} > > As Yinghai pointed out in another thread, do we need to worry about > falling back to top-down? I've explained to him. Nop, we don't need to worry about that. Because even the min_pfn_mapped becomes ISA_END_ADDRESS in the second call below, we won't allocate memory below the kernel because we have limited the allocation above the kernel. Thanks.
On Thu, Sep 26, 2013 at 11:43:02PM +0800, Zhang Yanfei wrote: > > As Yinghai pointed out in another thread, do we need to worry about > > falling back to top-down? > > I've explained to him. Nop, we don't need to worry about that. Because even > the min_pfn_mapped becomes ISA_END_ADDRESS in the second call below, we won't > allocate memory below the kernel because we have limited the allocation above > the kernel. Maybe I misunderstood but wasn't he worrying about there not being enough space above kernel? In that case, it'd automatically fall back to top-down allocation anyway, right? Thanks.
On 09/26/2013 11:48 PM, Tejun Heo wrote: > On Thu, Sep 26, 2013 at 11:43:02PM +0800, Zhang Yanfei wrote: >>> As Yinghai pointed out in another thread, do we need to worry about >>> falling back to top-down? >> >> I've explained to him. Nop, we don't need to worry about that. Because even >> the min_pfn_mapped becomes ISA_END_ADDRESS in the second call below, we won't >> allocate memory below the kernel because we have limited the allocation above >> the kernel. > > Maybe I misunderstood but wasn't he worrying about there not being > enough space above kernel? In that case, it'd automatically fall back > to top-down allocation anyway, right? Ah, I see. You are saying another issue. He is worrying that if we use kexec to load the kernel high, say we have 16GB, we put the kernel in 15.99GB (just an example), so we only have less than 100MB above the kernel. But as I've explained to him, in almost all the cases, if we want our memory hotplug work, we don't do that. And yeah, assume we have this problem, it'd fall back to top down and that return backs to patch 2, we will trigger the WARN_ONCE, and the admin will know what has happened. Thanks.
On Fri, Sep 27, 2013 at 12:03:01AM +0800, Zhang Yanfei wrote: > Ah, I see. You are saying another issue. He is worrying that if we use > kexec to load the kernel high, say we have 16GB, we put the kernel in > 15.99GB (just an example), so we only have less than 100MB above the kernel. > > But as I've explained to him, in almost all the cases, if we want our > memory hotplug work, we don't do that. And yeah, assume we have this > problem, it'd fall back to top down and that return backs to patch 2, > we will trigger the WARN_ONCE, and the admin will know what has happened. Alright, Acked-by: Tejun Heo <tj@kernel.org> Thanks.
On 09/26/2013 10:48 PM, Tejun Heo wrote: > Hello, > > On Wed, Sep 25, 2013 at 02:30:51AM +0800, Zhang Yanfei wrote: >> +/** >> + * memory_map_bottom_up - Map [map_start, map_end) bottom up >> + * @map_start: start address of the target memory range >> + * @map_end: end address of the target memory range >> + * >> + * This function will setup direct mapping for memory range >> + * [map_start, map_end) in bottom-up. > > Ditto about the comment. Trying below: /** * memory_map_bottom_up - Map [map_start, map_end) bottom up * @map_start: start address of the target memory range * @map_end: end address of the target memory range * * This function will setup direct mapping for memory range * [map_start, map_end) in bottom-up. Since we have limited the * bottom-up allocation above the kernel, the page tables will * be allocated just above the kernel and we map the memory * in [map_start, map_end) in bottom-up. */ static void __init memory_map_bottom_up(unsigned long map_start, unsigned long map_end) { Thanks.
diff --git a/arch/x86/mm/init.c b/arch/x86/mm/init.c index dbe57e5..d35363e 100644 --- a/arch/x86/mm/init.c +++ b/arch/x86/mm/init.c @@ -456,6 +456,48 @@ static void __init memory_map_top_down(unsigned long map_start, init_range_memory_mapping(real_end, map_end); } +/** + * memory_map_bottom_up - Map [map_start, map_end) bottom up + * @map_start: start address of the target memory range + * @map_end: end address of the target memory range + * + * This function will setup direct mapping for memory range + * [map_start, map_end) in bottom-up. + */ +static void __init memory_map_bottom_up(unsigned long map_start, + unsigned long map_end) +{ + unsigned long next, new_mapped_ram_size, start; + unsigned long mapped_ram_size = 0; + /* step_size need to be small so pgt_buf from BRK could cover it */ + unsigned long step_size = PMD_SIZE; + + start = map_start; + min_pfn_mapped = start >> PAGE_SHIFT; + + /* + * We start from the bottom (@map_start) and go to the top (@map_end). + * The memblock_find_in_range() gets us a block of RAM from the + * end of RAM in [min_pfn_mapped, max_pfn_mapped) used as new pages + * for page table. + */ + while (start < map_end) { + if (map_end - start > step_size) { + next = round_up(start + 1, step_size); + if (next > map_end) + next = map_end; + } else + next = map_end; + + new_mapped_ram_size = init_range_memory_mapping(start, next); + start = next; + + if (new_mapped_ram_size > mapped_ram_size) + step_size <<= STEP_SIZE_SHIFT; + mapped_ram_size += new_mapped_ram_size; + } +} + void __init init_mem_mapping(void) { unsigned long end; @@ -471,8 +513,26 @@ void __init init_mem_mapping(void) /* the ISA range is always mapped regardless of memory holes */ init_memory_mapping(0, ISA_END_ADDRESS); - /* setup direct mapping for range [ISA_END_ADDRESS, end) in top-down*/ - memory_map_top_down(ISA_END_ADDRESS, end); + /* + * If the allocation is in bottom-up direction, we setup direct mapping + * in bottom-up, otherwise we setup direct mapping in top-down. + */ + if (memblock_bottom_up()) { + unsigned long kernel_end; + + kernel_end = __pa_symbol(_end); + /* + * we need two separate calls here. This is because we want to + * allocate page tables above the kernel. So we first map + * [kernel_end, end) to make memory above the kernel be mapped + * as soon as possible. And then use page tables allocated above + * the kernel to map [ISA_END_ADDRESS, kernel_end). + */ + memory_map_bottom_up(kernel_end, end); + memory_map_bottom_up(ISA_END_ADDRESS, kernel_end); + } else { + memory_map_top_down(ISA_END_ADDRESS, end); + } #ifdef CONFIG_X86_64 if (max_pfn > max_low_pfn) {