Message ID | 20130627092152.GB10758@redhat.com (mailing list archive) |
---|---|
State | New, archived |
Headers | show |
On 06/27/2013 05:21 PM, Gleb Natapov wrote: > On Thu, Jun 27, 2013 at 12:14:24PM +0300, Gleb Natapov wrote: >> On Thu, Jun 27, 2013 at 12:01:10PM +0300, Gleb Natapov wrote: >>> On Thu, Jun 27, 2013 at 11:29:00AM +0300, Gleb Natapov wrote: >>>> On Fri, Jun 07, 2013 at 04:51:26PM +0800, Xiao Guangrong wrote: >>>>> This patch tries to introduce a very simple and scale way to invalidate >>>>> all mmio sptes - it need not walk any shadow pages and hold mmu-lock >>>>> >>>>> KVM maintains a global mmio valid generation-number which is stored in >>>>> kvm->memslots.generation and every mmio spte stores the current global >>>>> generation-number into his available bits when it is created >>>>> >>>>> When KVM need zap all mmio sptes, it just simply increase the global >>>>> generation-number. When guests do mmio access, KVM intercepts a MMIO #PF >>>>> then it walks the shadow page table and get the mmio spte. If the >>>>> generation-number on the spte does not equal the global generation-number, >>>>> it will go to the normal #PF handler to update the mmio spte >>>>> >>>>> Since 19 bits are used to store generation-number on mmio spte, we zap all >>>>> mmio sptes when the number is round >>>>> >>>> So this commit makes Fedora 9 32 bit reboot during boot, Fedora 9 64 >>>> fails too, but I haven't checked what happens exactly. >>>> >>> Something wrong with gfn calculation during mmio: >>> >>> qemu-system-x86-17003 [000] 3962.625103: handle_mmio_page_fault: addr:c00ba6c0 gfn 100000000ba access a92 >>> qemu-system-x86-17003 [000] 3962.774862: handle_mmio_page_fault: addr:ffffb170 gfn 100000fee00 access a92 >>> >> Hmm, so I wounder why get_mmio_spte_gfn() does not clear gen bits. >> > Hmm, something like patch below fixes it. Will test more. > > > diff --git a/arch/x86/kvm/mmu.c b/arch/x86/kvm/mmu.c > index 1fd2c05..aec9c05 100644 > --- a/arch/x86/kvm/mmu.c > +++ b/arch/x86/kvm/mmu.c > @@ -260,7 +260,8 @@ static bool is_mmio_spte(u64 spte) > > static gfn_t get_mmio_spte_gfn(u64 spte) > { > - return (spte & ~shadow_mmio_mask) >> PAGE_SHIFT; > + u64 mask = generation_mmio_spte_mask(MMIO_MAX_GEN) | shadow_mmio_mask; > + return (spte & ~mask) >> PAGE_SHIFT; > } Looks nice. Gleb, thank you very much for investigating the bug and fixing my mistake. I will be more careful in the further developments. -- To unsubscribe from this list: send the line "unsubscribe kvm" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On Thu, Jun 27, 2013 at 05:50:08PM +0800, Xiao Guangrong wrote: > On 06/27/2013 05:21 PM, Gleb Natapov wrote: > > On Thu, Jun 27, 2013 at 12:14:24PM +0300, Gleb Natapov wrote: > >> On Thu, Jun 27, 2013 at 12:01:10PM +0300, Gleb Natapov wrote: > >>> On Thu, Jun 27, 2013 at 11:29:00AM +0300, Gleb Natapov wrote: > >>>> On Fri, Jun 07, 2013 at 04:51:26PM +0800, Xiao Guangrong wrote: > >>>>> This patch tries to introduce a very simple and scale way to invalidate > >>>>> all mmio sptes - it need not walk any shadow pages and hold mmu-lock > >>>>> > >>>>> KVM maintains a global mmio valid generation-number which is stored in > >>>>> kvm->memslots.generation and every mmio spte stores the current global > >>>>> generation-number into his available bits when it is created > >>>>> > >>>>> When KVM need zap all mmio sptes, it just simply increase the global > >>>>> generation-number. When guests do mmio access, KVM intercepts a MMIO #PF > >>>>> then it walks the shadow page table and get the mmio spte. If the > >>>>> generation-number on the spte does not equal the global generation-number, > >>>>> it will go to the normal #PF handler to update the mmio spte > >>>>> > >>>>> Since 19 bits are used to store generation-number on mmio spte, we zap all > >>>>> mmio sptes when the number is round > >>>>> > >>>> So this commit makes Fedora 9 32 bit reboot during boot, Fedora 9 64 > >>>> fails too, but I haven't checked what happens exactly. > >>>> > >>> Something wrong with gfn calculation during mmio: > >>> > >>> qemu-system-x86-17003 [000] 3962.625103: handle_mmio_page_fault: addr:c00ba6c0 gfn 100000000ba access a92 > >>> qemu-system-x86-17003 [000] 3962.774862: handle_mmio_page_fault: addr:ffffb170 gfn 100000fee00 access a92 > >>> > >> Hmm, so I wounder why get_mmio_spte_gfn() does not clear gen bits. > >> > > Hmm, something like patch below fixes it. Will test more. > > > > > > diff --git a/arch/x86/kvm/mmu.c b/arch/x86/kvm/mmu.c > > index 1fd2c05..aec9c05 100644 > > --- a/arch/x86/kvm/mmu.c > > +++ b/arch/x86/kvm/mmu.c > > @@ -260,7 +260,8 @@ static bool is_mmio_spte(u64 spte) > > > > static gfn_t get_mmio_spte_gfn(u64 spte) > > { > > - return (spte & ~shadow_mmio_mask) >> PAGE_SHIFT; > > + u64 mask = generation_mmio_spte_mask(MMIO_MAX_GEN) | shadow_mmio_mask; > > + return (spte & ~mask) >> PAGE_SHIFT; > > } > > Looks nice. > The question is if get_mmio_spte_access() need the same treatment? -- Gleb. -- To unsubscribe from this list: send the line "unsubscribe kvm" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On 06/27/2013 06:19 PM, Gleb Natapov wrote: > On Thu, Jun 27, 2013 at 05:50:08PM +0800, Xiao Guangrong wrote: >> On 06/27/2013 05:21 PM, Gleb Natapov wrote: >>> On Thu, Jun 27, 2013 at 12:14:24PM +0300, Gleb Natapov wrote: >>>> On Thu, Jun 27, 2013 at 12:01:10PM +0300, Gleb Natapov wrote: >>>>> On Thu, Jun 27, 2013 at 11:29:00AM +0300, Gleb Natapov wrote: >>>>>> On Fri, Jun 07, 2013 at 04:51:26PM +0800, Xiao Guangrong wrote: >>>>>>> This patch tries to introduce a very simple and scale way to invalidate >>>>>>> all mmio sptes - it need not walk any shadow pages and hold mmu-lock >>>>>>> >>>>>>> KVM maintains a global mmio valid generation-number which is stored in >>>>>>> kvm->memslots.generation and every mmio spte stores the current global >>>>>>> generation-number into his available bits when it is created >>>>>>> >>>>>>> When KVM need zap all mmio sptes, it just simply increase the global >>>>>>> generation-number. When guests do mmio access, KVM intercepts a MMIO #PF >>>>>>> then it walks the shadow page table and get the mmio spte. If the >>>>>>> generation-number on the spte does not equal the global generation-number, >>>>>>> it will go to the normal #PF handler to update the mmio spte >>>>>>> >>>>>>> Since 19 bits are used to store generation-number on mmio spte, we zap all >>>>>>> mmio sptes when the number is round >>>>>>> >>>>>> So this commit makes Fedora 9 32 bit reboot during boot, Fedora 9 64 >>>>>> fails too, but I haven't checked what happens exactly. >>>>>> >>>>> Something wrong with gfn calculation during mmio: >>>>> >>>>> qemu-system-x86-17003 [000] 3962.625103: handle_mmio_page_fault: addr:c00ba6c0 gfn 100000000ba access a92 >>>>> qemu-system-x86-17003 [000] 3962.774862: handle_mmio_page_fault: addr:ffffb170 gfn 100000fee00 access a92 >>>>> >>>> Hmm, so I wounder why get_mmio_spte_gfn() does not clear gen bits. >>>> >>> Hmm, something like patch below fixes it. Will test more. >>> >>> >>> diff --git a/arch/x86/kvm/mmu.c b/arch/x86/kvm/mmu.c >>> index 1fd2c05..aec9c05 100644 >>> --- a/arch/x86/kvm/mmu.c >>> +++ b/arch/x86/kvm/mmu.c >>> @@ -260,7 +260,8 @@ static bool is_mmio_spte(u64 spte) >>> >>> static gfn_t get_mmio_spte_gfn(u64 spte) >>> { >>> - return (spte & ~shadow_mmio_mask) >> PAGE_SHIFT; >>> + u64 mask = generation_mmio_spte_mask(MMIO_MAX_GEN) | shadow_mmio_mask; >>> + return (spte & ~mask) >> PAGE_SHIFT; >>> } >> >> Looks nice. >> > The question is if get_mmio_spte_access() need the same treatment? It works okay since the Access only uses bit1 and bit2 (and in the direct mmu case, only use gfn). But i am happy to do the same change in get_mmio_spte_access() to make the code more clear. -- To unsubscribe from this list: send the line "unsubscribe kvm" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On Thu, Jun 27, 2013 at 07:05:20PM +0800, Xiao Guangrong wrote: > On 06/27/2013 06:19 PM, Gleb Natapov wrote: > > On Thu, Jun 27, 2013 at 05:50:08PM +0800, Xiao Guangrong wrote: > >> On 06/27/2013 05:21 PM, Gleb Natapov wrote: > >>> On Thu, Jun 27, 2013 at 12:14:24PM +0300, Gleb Natapov wrote: > >>>> On Thu, Jun 27, 2013 at 12:01:10PM +0300, Gleb Natapov wrote: > >>>>> On Thu, Jun 27, 2013 at 11:29:00AM +0300, Gleb Natapov wrote: > >>>>>> On Fri, Jun 07, 2013 at 04:51:26PM +0800, Xiao Guangrong wrote: > >>>>>>> This patch tries to introduce a very simple and scale way to invalidate > >>>>>>> all mmio sptes - it need not walk any shadow pages and hold mmu-lock > >>>>>>> > >>>>>>> KVM maintains a global mmio valid generation-number which is stored in > >>>>>>> kvm->memslots.generation and every mmio spte stores the current global > >>>>>>> generation-number into his available bits when it is created > >>>>>>> > >>>>>>> When KVM need zap all mmio sptes, it just simply increase the global > >>>>>>> generation-number. When guests do mmio access, KVM intercepts a MMIO #PF > >>>>>>> then it walks the shadow page table and get the mmio spte. If the > >>>>>>> generation-number on the spte does not equal the global generation-number, > >>>>>>> it will go to the normal #PF handler to update the mmio spte > >>>>>>> > >>>>>>> Since 19 bits are used to store generation-number on mmio spte, we zap all > >>>>>>> mmio sptes when the number is round > >>>>>>> > >>>>>> So this commit makes Fedora 9 32 bit reboot during boot, Fedora 9 64 > >>>>>> fails too, but I haven't checked what happens exactly. > >>>>>> > >>>>> Something wrong with gfn calculation during mmio: > >>>>> > >>>>> qemu-system-x86-17003 [000] 3962.625103: handle_mmio_page_fault: addr:c00ba6c0 gfn 100000000ba access a92 > >>>>> qemu-system-x86-17003 [000] 3962.774862: handle_mmio_page_fault: addr:ffffb170 gfn 100000fee00 access a92 > >>>>> > >>>> Hmm, so I wounder why get_mmio_spte_gfn() does not clear gen bits. > >>>> > >>> Hmm, something like patch below fixes it. Will test more. > >>> > >>> > >>> diff --git a/arch/x86/kvm/mmu.c b/arch/x86/kvm/mmu.c > >>> index 1fd2c05..aec9c05 100644 > >>> --- a/arch/x86/kvm/mmu.c > >>> +++ b/arch/x86/kvm/mmu.c > >>> @@ -260,7 +260,8 @@ static bool is_mmio_spte(u64 spte) > >>> > >>> static gfn_t get_mmio_spte_gfn(u64 spte) > >>> { > >>> - return (spte & ~shadow_mmio_mask) >> PAGE_SHIFT; > >>> + u64 mask = generation_mmio_spte_mask(MMIO_MAX_GEN) | shadow_mmio_mask; > >>> + return (spte & ~mask) >> PAGE_SHIFT; > >>> } > >> > >> Looks nice. > >> > > The question is if get_mmio_spte_access() need the same treatment? > > It works okay since the Access only uses bit1 and bit2 (and in the direct mmu > case, only use gfn). But i am happy to do the same change in get_mmio_spte_access() > to make the code more clear. > It will fix output of handle_mmio_page_fault at least. Currently we have "access a92" there. -- Gleb. -- To unsubscribe from this list: send the line "unsubscribe kvm" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
diff --git a/arch/x86/kvm/mmu.c b/arch/x86/kvm/mmu.c index 1fd2c05..aec9c05 100644 --- a/arch/x86/kvm/mmu.c +++ b/arch/x86/kvm/mmu.c @@ -260,7 +260,8 @@ static bool is_mmio_spte(u64 spte) static gfn_t get_mmio_spte_gfn(u64 spte) { - return (spte & ~shadow_mmio_mask) >> PAGE_SHIFT; + u64 mask = generation_mmio_spte_mask(MMIO_MAX_GEN) | shadow_mmio_mask; + return (spte & ~mask) >> PAGE_SHIFT; } static unsigned get_mmio_spte_access(u64 spte)