KVM: guest: only batch user pte updates

Marcelo Tosatti wrote:
> KVM's paravirt mmu pte batching has issues with, at least, kernel
> updates from DEBUG_PAGEALLOC.
>
> This has been experienced with slab allocation from irq context from
> within lazy mmu sections:
>
> https://bugzilla.redhat.com/show_bug.cgi?id=480822
>
> DEBUG_PAGEALLOC will map/unmap the kernel pagetables to catch bad
> accesses, with code such as:
>
>                 __change_page_attr():
>
>                 /*
>                  * Do we really change anything ?
>                  */
>                 if (pte_val(old_pte) != pte_val(new_pte)) {
>                         set_pte_atomic(kpte, new_pte);
>                         cpa->flags |= CPA_FLUSHTLB;
>                 }
>
> A present->nonpresent update can be queued, but not yet committed to
> memory. So the set_pte_atomic will be skipped but the update flushed
> afterwards. set_pte_ATOMIC.
>   

Are you saying that there's a queued update which means that old_pte is 
a stale value which happens to equal new_pte, so new_pte is never set?  
OK, sounds like a generic problem, of the same sort we've had with 
kmap_atomic being used in interrupt routines in lazy mode.

In this case, I think the proper fix is to call 
arch_flush_lazy_mmu_mode() before reading old_pte to make sure its up to 
date, and calling it again when processing CPA_FLUSHTLB.  Could you try 
the patch below instead?

(BTW, set_pte_atomic doesn't mean synchronous; it just means its safe to 
use on live ptes on 32-bit PAE machines which can't otherwise atomically 
update a pte.)

    J

commit 264d7d09de69b1f729adb43acc86bd504dd21251
Author: Jeremy Fitzhardinge <jeremy.fitzhardinge@citrix.com>
Date:   Tue Feb 10 14:15:52 2009 -0800

    x86/cpa: make sure cpa is safe to call in lazy mmu mode

    The CPA code may be called while we're in lazy mmu update mode - for
    example, when using DEBUG_PAGE_ALLOC and doing a slab allocation
    in an interrupt handler which interrupted a lazy mmu update.  In this
    case, the in memory pagetable state may be out of date due to pending
    queued updates.  We need to flush any pending updates before inspecting
    the page table.  Similarly, we must explicitly flush any modifications
    CPA may have made (which comes down to flushing queued operations when
    flushing the TLB).

    Signed-off-by: Jeremy Fitzhardinge <jeremy.fitzhardinge@citrix.com>

--
To unsubscribe from this list: send the line "unsubscribe kvm" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

KVM: guest: only batch user pte updates

Commit Message

Comments

Patch