[v4,4/7] x86/HVM: implement memory read caching for insn emulation

Emulation requiring device model assistance uses a form of instruction
re-execution, assuming that the second (and any further) pass takes
exactly the same path. This is a valid assumption as far as use of CPU
registers goes (as those can't change without any other instruction
executing in between), but is wrong for memory accesses. In particular
it has been observed that Windows might page out buffers underneath an
instruction currently under emulation (hitting between two passes). If
the first pass read a memory operand successfully, any subsequent pass
needs to get to see the exact same value.

Introduce a cache to make sure the above-described assumption holds. This
is a very simplistic implementation for now: Only exact matches are
satisfied (no overlaps or partial reads or anything); this is sufficient
for the immediate purpose of making re-execution an exact replay. The
cache also won't be used just yet for guest page walks; that'll be the
subject of a subsequent change.
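
To illustrate the exact-match behavior described above, here is a minimal,
standalone sketch. It is not the code from this patch: all names
(sketch_cache, sketch_read_guest(), the capacity constants) are hypothetical,
and guest memory is modeled as a plain byte array indexed by address.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <string.h>

#define SKETCH_CACHE_ENTS 8 /* hypothetical capacity */
#define SKETCH_MAX_BYTES  8 /* hypothetical largest single read */

struct sketch_cache_ent {
    uint64_t gpa;                   /* address the read started at */
    unsigned int size;              /* number of bytes read */
    uint8_t data[SKETCH_MAX_BYTES]; /* bytes seen by the first pass */
};

struct sketch_cache {
    unsigned int num_ents;
    struct sketch_cache_ent ents[SKETCH_CACHE_ENTS];
};

/* A hit requires the same address AND the same size; no overlaps. */
static bool sketch_cache_lookup(const struct sketch_cache *c, uint64_t gpa,
                                void *buf, unsigned int size)
{
    for ( unsigned int i = 0; i < c->num_ents; ++i )
        if ( c->ents[i].gpa == gpa && c->ents[i].size == size )
        {
            memcpy(buf, c->ents[i].data, size);
            return true;
        }

    return false;
}

/*
 * Read 'size' bytes at 'gpa'. The first pass reads memory and records the
 * result; any later pass asking for the same (gpa, size) gets the recorded
 * bytes, even if memory changed in between. Returns false if the value
 * could not be recorded because the cache is full.
 */
static bool sketch_read_guest(struct sketch_cache *c, const uint8_t *mem,
                              uint64_t gpa, void *buf, unsigned int size)
{
    struct sketch_cache_ent *ent;

    assert(size <= SKETCH_MAX_BYTES);

    if ( sketch_cache_lookup(c, gpa, buf, size) )
        return true;

    memcpy(buf, mem + gpa, size);

    if ( c->num_ents == SKETCH_CACHE_ENTS )
        return false;

    ent = &c->ents[c->num_ents++];
    ent->gpa = gpa;
    ent->size = size;
    memcpy(ent->data, buf, size);

    return true;
}
```

In this sketch a second 4-byte read at the same address replays the first
result, whereas a 2-byte read at that address is a miss and observes current
memory. That is the "no overlaps or partial reads" limitation in miniature.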

With the cache being generally transparent to upper layers, but with it
having limited capacity yet being required for correctness, certain
users of hvm_copy_from_guest_*() need to disable caching temporarily,
without invalidating the cache. Note that the adjustments here to
hvm_hypercall() and hvm_task_switch() are benign at this point; they'll
become relevant once we start to be able to emulate respective insns
through the main emulator (and more changes will then likely be needed
to nested code).
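
One way to picture "disable temporarily, without invalidating" is a
save/restore token, sketched below. This is an assumption made for
illustration only; the struct and the three helper names are hypothetical
and are not the interface added by this patch.

```c
#include <assert.h>
#include <stdbool.h>

struct sketch_cache_state {
    unsigned int num_ents; /* recorded entries; untouched by disabling */
    bool disabled;         /* while set, reads bypass the cache */
};

/* Disable caching and return the previous state as a token. */
static bool sketch_cache_disable(struct sketch_cache_state *c)
{
    bool was_disabled = c->disabled;

    c->disabled = true;

    return was_disabled;
}

/* Put back the state saved by the matching sketch_cache_disable(). */
static void sketch_cache_restore(struct sketch_cache_state *c, bool token)
{
    assert(c->disabled);
    c->disabled = token;
}

/* Whether reads should currently consult (and populate) the cache. */
static bool sketch_cache_active(const struct sketch_cache_state *c)
{
    return !c->disabled;
}
```

Because the token carries the earlier state, nested disable/restore pairs
compose: restoring an inner pair leaves caching off until the outermost
restore, and the recorded entries remain available for the next replay pass.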

As to the actual data page in this scenario, there are a couple of
aspects to take into consideration:
- We must be talking about an insn accessing two locations (two memory
  ones, one of which is MMIO, or a memory and an I/O one).
- If the non I/O / MMIO side is being read, the re-read (if it occurs at
  all) has its result discarded, by taking the shortcut through
  the first switch()'s STATE_IORESP_READY case in hvmemul_do_io(). Note
  how, among all the re-issue sanity checks there, we avoid comparing
  the actual data.
- If the non I/O / MMIO side is being written, it is the OS's
  responsibility to avoid actually moving page contents to disk while
  there might still be a write access in flight - this is no different
  in behavior from bare hardware.
- Read-modify-write accesses are, as always, complicated, and while we
  deal with them better nowadays than we did in the past, we're still
  not quite there to guarantee hardware-like behavior in all cases
  anyway. Nothing is getting worse by the changes made here, afaict.

In __hvm_copy() also reduce p's scope and change its type to void *.

Signed-off-by: Jan Beulich <jbeulich@suse.com>
---
TBD: In principle the caching here renders unnecessary the one used for
     insn bytes (vio->mmio_insn{,_bytes}). However, to seed the cache
     with the data SVM may have made available, we'd have to also know
     the corresponding GPA. It's not safe, however, to re-walk the page
     tables to find out, as the page tables may have changed in the
     meantime. Therefore I guess we need to keep the duplicate
     functionality for now. A possible solution to this could be to use
     a physical-address-based cache for page table accesses (and looking
     forward also e.g. SVM/VMX insn emulation), and a linear-address-
     based one for all other reads.
---
v4: Re-write for cache to become transparent to callers.
v3: Add text about the actual data page to the description.
v2: Re-base.

Message ID	cd3d95e9-7305-539c-a6e3-babd226eaea4@suse.com (mailing list archive)
State	Superseded
Headers
Return-Path: <SRS0=nADe=3U=lists.xenproject.org=xen-devel-bounces@kernel.org>
DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 56363206F0
From: Jan Beulich <jbeulich@suse.com>
To: "xen-devel@lists.xenproject.org" <xen-devel@lists.xenproject.org>
References: <d9ac8ea4-9f2a-93d5-7656-48d93930ed2e@suse.com>
Message-ID: <cd3d95e9-7305-539c-a6e3-babd226eaea4@suse.com>
Date: Fri, 31 Jan 2020 17:44:23 +0100
User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:68.0) Gecko/20100101 Thunderbird/68.4.2
MIME-Version: 1.0
In-Reply-To: <d9ac8ea4-9f2a-93d5-7656-48d93930ed2e@suse.com>
Content-Language: en-US
Subject: [Xen-devel] [PATCH v4 4/7] x86/HVM: implement memory read caching for insn emulation
Precedence: list
Cc: Kevin Tian <kevin.tian@intel.com>, Wei Liu <wl@xen.org>, Paul Durrant <paul@xen.org>, George Dunlap <George.Dunlap@eu.citrix.com>, Andrew Cooper <andrew.cooper3@citrix.com>, Jun Nakajima <jun.nakajima@intel.com>, Roger Pau Monné <roger.pau@citrix.com>
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: base64
Errors-To: xen-devel-bounces@lists.xenproject.org
Sender: "Xen-devel" <xen-devel-bounces@lists.xenproject.org>
Series	x86/HVM: implement memory read caching
[v4,0/7] x86/HVM: implement memory read caching
[v4,1/7] SVM: drop asm/hvm/emulate.h inclusion from vmcb.h
[v4,2/7] x86/HVM: rename a variable in __hvm_copy()
[v4,3/7] x86/HVM: introduce "curr" into hvmemul_rep_{mov, sto}s()
[v4,4/7] x86/HVM: implement memory read caching for insn emulation
[v4,5/7] x86/mm: use cache in guest_walk_tables()
[v4,6/7] x86/mm: drop p2mt parameter from map_domain_gfn()
[v4,7/7] x86/HVM: reduce scope of pfec in hvm_emulate_init_per_insn()
