mbox series

[v3,0/3] arm64/sve: Trivial optimisation for 128 bit SVE vectors

Message ID 20210512151131.27877-1-broonie@kernel.org (mailing list archive)
Headers show
Series arm64/sve: Trivial optimisation for 128 bit SVE vectors | expand

Message

Mark Brown May 12, 2021, 3:11 p.m. UTC
This series is a combination of factoring out some duplicated code and a
very minor optimisation to the performance of handling converting FPSIMD
state to SVE in the live registers for 128 bit SVE vectors.

v3:
 - Tweak comment.
v2:
 - Combine P and FFR flushing into a single macro.

Mark Brown (3):
  arm64/sve: Split _sve_flush macro into separate Z and predicate
    flushes
  arm64/sve: Use the sve_flush macros in sve_load_from_fpsimd_state()
  arm64/sve: Skip flushing Z registers with 128 bit vectors

 arch/arm64/include/asm/fpsimd.h       |  2 +-
 arch/arm64/include/asm/fpsimdmacros.h |  4 +++-
 arch/arm64/kernel/entry-fpsimd.S      | 22 +++++++++++++++-------
 arch/arm64/kernel/fpsimd.c            |  6 ++++--
 4 files changed, 23 insertions(+), 11 deletions(-)


base-commit: 6efb943b8616ec53a5e444193dccf1af9ad627b5

Comments

Catalin Marinas May 14, 2021, 11:03 a.m. UTC | #1
On Wed, May 12, 2021 at 04:11:28PM +0100, Mark Brown wrote:
> This series is a combination of factoring out some duplicated code and a
> very minor optimisation to the performance of handling converting FPSIMD
> state to SVE in the live registers for 128 bit SVE vectors.
> 
> v3:
>  - Tweak comment.
> v2:
>  - Combine P and FFR flushing into a single macro.
> 
> Mark Brown (3):
>   arm64/sve: Split _sve_flush macro into separate Z and predicate
>     flushes
>   arm64/sve: Use the sve_flush macros in sve_load_from_fpsimd_state()
>   arm64/sve: Skip flushing Z registers with 128 bit vectors

I acked v2, hadn't noticed v3 was out. So here it is again:

Acked-by: Catalin Marinas <catalin.marinas@arm.com>
Will Deacon May 26, 2021, 10:15 p.m. UTC | #2
On Wed, 12 May 2021 16:11:28 +0100, Mark Brown wrote:
> This series is a combination of factoring out some duplicated code and a
> very minor optimisation to the performance of handling converting FPSIMD
> state to SVE in the live registers for 128 bit SVE vectors.
> 
> v3:
>  - Tweak comment.
> v2:
>  - Combine P and FFR flushing into a single macro.
> 
> [...]

Applied to arm64 (for-next/sve), thanks!

[1/3] arm64/sve: Split _sve_flush macro into separate Z and predicate flushes
      https://git.kernel.org/arm64/c/483dbf6a3590
[2/3] arm64/sve: Use the sve_flush macros in sve_load_from_fpsimd_state()
      https://git.kernel.org/arm64/c/c9f6890bca11
[3/3] arm64/sve: Skip flushing Z registers with 128 bit vectors
      https://git.kernel.org/arm64/c/ad4711f962e0

Cheers,