[v2,20/34] target/arm: Use float*_maybe_ah_chs in sve_fcadd_*

Message ID	20250129013857.135256-21-richard.henderson@linaro.org (mailing list archive)
State	New

Headers

From: Richard Henderson <richard.henderson@linaro.org>
To: qemu-devel@nongnu.org
Cc: peter.maydell@linaro.org
Subject: [PATCH v2 20/34] target/arm: Use float*_maybe_ah_chs in sve_fcadd_*
Date: Tue, 28 Jan 2025 17:38:43 -0800
Message-ID: <20250129013857.135256-21-richard.henderson@linaro.org>
In-Reply-To: <20250129013857.135256-1-richard.henderson@linaro.org>
References: <20250129013857.135256-1-richard.henderson@linaro.org>

Series

target/arm: FEAT_AFP followups for FEAT_SME2

Commit Message

Richard Henderson Jan. 29, 2025, 1:38 a.m. UTC

The construction of neg_imag and neg_real was done to make it easy
to apply both in parallel with two simple logical operations.  This
changed with FPCR.AH, which must leave the sign of a NaN input
unchanged and so needs more than an unconditional XOR.

Note that there was a naming issue with neg_imag and neg_real.
They were named backward, with neg_imag being non-zero for rot=1,
and vice versa.  This was combined with reversed usage within the
loop, so that the negation in the end turned out correct.

Using the rot variable introduced with fpcr_ah, it's easier to
match the pseudocode for the instruction.

Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
---
 target/arm/tcg/sve_helper.c | 33 ++++++++++++---------------------
 1 file changed, 12 insertions(+), 21 deletions(-)
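
For readers without patch 18/34 at hand, here is a minimal sketch of what
the float*_maybe_ah_chs helpers are assumed to do, inferred from the
open-coded checks that this patch removes.  It works on raw uint16_t bit
patterns rather than QEMU's float16 type, and the names f16_is_any_nan,
f16_maybe_ah_chs and old_style_chs are hypothetical stand-ins, not QEMU
functions.

```c
#include <stdbool.h>
#include <stdint.h>

/* IEEE half precision: sign bit 15, exponent bits 14..10, fraction 9..0. */
#define F16_SIGN 0x8000u

/* A NaN has an all-ones exponent and a non-zero fraction. */
static bool f16_is_any_nan(uint16_t a)
{
    return (a & 0x7fffu) > 0x7c00u;
}

/*
 * Assumed semantics of the helper: flip the sign bit, except that
 * with FPCR.AH set a NaN input is returned unchanged.
 */
static uint16_t f16_maybe_ah_chs(uint16_t a, bool fpcr_ah)
{
    if (fpcr_ah && f16_is_any_nan(a)) {
        return a;
    }
    return (uint16_t)(a ^ F16_SIGN);
}

/*
 * Shape of the removed open-coded form: mask is either the sign bit
 * or zero, and the XOR is skipped for NaNs when FPCR.AH is set.
 */
static uint16_t old_style_chs(uint16_t a, uint16_t mask, bool fpcr_ah)
{
    if (mask && !(fpcr_ah && f16_is_any_nan(a))) {
        a ^= mask;
    }
    return a;
}
```

With mask equal to the sign bit, old_style_chs and f16_maybe_ah_chs agree
for every input, which is why the two masks can be replaced by a single
branch on rot.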

diff --git a/target/arm/tcg/sve_helper.c b/target/arm/tcg/sve_helper.c
index a2ff3b7f11..a1f7743221 100644
--- a/target/arm/tcg/sve_helper.c
+++ b/target/arm/tcg/sve_helper.c
@@ -5226,8 +5226,6 @@  void HELPER(sve_fcadd_h)(void *vd, void *vn, void *vm, void *vg,
     uint64_t *g = vg;
     bool rot = extract32(desc, SIMD_DATA_SHIFT, 1);
     bool fpcr_ah = extract32(desc, SIMD_DATA_SHIFT + 1, 1);
-    float16 neg_imag = float16_set_sign(0, rot);
-    float16 neg_real = float16_chs(neg_imag);
 
     do {
         uint64_t pg = g[(i - 1) >> 6];
@@ -5243,11 +5241,10 @@  void HELPER(sve_fcadd_h)(void *vd, void *vn, void *vm, void *vg,
             e2 = *(float16 *)(vn + H1_2(j));
             e3 = *(float16 *)(vm + H1_2(i));
 
-            if (neg_real && !(fpcr_ah && float16_is_any_nan(e1))) {
-                e1 ^= neg_real;
-            }
-            if (neg_imag && !(fpcr_ah && float16_is_any_nan(e3))) {
-                e3 ^= neg_imag;
+            if (rot) {
+                e3 = float16_maybe_ah_chs(e3, fpcr_ah);
+            } else {
+                e1 = float16_maybe_ah_chs(e1, fpcr_ah);
             }
 
             if (likely((pg >> (i & 63)) & 1)) {
@@ -5267,8 +5264,6 @@  void HELPER(sve_fcadd_s)(void *vd, void *vn, void *vm, void *vg,
     uint64_t *g = vg;
     bool rot = extract32(desc, SIMD_DATA_SHIFT, 1);
     bool fpcr_ah = extract32(desc, SIMD_DATA_SHIFT + 1, 1);
-    float32 neg_imag = float32_set_sign(0, rot);
-    float32 neg_real = float32_chs(neg_imag);
 
     do {
         uint64_t pg = g[(i - 1) >> 6];
@@ -5284,11 +5279,10 @@  void HELPER(sve_fcadd_s)(void *vd, void *vn, void *vm, void *vg,
             e2 = *(float32 *)(vn + H1_2(j));
             e3 = *(float32 *)(vm + H1_2(i));
 
-            if (neg_real && !(fpcr_ah && float32_is_any_nan(e1))) {
-                e1 ^= neg_real;
-            }
-            if (neg_imag && !(fpcr_ah && float32_is_any_nan(e3))) {
-                e3 ^= neg_imag;
+            if (rot) {
+                e3 = float32_maybe_ah_chs(e3, fpcr_ah);
+            } else {
+                e1 = float32_maybe_ah_chs(e1, fpcr_ah);
             }
 
             if (likely((pg >> (i & 63)) & 1)) {
@@ -5308,8 +5302,6 @@  void HELPER(sve_fcadd_d)(void *vd, void *vn, void *vm, void *vg,
     uint64_t *g = vg;
     bool rot = extract32(desc, SIMD_DATA_SHIFT, 1);
     bool fpcr_ah = extract32(desc, SIMD_DATA_SHIFT + 1, 1);
-    float64 neg_imag = float64_set_sign(0, rot);
-    float64 neg_real = float64_chs(neg_imag);
 
     do {
         uint64_t pg = g[(i - 1) >> 6];
@@ -5325,11 +5317,10 @@  void HELPER(sve_fcadd_d)(void *vd, void *vn, void *vm, void *vg,
             e2 = *(float64 *)(vn + H1_2(j));
             e3 = *(float64 *)(vm + H1_2(i));
 
-            if (neg_real && !(fpcr_ah && float64_is_any_nan(e1))) {
-                e1 ^= neg_real;
-            }
-            if (neg_imag && !(fpcr_ah && float64_is_any_nan(e3))) {
-                e3 ^= neg_imag;
+            if (rot) {
+                e3 = float64_maybe_ah_chs(e3, fpcr_ah);
+            } else {
+                e1 = float64_maybe_ah_chs(e1, fpcr_ah);
             }
 
             if (likely((pg >> (i & 63)) & 1)) {
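
To make the rot branches above easier to check against the instruction,
here is a small reference sketch of FCADD on one complex pair.  It uses
plain float and ignores predication and the FPCR.AH NaN rule.  The cplx
type and fcadd_ref are hypothetical, and the sketch assumes that e0 and e1
are loaded from the real lane of vn and the imaginary lane of vm, and that
the helper then stores e0 + e1 to the real lane and e2 + e3 to the
imaginary lane; the hunk context does not show those lines.

```c
#include <stdbool.h>

/* Hypothetical helper type: one complex element (real, imaginary). */
typedef struct {
    float re;
    float im;
} cplx;

/*
 * rot == false: add m rotated by 90 degrees  (negate m.im).
 * rot == true:  add m rotated by 270 degrees (negate m.re).
 */
static cplx fcadd_ref(cplx n, cplx m, bool rot)
{
    float e0 = n.re;   /* real lane of n (assumed load) */
    float e1 = m.im;   /* imaginary lane of m (assumed load) */
    float e2 = n.im;   /* imaginary lane of n */
    float e3 = m.re;   /* real lane of m */
    cplx d;

    if (rot) {
        e3 = -e3;
    } else {
        e1 = -e1;
    }

    d.re = e0 + e1;
    d.im = e2 + e3;
    return d;
}
```

For n = 1 + 2i and m = 3 + 4i this gives -3 + 5i for rot = 0 (n + i*m) and
5 - 1i for rot = 1 (n - i*m), matching the operand that each branch of the
patch negates.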
