[v2,24/69] target/arm: Convert FMOV, FABS, FNEG (scalar) to decodetree

Message ID	20241210161733.1830573-25-richard.henderson@linaro.org (mailing list archive)
State	New
Headers	show Return-Path: <qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org> From: Richard Henderson <richard.henderson@linaro.org> To: qemu-devel@nongnu.org Cc: qemu-arm@nongnu.org Subject: [PATCH v2 24/69] target/arm: Convert FMOV, FABS, FNEG (scalar) to decodetree Date: Tue, 10 Dec 2024 10:16:48 -0600 Message-ID: <20241210161733.1830573-25-richard.henderson@linaro.org> In-Reply-To: <20241210161733.1830573-1-richard.henderson@linaro.org> References: <20241210161733.1830573-1-richard.henderson@linaro.org> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Received-SPF: pass client-ip=2a00:1450:4864:20::132; envelope-from=richard.henderson@linaro.org; helo=mail-lf1-x132.google.com X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action Precedence: list Errors-To: qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org Sender: qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org
Series	target/arm: AArch64 decodetree conversion, final part \| expand [v2,00/69] target/arm: AArch64 decodetree conversion, final part [v2,01/69] target/arm: Add section labels for "Data Processing (register)" [v2,02/69] target/arm: Convert UDIV, SDIV to decodetree [v2,03/69] target/arm: Convert LSLV, LSRV, ASRV, RORV to decodetree [v2,04/69] target/arm: Convert CRC32, CRC32C to decodetree [v2,05/69] target/arm: Convert SUBP, IRG, GMI to decodetree [v2,06/69] target/arm: Convert PACGA to decodetree [v2,07/69] target/arm: Convert RBIT, REV16, REV32, REV64 to decodetree [v2,08/69] target/arm: Convert CLZ, CLS to decodetree [v2,09/69] target/arm: Convert PAC[ID], AUT[ID] to decodetree [v2,10/69] target/arm: Convert XPAC[ID] to decodetree [v2,11/69] target/arm: Convert disas_logic_reg to decodetree [v2,12/69] target/arm: Convert disas_add_sub_ext_reg to decodetree [v2,13/69] target/arm: Convert disas_add_sub_reg to decodetree [v2,14/69] target/arm: Convert disas_data_proc_3src to decodetree [v2,15/69] target/arm: Convert disas_adc_sbc to decodetree [v2,16/69] target/arm: Convert RMIF to decodetree [v2,17/69] target/arm: Convert SETF8, SETF16 to decodetree [v2,18/69] target/arm: Convert CCMP, CCMN to decodetree [v2,19/69] target/arm: Convert disas_cond_select to decodetree [v2,20/69] target/arm: Introduce fp_access_check_scalar_hsd [v2,21/69] target/arm: Introduce fp_access_check_vector_hsd [v2,22/69] target/arm: Convert FCMP, FCMPE, FCCMP, FCCMPE to decodetree [v2,23/69] target/arm: Fix decode of fp16 vector fabs, fneg [v2,24/69] target/arm: Convert FMOV, FABS, FNEG (scalar) to decodetree [v2,25/69] target/arm: Pass fpstatus to vfp_sqrt* [v2,26/69] target/arm: Remove helper_sqrt_f16 [v2,27/69] target/arm: Convert FSQRT (scalar) to decodetree [v2,28/69] target/arm: Convert FRINT[NPMSAXI] (scalar) to decodetree [v2,29/69] target/arm: Convert BFCVT to decodetree [v2,30/69] target/arm: Convert FRINT{32, 64}[ZX] (scalar) to decodetree [v2,31/69] target/arm: Convert FCVT (scalar) to decodetree [v2,32/69] target/arm: Convert handle_fpfpcvt to decodetree [v2,33/69] target/arm: Convert FJCVTZS to decodetree [v2,34/69] target/arm: Convert handle_fmov to decodetree [v2,35/69] target/arm: Convert SQABS, SQNEG to decodetree [v2,36/69] target/arm: Convert ABS, NEG to decodetree [v2,37/69] target/arm: Introduce gen_gvec_cls, gen_gvec_clz [v2,38/69] target/arm: Convert CLS, CLZ (vector) to decodetree [v2,39/69] target/arm: Introduce gen_gvec_cnt, gen_gvec_rbit [v2,40/69] target/arm: Convert CNT, NOT, RBIT (vector) to decodetree [v2,41/69] target/arm: Convert CMGT, CMGE, GMLT, GMLE, CMEQ (zero) to decodetree [v2,42/69] target/arm: Introduce gen_gvec_rev{16,32,64} [v2,43/69] target/arm: Convert handle_rev to decodetree [v2,44/69] target/arm: Move helper_neon_addlp_{s8, s16} to neon_helper.c [v2,45/69] target/arm: Introduce gen_gvec_{s,u}{add,ada}lp [v2,46/69] target/arm: Convert handle_2misc_pairwise to decodetree [v2,47/69] target/arm: Remove helper_neon_{add,sub}l_u{16,32} [v2,48/69] target/arm: Introduce clear_vec [v2,49/69] target/arm: Convert XTN, SQXTUN, SQXTN, UQXTN to decodetree [v2,50/69] target/arm: Convert FCVTN, BFCVTN to decodetree [v2,51/69] target/arm: Convert FCVTXN to decodetree [v2,52/69] target/arm: Convert SHLL to decodetree [v2,53/69] target/arm: Implement gen_gvec_fabs, gen_gvec_fneg [v2,54/69] target/arm: Convert FABS, FNEG (vector) to decodetree [v2,55/69] target/arm: Convert FSQRT (vector) to decodetree [v2,56/69] target/arm: Convert FRINT* (vector) to decodetree [v2,57/69] target/arm: Convert FCVT* (vector, integer) scalar to decodetree [v2,58/69] target/arm: Convert FCVT* (vector, fixed-point) scalar to decodetree [v2,59/69] target/arm: Convert [US]CVTF (vector, integer) scalar to decodetree [v2,60/69] target/arm: Convert [US]CVTF (vector, fixed-point) scalar to decodetree [v2,61/69] target/arm: Rename helper_gvec_vcvt_[hf][su] with _rz [v2,62/69] target/arm: Convert [US]CVTF (vector) to decodetree [v2,63/69] target/arm: Convert FCVTZ[SU] (vector, fixed-point) to decodetree [v2,64/69] target/arm: Convert FCVT* (vector, integer) to decodetree [v2,65/69] target/arm: Convert handle_2misc_fcmp_zero to decodetree [v2,66/69] target/arm: Convert FRECPE, FRECPX, FRSQRTE to decodetree [v2,67/69] target/arm: Introduce gen_gvec_urecpe, gen_gvec_ursqrte [v2,68/69] target/arm: Convert URECPE and URSQRTE to decodetree [v2,69/69] target/arm: Convert FCVTL to decodetree

Message ID

20241210161733.1830573-25-richard.henderson@linaro.org (mailing list archive)

State

New

Headers

From: Richard Henderson <richard.henderson@linaro.org>
To: qemu-devel@nongnu.org
Cc: qemu-arm@nongnu.org
Subject: [PATCH v2 24/69] target/arm: Convert FMOV, FABS,
 FNEG (scalar) to decodetree
Date: Tue, 10 Dec 2024 10:16:48 -0600
Message-ID: <20241210161733.1830573-25-richard.henderson@linaro.org>
In-Reply-To: <20241210161733.1830573-1-richard.henderson@linaro.org>
References: <20241210161733.1830573-1-richard.henderson@linaro.org>
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
Received-SPF: pass client-ip=2a00:1450:4864:20::132;
 envelope-from=richard.henderson@linaro.org; helo=mail-lf1-x132.google.com
X-Spam_score_int: -20
X-Spam_score: -2.1
X-Spam_bar: --
X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1,
 DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1,
 RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001,
 SPF_PASS=-0.001 autolearn=ham autolearn_force=no
X-Spam_action: no action
X-BeenThere: qemu-devel@nongnu.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <qemu-devel.nongnu.org>
List-Unsubscribe: <https://lists.nongnu.org/mailman/options/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <https://lists.nongnu.org/archive/html/qemu-devel>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <https://lists.nongnu.org/mailman/listinfo/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=subscribe>
Errors-To: qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org
Sender: qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org

Series

target/arm: AArch64 decodetree conversion, final part | expand

Signed-off-by: Richard Henderson <richard.henderson@linaro.org> --- target/arm/tcg/translate-a64.c | 105 +++++++++++++++++++++++---------- target/arm/tcg/a64.decode | 7 +++ 2 files changed, 81 insertions(+), 31 deletions(-)

Comments

Peter Maydell Dec. 11, 2024, 3:50 p.m. UTC | #1

On Tue, 10 Dec 2024 at 16:23, Richard Henderson
<richard.henderson@linaro.org> wrote:
>
> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
> ---
>  target/arm/tcg/translate-a64.c | 105 +++++++++++++++++++++++----------
>  target/arm/tcg/a64.decode      |   7 +++
>  2 files changed, 81 insertions(+), 31 deletions(-)

Reviewed-by: Peter Maydell <peter.maydell@linaro.org>

thanks
-- PMM

diff --git a/target/arm/tcg/translate-a64.c b/target/arm/tcg/translate-a64.c
index f67360c4c5..383ee7f3f0 100644
--- a/target/arm/tcg/translate-a64.c
+++ b/target/arm/tcg/translate-a64.c
@@ -8283,6 +8283,67 @@  static bool trans_CSEL(DisasContext *s, arg_CSEL *a)
     return true;
 }
 
+typedef struct FPScalar1Int {
+    void (*gen_h)(TCGv_i32, TCGv_i32);
+    void (*gen_s)(TCGv_i32, TCGv_i32);
+    void (*gen_d)(TCGv_i64, TCGv_i64);
+} FPScalar1Int;
+
+static bool do_fp1_scalar_int(DisasContext *s, arg_rr_e *a,
+                              const FPScalar1Int *f)
+{
+    switch (a->esz) {
+    case MO_64:
+        if (fp_access_check(s)) {
+            TCGv_i64 t = read_fp_dreg(s, a->rn);
+            f->gen_d(t, t);
+            write_fp_dreg(s, a->rd, t);
+        }
+        break;
+    case MO_32:
+        if (fp_access_check(s)) {
+            TCGv_i32 t = read_fp_sreg(s, a->rn);
+            f->gen_s(t, t);
+            write_fp_sreg(s, a->rd, t);
+        }
+        break;
+    case MO_16:
+        if (!dc_isar_feature(aa64_fp16, s)) {
+            return false;
+        }
+        if (fp_access_check(s)) {
+            TCGv_i32 t = read_fp_hreg(s, a->rn);
+            f->gen_h(t, t);
+            write_fp_sreg(s, a->rd, t);
+        }
+        break;
+    default:
+        return false;
+    }
+    return true;
+}
+
+static const FPScalar1Int f_scalar_fmov = {
+    tcg_gen_mov_i32,
+    tcg_gen_mov_i32,
+    tcg_gen_mov_i64,
+};
+TRANS(FMOV_s, do_fp1_scalar_int, a, &f_scalar_fmov)
+
+static const FPScalar1Int f_scalar_fabs = {
+    gen_vfp_absh,
+    gen_vfp_abss,
+    gen_vfp_absd,
+};
+TRANS(FABS_s, do_fp1_scalar_int, a, &f_scalar_fabs)
+
+static const FPScalar1Int f_scalar_fneg = {
+    gen_vfp_negh,
+    gen_vfp_negs,
+    gen_vfp_negd,
+};
+TRANS(FNEG_s, do_fp1_scalar_int, a, &f_scalar_fneg)
+
 /* Floating-point data-processing (1 source) - half precision */
 static void handle_fp_1src_half(DisasContext *s, int opcode, int rd, int rn)
 {
@@ -8291,15 +8352,6 @@  static void handle_fp_1src_half(DisasContext *s, int opcode, int rd, int rn)
     TCGv_i32 tcg_res = tcg_temp_new_i32();
 
     switch (opcode) {
-    case 0x0: /* FMOV */
-        tcg_gen_mov_i32(tcg_res, tcg_op);
-        break;
-    case 0x1: /* FABS */
-        gen_vfp_absh(tcg_res, tcg_op);
-        break;
-    case 0x2: /* FNEG */
-        gen_vfp_negh(tcg_res, tcg_op);
-        break;
     case 0x3: /* FSQRT */
         fpst = fpstatus_ptr(FPST_FPCR_F16);
         gen_helper_sqrt_f16(tcg_res, tcg_op, fpst);
@@ -8327,6 +8379,9 @@  static void handle_fp_1src_half(DisasContext *s, int opcode, int rd, int rn)
         gen_helper_advsimd_rinth(tcg_res, tcg_op, fpst);
         break;
     default:
+    case 0x0: /* FMOV */
+    case 0x1: /* FABS */
+    case 0x2: /* FNEG */
         g_assert_not_reached();
     }
 
@@ -8345,15 +8400,6 @@  static void handle_fp_1src_single(DisasContext *s, int opcode, int rd, int rn)
     tcg_res = tcg_temp_new_i32();
 
     switch (opcode) {
-    case 0x0: /* FMOV */
-        tcg_gen_mov_i32(tcg_res, tcg_op);
-        goto done;
-    case 0x1: /* FABS */
-        gen_vfp_abss(tcg_res, tcg_op);
-        goto done;
-    case 0x2: /* FNEG */
-        gen_vfp_negs(tcg_res, tcg_op);
-        goto done;
     case 0x3: /* FSQRT */
         gen_helper_vfp_sqrts(tcg_res, tcg_op, tcg_env);
         goto done;
@@ -8389,6 +8435,9 @@  static void handle_fp_1src_single(DisasContext *s, int opcode, int rd, int rn)
         gen_fpst = gen_helper_frint64_s;
         break;
     default:
+    case 0x0: /* FMOV */
+    case 0x1: /* FABS */
+    case 0x2: /* FNEG */
         g_assert_not_reached();
     }
 
@@ -8413,22 +8462,10 @@  static void handle_fp_1src_double(DisasContext *s, int opcode, int rd, int rn)
     TCGv_ptr fpst;
     int rmode = -1;
 
-    switch (opcode) {
-    case 0x0: /* FMOV */
-        gen_gvec_fn2(s, false, rd, rn, tcg_gen_gvec_mov, 0);
-        return;
-    }
-
     tcg_op = read_fp_dreg(s, rn);
     tcg_res = tcg_temp_new_i64();
 
     switch (opcode) {
-    case 0x1: /* FABS */
-        gen_vfp_absd(tcg_res, tcg_op);
-        goto done;
-    case 0x2: /* FNEG */
-        gen_vfp_negd(tcg_res, tcg_op);
-        goto done;
     case 0x3: /* FSQRT */
         gen_helper_vfp_sqrtd(tcg_res, tcg_op, tcg_env);
         goto done;
@@ -8461,6 +8498,9 @@  static void handle_fp_1src_double(DisasContext *s, int opcode, int rd, int rn)
         gen_fpst = gen_helper_frint64_d;
         break;
     default:
+    case 0x0: /* FMOV */
+    case 0x1: /* FABS */
+    case 0x2: /* FNEG */
         g_assert_not_reached();
     }
 
@@ -8581,7 +8621,7 @@  static void disas_fp_1src(DisasContext *s, uint32_t insn)
             goto do_unallocated;
         }
         /* fall through */
-    case 0x0 ... 0x3:
+    case 0x3:
     case 0x8 ... 0xc:
     case 0xe ... 0xf:
         /* 32-to-32 and 64-to-64 ops */
@@ -8631,6 +8671,9 @@  static void disas_fp_1src(DisasContext *s, uint32_t insn)
 
     default:
     do_unallocated:
+    case 0x0: /* FMOV */
+    case 0x1: /* FABS */
+    case 0x2: /* FNEG */
         unallocated_encoding(s);
         break;
     }
diff --git a/target/arm/tcg/a64.decode b/target/arm/tcg/a64.decode
index 7868b1cb24..b9cc8963da 100644
--- a/target/arm/tcg/a64.decode
+++ b/target/arm/tcg/a64.decode
@@ -47,6 +47,7 @@ 
 @rr_h           ........ ... ..... ...... rn:5 rd:5     &rr_e esz=1
 @rr_d           ........ ... ..... ...... rn:5 rd:5     &rr_e esz=3
 @rr_sd          ........ ... ..... ...... rn:5 rd:5     &rr_e esz=%esz_sd
+@rr_hsd         ........ ... ..... ...... rn:5 rd:5     &rr_e esz=%esz_hsd
 
 @rrr_b          ........ ... rm:5 ...... rn:5 rd:5      &rrr_e esz=0
 @rrr_h          ........ ... rm:5 ...... rn:5 rd:5      &rrr_e esz=1
@@ -1321,6 +1322,12 @@  FMAXV_s         0110 1110 00 11000 01111 10 ..... .....     @rr_q1e2
 FMINV_h         0.00 1110 10 11000 01111 10 ..... .....     @qrr_h
 FMINV_s         0110 1110 10 11000 01111 10 ..... .....     @rr_q1e2
 
+# Floating-point data processing (1 source)
+
+FMOV_s          00011110 .. 1 000000 10000 ..... .....      @rr_hsd
+FABS_s          00011110 .. 1 000001 10000 ..... .....      @rr_hsd
+FNEG_s          00011110 .. 1 000010 10000 ..... .....      @rr_hsd
+
 # Floating-point Immediate
 
 FMOVI_s         0001 1110 .. 1 imm:8 100 00000 rd:5         esz=%esz_hsd

[v2,24/69] target/arm: Convert FMOV, FABS, FNEG (scalar) to decodetree

Commit Message

Comments

Patch