@@ -498,6 +498,7 @@ VRLDMI 000100 ..... ..... ..... 00011000101 @VX
 VRLWNM 000100 ..... ..... ..... 00110000101 @VX
 VRLDNM 000100 ..... ..... ..... 00111000101 @VX
 VRLQ   000100 ..... ..... ..... 00000000101 @VX
+VRLQNM 000100 ..... ..... ..... 00101000101 @VX
 
 ## Vector Integer Arithmetic Instructions
@@ -1055,28 +1055,83 @@ TRANS_FLAGS2(ISA310, VSLQ, do_vector_shift_quad, false, false);
 TRANS_FLAGS2(ISA310, VSRQ, do_vector_shift_quad, true, false);
 TRANS_FLAGS2(ISA310, VSRAQ, do_vector_shift_quad, true, true);
 
-static bool trans_VRLQ(DisasContext *ctx, arg_VX *a)
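+/*
+ * Set mh:ml to the 128-bit mask with 1-bits from MSB0 position b through
+ * position e, both inclusive; when b > e the mask wraps around the
+ * quadword. b and e must be in the range 0..127.
+ */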
+static void do_vrlq_mask(TCGv_i64 mh, TCGv_i64 ml, TCGv_i64 b, TCGv_i64 e)
 {
-    TCGv_i64 ah, al, n, t0, t1, zero = tcg_constant_i64(0);
+    TCGv_i64 th, tl, t0, t1, zero = tcg_constant_i64(0),
+             ones = tcg_constant_i64(-1);
+
+    th = tcg_temp_new_i64();
+    tl = tcg_temp_new_i64();
+    t0 = tcg_temp_new_i64();
+    t1 = tcg_temp_new_i64();
+
+    /* m = ~0 >> b, i.e. mh:ml = the 128-bit value ~0 >> b, 0 <= b <= 127 */
+    tcg_gen_andi_i64(t0, b, 64);
+    /* t1 = source of the high doubleword: 0 if b >= 64, ~0 otherwise */
+    tcg_gen_movcond_i64(TCG_COND_NE, t1, t0, zero, zero, ones);
+    tcg_gen_andi_i64(t0, b, 0x3F);
+    tcg_gen_shr_i64(mh, t1, t0);
+    tcg_gen_shr_i64(ml, ones, t0);
+    /* ml |= t1 << (64 - b % 64), split in two shifts as the count may be 64 */
+    tcg_gen_xori_i64(t0, t0, 63);
+    tcg_gen_shl_i64(t1, t1, t0);
+    tcg_gen_shli_i64(t1, t1, 1);
+    tcg_gen_or_i64(ml, t1, ml);
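+    /*
+     * E.g., b = 70 gives t1 = 0, so mh = 0 and ml = ~0 >> 6, together the
+     * 128-bit ~0 >> 70; b = 8 gives t1 = ~0, so mh = ~0 >> 8 and
+     * ml = (~0 >> 8) | (~0 << 56) = ~0.
+     */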
+
+    /* t = ~0 >> e, built exactly as the mask above */
+    tcg_gen_andi_i64(t0, e, 64);
+    tcg_gen_movcond_i64(TCG_COND_NE, t1, t0, zero, zero, ones);
+    tcg_gen_andi_i64(t0, e, 0x3F);
+    tcg_gen_shr_i64(th, t1, t0);
+    tcg_gen_shr_i64(tl, ones, t0);
+    tcg_gen_xori_i64(t0, t0, 63);
+    tcg_gen_shl_i64(t1, t1, t0);
+    tcg_gen_shli_i64(t1, t1, 1);
+    tcg_gen_or_i64(tl, t1, tl);
+
+    /* t >>= 1, so that th:tl = ~0 >> (e + 1) */
+    tcg_gen_shli_i64(t0, th, 63);
+    tcg_gen_shri_i64(tl, tl, 1);
+    tcg_gen_shri_i64(th, th, 1);
+    tcg_gen_or_i64(tl, t0, tl);
+
+    /* m ^= t, leaving 1-bits exactly in positions b..e */
+    tcg_gen_xor_i64(mh, mh, th);
+    tcg_gen_xor_i64(ml, ml, tl);
+
+    /* Negate the mask if begin > end, making it wrap around the quadword */
+    tcg_gen_movcond_i64(TCG_COND_GT, t0, b, e, ones, zero);
+
+    tcg_gen_xor_i64(mh, mh, t0);
+    tcg_gen_xor_i64(ml, ml, t0);
+
+    tcg_temp_free_i64(th);
+    tcg_temp_free_i64(tl);
+    tcg_temp_free_i64(t0);
+    tcg_temp_free_i64(t1);
+}
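+/*
+ * Reference model of the computation above, as a plain C sketch assuming a
+ * compiler with __uint128_t (illustration only, not part of this patch):
+ *
+ *     m = ~(__uint128_t)0 >> b;
+ *     t = ~(__uint128_t)0 >> e >> 1;
+ *     mask = (b > e) ? ~(m ^ t) : (m ^ t);
+ */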
+
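+/*
+ * Rotate the 128-bit value in VRA left by the amount in VRB bits 57:63.
+ * When mask is true (vrlqnm), also AND the rotated value with the MB..ME
+ * mask taken from VRB.
+ */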
+static bool do_vector_rotl_quad(DisasContext *ctx, arg_VX *a, bool mask)
+{
+    TCGv_i64 ah, al, vrb, n, t0, t1, zero = tcg_constant_i64(0);
 
     REQUIRE_VECTOR(ctx);
     REQUIRE_INSNS_FLAGS2(ctx, ISA310);
 
     ah = tcg_temp_new_i64();
     al = tcg_temp_new_i64();
+    vrb = tcg_temp_new_i64();
     n = tcg_temp_new_i64();
     t0 = tcg_temp_new_i64();
     t1 = tcg_temp_new_i64();
 
     get_avr64(ah, a->vra, true);
     get_avr64(al, a->vra, false);
-    get_avr64(n, a->vrb, true);
+    get_avr64(vrb, a->vrb, true);
 
+    /* For amounts >= 64, rotate by 64 first by swapping the doublewords */
     tcg_gen_mov_i64(t0, ah);
-    tcg_gen_andi_i64(t1, n, 64);
+    tcg_gen_andi_i64(t1, vrb, 64);
     tcg_gen_movcond_i64(TCG_COND_NE, ah, t1, zero, al, ah);
     tcg_gen_movcond_i64(TCG_COND_NE, al, t1, zero, t0, al);
-    tcg_gen_andi_i64(n, n, 0x3F);
+    tcg_gen_andi_i64(n, vrb, 0x3F);
 
+    /* Then rotate by the remaining amount (mod 64) across both doublewords */
     tcg_gen_shl_i64(t0, ah, n);
     tcg_gen_shl_i64(t1, al, n);
@@ -1091,11 +1146,24 @@ static bool trans_VRLQ(DisasContext *ctx, arg_VX *a)
     tcg_gen_shri_i64(ah, ah, 1);
     tcg_gen_or_i64(t1, ah, t1);
 
+    if (mask) {
+        /* Extract the mask begin (VRB bits 41:47) and end (VRB bits 49:55) */
+        tcg_gen_shri_i64(n, vrb, 8);
+        tcg_gen_shri_i64(vrb, vrb, 16);
+        tcg_gen_andi_i64(n, n, 0x7f);
+        tcg_gen_andi_i64(vrb, vrb, 0x7f);
+
+        do_vrlq_mask(ah, al, vrb, n);
+
+        /* AND the rotated value with the MB..ME mask */
+        tcg_gen_and_i64(t0, t0, ah);
+        tcg_gen_and_i64(t1, t1, al);
+    }
+
     set_avr64(a->vrt, t0, true);
     set_avr64(a->vrt, t1, false);
 
     tcg_temp_free_i64(ah);
     tcg_temp_free_i64(al);
+    tcg_temp_free_i64(vrb);
     tcg_temp_free_i64(n);
     tcg_temp_free_i64(t0);
     tcg_temp_free_i64(t1);
@@ -1103,6 +1171,9 @@ static bool trans_VRLQ(DisasContext *ctx, arg_VX *a)
     return true;
 }
 
+TRANS(VRLQ, do_vector_rotl_quad, false)
+TRANS(VRLQNM, do_vector_rotl_quad, true)
+
 #define GEN_VXFORM_SAT(NAME, VECE, NORM, SAT, OPC2, OPC3)               \
 static void glue(glue(gen_, NAME), _vec)(unsigned vece, TCGv_vec t,     \
                                          TCGv_vec sat, TCGv_vec a,      \