[v4,20/47] target/ppc: implement vslq

Message ID	20220222143646.1268606-21-matheus.ferst@eldorado.org.br (mailing list archive)
State	New, archived
Headers	show Return-Path: <qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org> From: matheus.ferst@eldorado.org.br To: qemu-devel@nongnu.org, qemu-ppc@nongnu.org Subject: [PATCH v4 20/47] target/ppc: implement vslq Date: Tue, 22 Feb 2022 11:36:18 -0300 Message-Id: <20220222143646.1268606-21-matheus.ferst@eldorado.org.br> In-Reply-To: <20220222143646.1268606-1-matheus.ferst@eldorado.org.br> References: <20220222143646.1268606-1-matheus.ferst@eldorado.org.br> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Received-SPF: pass client-ip=187.72.171.209; envelope-from=matheus.ferst@eldorado.org.br; helo=outlook.eldorado.org.br X-Spam_score_int: -4 X-Spam_score: -0.5 X-Spam_bar: / X-Spam_report: (-0.5 / 5.0 requ) BAYES_00=-1.9, PDS_HP_HELO_NORDNS=0.659, RDNS_NONE=0.793, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01 autolearn=no autolearn_force=no X-Spam_action: no action Precedence: list Cc: danielhb413@gmail.com, richard.henderson@linaro.org, groug@kaod.org, clg@kaod.org, Matheus Ferst <matheus.ferst@eldorado.org.br>, david@gibson.dropbear.id.au Errors-To: qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org Sender: "Qemu-devel" <qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org>
Series	target/ppc: PowerISA Vector/VSX instruction batch \| expand [v4,00/47] target/ppc: PowerISA Vector/VSX instruction batch [v4,01/47] target/ppc: Introduce TRANSFLAGS macros [v4,02/47] target/ppc: moved vector even and odd multiplication to decodetree [v4,03/47] target/ppc: Moved vector multiply high and low to decodetree [v4,04/47] target/ppc: vmulh instructions without helpers [v4,05/47] target/ppc: Implement vmsumcud instruction [v4,06/47] target/ppc: Implement vmsumudm instruction [v4,07/47] target/ppc: Move vexts[bhw]2[wd] to decodetree [v4,08/47] target/ppc: Implement vextsd2q [v4,09/47] target/ppc: Move Vector Compare Equal/Not Equal/Greater Than to decodetree [v4,10/47] target/ppc: Move Vector Compare Not Equal or Zero to decodetree [v4,11/47] target/ppc: Implement Vector Compare Equal Quadword [v4,12/47] target/ppc: Implement Vector Compare Greater Than Quadword [v4,13/47] target/ppc: Implement Vector Compare Quadword [v4,14/47] target/ppc: implement vstri[bh][lr] [v4,15/47] target/ppc: implement vclrlb [v4,16/47] target/ppc: implement vclrrb [v4,17/47] target/ppc: implement vcntmb[bhwd] [v4,18/47] target/ppc: implement vgnb [v4,19/47] target/ppc: move vs[lr][a][bhwd] to decodetree [v4,20/47] target/ppc: implement vslq [v4,21/47] target/ppc: implement vsrq [v4,22/47] target/ppc: implement vsraq [v4,23/47] target/ppc: move vrl[bhwd] to decodetree [v4,24/47] target/ppc: move vrl[bhwd]nm/vrl[bhwd]mi to decodetree [v4,25/47] target/ppc: implement vrlq [v4,26/47] target/ppc: Move vsel and vperm/vpermr to decodetree [v4,27/47] target/ppc: Move xxsel to decodetree [v4,28/47] target/ppc: move xxperm/xxpermr to decodetree [v4,29/47] target/ppc: Move xxpermdi to decodetree [v4,30/47] target/ppc: Implement xxpermx instruction [v4,31/47] tcg/tcg-op-gvec.c: Introduce tcg_gen_gvec_4i [v4,32/47] target/ppc: Implement xxeval [v4,33/47] target/ppc: Implement xxgenpcv[bhwd]m instruction [v4,34/47] target/ppc: move xs[n]madd[am][ds]p/xs[n]msub[am][ds]p to decodetree [v4,35/47] target/ppc: implement xs[n]maddqp[o]/xs[n]msubqp[o] [v4,36/47] target/ppc: Implement xvtlsbb instruction [v4,37/47] target/ppc: Remove xscmpnedp instruction [v4,38/47] target/ppc: Refactor VSX_SCALAR_CMP_DP [v4,39/47] target/ppc: Implement xscmp{eq,ge,gt}qp [v4,40/47] target/ppc: Move xscmp{eq,ge,gt}dp to decodetree [v4,41/47] target/ppc: Move xs{max, min}[cj]dp to use do_helper_XX3 [v4,42/47] target/ppc: Refactor VSX_MAX_MINC helper [v4,43/47] target/ppc: Implement xs{max,min}cqp [v4,44/47] target/ppc: Implement xvcvbf16spn and xvcvspbf16 instructions [v4,45/47] target/ppc: implement plxsd/pstxsd [v4,46/47] target/ppc: implement plxssp/pstxssp [v4,47/47] target/ppc: implement lxvr[bhwd]/stxvr[bhwd]x

Message ID

20220222143646.1268606-21-matheus.ferst@eldorado.org.br (mailing list archive)

State

New, archived

Headers

From: matheus.ferst@eldorado.org.br
To: qemu-devel@nongnu.org,
	qemu-ppc@nongnu.org
Subject: [PATCH v4 20/47] target/ppc: implement vslq
Date: Tue, 22 Feb 2022 11:36:18 -0300
Message-Id: <20220222143646.1268606-21-matheus.ferst@eldorado.org.br>
In-Reply-To: <20220222143646.1268606-1-matheus.ferst@eldorado.org.br>
References: <20220222143646.1268606-1-matheus.ferst@eldorado.org.br>
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
Received-SPF: pass client-ip=187.72.171.209;
 envelope-from=matheus.ferst@eldorado.org.br; helo=outlook.eldorado.org.br
X-Spam_score_int: -4
X-Spam_score: -0.5
X-Spam_bar: /
X-Spam_report: (-0.5 / 5.0 requ) BAYES_00=-1.9, PDS_HP_HELO_NORDNS=0.659,
 RDNS_NONE=0.793, SPF_HELO_NONE=0.001, SPF_PASS=-0.001,
 T_SCC_BODY_TEXT_LINE=-0.01 autolearn=no autolearn_force=no
X-Spam_action: no action
X-BeenThere: qemu-devel@nongnu.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <qemu-devel.nongnu.org>
List-Unsubscribe: <https://lists.nongnu.org/mailman/options/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <https://lists.nongnu.org/archive/html/qemu-devel>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <https://lists.nongnu.org/mailman/listinfo/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=subscribe>
Cc: danielhb413@gmail.com, richard.henderson@linaro.org, groug@kaod.org,
 clg@kaod.org, Matheus Ferst <matheus.ferst@eldorado.org.br>,
 david@gibson.dropbear.id.au
Errors-To: qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org
Sender: "Qemu-devel"
 <qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org>

Series

target/ppc: PowerISA Vector/VSX instruction batch | expand

Commit Message

Matheus K. Ferst Feb. 22, 2022, 2:36 p.m. UTC

From: Matheus Ferst <matheus.ferst@eldorado.org.br>

Signed-off-by: Matheus Ferst <matheus.ferst@eldorado.org.br>
---
v4:
 -  New in v4.
---
 target/ppc/insn32.decode            |  1 +
 target/ppc/translate/vmx-impl.c.inc | 40 +++++++++++++++++++++++++++++
 2 files changed, 41 insertions(+)

Comments

Richard Henderson Feb. 22, 2022, 10:14 p.m. UTC | #1

On 2/22/22 04:36, matheus.ferst@eldorado.org.br wrote:
> From: Matheus Ferst <matheus.ferst@eldorado.org.br>
> 
> Signed-off-by: Matheus Ferst <matheus.ferst@eldorado.org.br>
> ---
> v4:
>   -  New in v4.
> ---
>   target/ppc/insn32.decode            |  1 +
>   target/ppc/translate/vmx-impl.c.inc | 40 +++++++++++++++++++++++++++++
>   2 files changed, 41 insertions(+)
> 
> diff --git a/target/ppc/insn32.decode b/target/ppc/insn32.decode
> index 88baebe35e..3799065508 100644
> --- a/target/ppc/insn32.decode
> +++ b/target/ppc/insn32.decode
> @@ -473,6 +473,7 @@ VSLB            000100 ..... ..... ..... 00100000100    @VX
>   VSLH            000100 ..... ..... ..... 00101000100    @VX
>   VSLW            000100 ..... ..... ..... 00110000100    @VX
>   VSLD            000100 ..... ..... ..... 10111000100    @VX
> +VSLQ            000100 ..... ..... ..... 00100000101    @VX
>   
>   VSRB            000100 ..... ..... ..... 01000000100    @VX
>   VSRH            000100 ..... ..... ..... 01001000100    @VX
> diff --git a/target/ppc/translate/vmx-impl.c.inc b/target/ppc/translate/vmx-impl.c.inc
> index ec4f0e7654..ca98a545ef 100644
> --- a/target/ppc/translate/vmx-impl.c.inc
> +++ b/target/ppc/translate/vmx-impl.c.inc
> @@ -834,6 +834,46 @@ TRANS_FLAGS(ALTIVEC, VSRAH, do_vector_gvec3_VX, MO_16, tcg_gen_gvec_sarv);
>   TRANS_FLAGS(ALTIVEC, VSRAW, do_vector_gvec3_VX, MO_32, tcg_gen_gvec_sarv);
>   TRANS_FLAGS2(ALTIVEC_207, VSRAD, do_vector_gvec3_VX, MO_64, tcg_gen_gvec_sarv);
>   
> +static bool trans_VSLQ(DisasContext *ctx, arg_VX *a)
> +{
> +    TCGv_i64 hi, lo, tmp, n, sf = tcg_constant_i64(64);
> +
> +    REQUIRE_INSNS_FLAGS2(ctx, ISA310);
> +    REQUIRE_VECTOR(ctx);
> +
> +    n = tcg_temp_new_i64();
> +    hi = tcg_temp_new_i64();
> +    lo = tcg_temp_new_i64();
> +    tmp = tcg_const_i64(0);
> +
> +    get_avr64(lo, a->vra, false);
> +    get_avr64(hi, a->vra, true);
> +
> +    get_avr64(n, a->vrb, true);
> +    tcg_gen_andi_i64(n, n, 0x7F);
> +
> +    tcg_gen_movcond_i64(TCG_COND_GE, hi, n, sf, lo, hi);
> +    tcg_gen_movcond_i64(TCG_COND_GE, lo, n, sf, tmp, lo);

Since you have to mask twice anyway, better use (n & 64) != 0.

Otherwise,
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>


r~

Matheus K. Ferst Feb. 23, 2022, 9:53 p.m. UTC | #2

On 22/02/2022 19:14, Richard Henderson wrote:
> On 2/22/22 04:36, matheus.ferst@eldorado.org.br wrote:
>> From: Matheus Ferst <matheus.ferst@eldorado.org.br>
>>
>> Signed-off-by: Matheus Ferst <matheus.ferst@eldorado.org.br>
>> ---
>> v4:
>>   -  New in v4.
>> ---
>>   target/ppc/insn32.decode            |  1 +
>>   target/ppc/translate/vmx-impl.c.inc | 40 +++++++++++++++++++++++++++++
>>   2 files changed, 41 insertions(+)
>>
>> diff --git a/target/ppc/insn32.decode b/target/ppc/insn32.decode
>> index 88baebe35e..3799065508 100644
>> --- a/target/ppc/insn32.decode
>> +++ b/target/ppc/insn32.decode
>> @@ -473,6 +473,7 @@ VSLB            000100 ..... ..... ..... 
>> 00100000100    @VX
>>   VSLH            000100 ..... ..... ..... 00101000100    @VX
>>   VSLW            000100 ..... ..... ..... 00110000100    @VX
>>   VSLD            000100 ..... ..... ..... 10111000100    @VX
>> +VSLQ            000100 ..... ..... ..... 00100000101    @VX
>>
>>   VSRB            000100 ..... ..... ..... 01000000100    @VX
>>   VSRH            000100 ..... ..... ..... 01001000100    @VX
>> diff --git a/target/ppc/translate/vmx-impl.c.inc 
>> b/target/ppc/translate/vmx-impl.c.inc
>> index ec4f0e7654..ca98a545ef 100644
>> --- a/target/ppc/translate/vmx-impl.c.inc
>> +++ b/target/ppc/translate/vmx-impl.c.inc
>> @@ -834,6 +834,46 @@ TRANS_FLAGS(ALTIVEC, VSRAH, do_vector_gvec3_VX, 
>> MO_16, tcg_gen_gvec_sarv);
>>   TRANS_FLAGS(ALTIVEC, VSRAW, do_vector_gvec3_VX, MO_32, 
>> tcg_gen_gvec_sarv);
>>   TRANS_FLAGS2(ALTIVEC_207, VSRAD, do_vector_gvec3_VX, MO_64, 
>> tcg_gen_gvec_sarv);
>>
>> +static bool trans_VSLQ(DisasContext *ctx, arg_VX *a)
>> +{
>> +    TCGv_i64 hi, lo, tmp, n, sf = tcg_constant_i64(64);
>> +
>> +    REQUIRE_INSNS_FLAGS2(ctx, ISA310);
>> +    REQUIRE_VECTOR(ctx);
>> +
>> +    n = tcg_temp_new_i64();
>> +    hi = tcg_temp_new_i64();
>> +    lo = tcg_temp_new_i64();
>> +    tmp = tcg_const_i64(0);
>> +
>> +    get_avr64(lo, a->vra, false);
>> +    get_avr64(hi, a->vra, true);
>> +
>> +    get_avr64(n, a->vrb, true);
>> +    tcg_gen_andi_i64(n, n, 0x7F);
>> +
>> +    tcg_gen_movcond_i64(TCG_COND_GE, hi, n, sf, lo, hi);
>> +    tcg_gen_movcond_i64(TCG_COND_GE, lo, n, sf, tmp, lo);
> 
> Since you have to mask twice anyway, better use (n & 64) != 0.
> 

Hmm, I'm not sure if I understood. To check != 0 we'll need a temp to 
hold n&64. We could use tmp here, but we'll need another one in patch 
22. Is that right?

Thanks,
Matheus K. Ferst
Instituto de Pesquisas ELDORADO <http://www.eldorado.org.br/>
Analista de Software
Aviso Legal - Disclaimer <https://www.eldorado.org.br/disclaimer.html>

Richard Henderson Feb. 23, 2022, 10:12 p.m. UTC | #3

On 2/23/22 11:53, Matheus K. Ferst wrote:
> On 22/02/2022 19:14, Richard Henderson wrote:
>> On 2/22/22 04:36, matheus.ferst@eldorado.org.br wrote:
>>> From: Matheus Ferst <matheus.ferst@eldorado.org.br>
>>>
>>> Signed-off-by: Matheus Ferst <matheus.ferst@eldorado.org.br>
>>> ---
>>> v4:
>>>   -  New in v4.
>>> ---
>>>   target/ppc/insn32.decode            |  1 +
>>>   target/ppc/translate/vmx-impl.c.inc | 40 +++++++++++++++++++++++++++++
>>>   2 files changed, 41 insertions(+)
>>>
>>> diff --git a/target/ppc/insn32.decode b/target/ppc/insn32.decode
>>> index 88baebe35e..3799065508 100644
>>> --- a/target/ppc/insn32.decode
>>> +++ b/target/ppc/insn32.decode
>>> @@ -473,6 +473,7 @@ VSLB            000100 ..... ..... ..... 00100000100    @VX
>>>   VSLH            000100 ..... ..... ..... 00101000100    @VX
>>>   VSLW            000100 ..... ..... ..... 00110000100    @VX
>>>   VSLD            000100 ..... ..... ..... 10111000100    @VX
>>> +VSLQ            000100 ..... ..... ..... 00100000101    @VX
>>>
>>>   VSRB            000100 ..... ..... ..... 01000000100    @VX
>>>   VSRH            000100 ..... ..... ..... 01001000100    @VX
>>> diff --git a/target/ppc/translate/vmx-impl.c.inc b/target/ppc/translate/vmx-impl.c.inc
>>> index ec4f0e7654..ca98a545ef 100644
>>> --- a/target/ppc/translate/vmx-impl.c.inc
>>> +++ b/target/ppc/translate/vmx-impl.c.inc
>>> @@ -834,6 +834,46 @@ TRANS_FLAGS(ALTIVEC, VSRAH, do_vector_gvec3_VX, MO_16, 
>>> tcg_gen_gvec_sarv);
>>>   TRANS_FLAGS(ALTIVEC, VSRAW, do_vector_gvec3_VX, MO_32, tcg_gen_gvec_sarv);
>>>   TRANS_FLAGS2(ALTIVEC_207, VSRAD, do_vector_gvec3_VX, MO_64, tcg_gen_gvec_sarv);
>>>
>>> +static bool trans_VSLQ(DisasContext *ctx, arg_VX *a)
>>> +{
>>> +    TCGv_i64 hi, lo, tmp, n, sf = tcg_constant_i64(64);
>>> +
>>> +    REQUIRE_INSNS_FLAGS2(ctx, ISA310);
>>> +    REQUIRE_VECTOR(ctx);
>>> +
>>> +    n = tcg_temp_new_i64();
>>> +    hi = tcg_temp_new_i64();
>>> +    lo = tcg_temp_new_i64();
>>> +    tmp = tcg_const_i64(0);
>>> +
>>> +    get_avr64(lo, a->vra, false);
>>> +    get_avr64(hi, a->vra, true);
>>> +
>>> +    get_avr64(n, a->vrb, true);
>>> +    tcg_gen_andi_i64(n, n, 0x7F);
>>> +
>>> +    tcg_gen_movcond_i64(TCG_COND_GE, hi, n, sf, lo, hi);
>>> +    tcg_gen_movcond_i64(TCG_COND_GE, lo, n, sf, tmp, lo);
>>
>> Since you have to mask twice anyway, better use (n & 64) != 0.
>>
> 
> Hmm, I'm not sure if I understood. To check != 0 we'll need a temp to hold n&64. We could 
> use tmp here, but we'll need another one in patch 22. Is that right?

Yes.

r~

diff --git a/target/ppc/insn32.decode b/target/ppc/insn32.decode
index 88baebe35e..3799065508 100644
--- a/target/ppc/insn32.decode
+++ b/target/ppc/insn32.decode
@@ -473,6 +473,7 @@  VSLB            000100 ..... ..... ..... 00100000100    @VX
 VSLH            000100 ..... ..... ..... 00101000100    @VX
 VSLW            000100 ..... ..... ..... 00110000100    @VX
 VSLD            000100 ..... ..... ..... 10111000100    @VX
+VSLQ            000100 ..... ..... ..... 00100000101    @VX
 
 VSRB            000100 ..... ..... ..... 01000000100    @VX
 VSRH            000100 ..... ..... ..... 01001000100    @VX
diff --git a/target/ppc/translate/vmx-impl.c.inc b/target/ppc/translate/vmx-impl.c.inc
index ec4f0e7654..ca98a545ef 100644
--- a/target/ppc/translate/vmx-impl.c.inc
+++ b/target/ppc/translate/vmx-impl.c.inc
@@ -834,6 +834,46 @@  TRANS_FLAGS(ALTIVEC, VSRAH, do_vector_gvec3_VX, MO_16, tcg_gen_gvec_sarv);
 TRANS_FLAGS(ALTIVEC, VSRAW, do_vector_gvec3_VX, MO_32, tcg_gen_gvec_sarv);
 TRANS_FLAGS2(ALTIVEC_207, VSRAD, do_vector_gvec3_VX, MO_64, tcg_gen_gvec_sarv);
 
+static bool trans_VSLQ(DisasContext *ctx, arg_VX *a)
+{
+    TCGv_i64 hi, lo, tmp, n, sf = tcg_constant_i64(64);
+
+    REQUIRE_INSNS_FLAGS2(ctx, ISA310);
+    REQUIRE_VECTOR(ctx);
+
+    n = tcg_temp_new_i64();
+    hi = tcg_temp_new_i64();
+    lo = tcg_temp_new_i64();
+    tmp = tcg_const_i64(0);
+
+    get_avr64(lo, a->vra, false);
+    get_avr64(hi, a->vra, true);
+
+    get_avr64(n, a->vrb, true);
+    tcg_gen_andi_i64(n, n, 0x7F);
+
+    tcg_gen_movcond_i64(TCG_COND_GE, hi, n, sf, lo, hi);
+    tcg_gen_movcond_i64(TCG_COND_GE, lo, n, sf, tmp, lo);
+    tcg_gen_andi_i64(n, n, ~64ULL);
+
+    tcg_gen_shl_i64(tmp, lo, n);
+    set_avr64(a->vrt, tmp, false);
+
+    tcg_gen_shl_i64(hi, hi, n);
+    tcg_gen_xori_i64(n, n, 63);
+    tcg_gen_shr_i64(lo, lo, n);
+    tcg_gen_shri_i64(lo, lo, 1);
+    tcg_gen_or_i64(hi, hi, lo);
+    set_avr64(a->vrt, hi, true);
+
+    tcg_temp_free_i64(hi);
+    tcg_temp_free_i64(lo);
+    tcg_temp_free_i64(tmp);
+    tcg_temp_free_i64(n);
+
+    return true;
+}
+
 #define GEN_VXFORM_SAT(NAME, VECE, NORM, SAT, OPC2, OPC3)               \
 static void glue(glue(gen_, NAME), _vec)(unsigned vece, TCGv_vec t,     \
                                          TCGv_vec sat, TCGv_vec a,      \

[v4,20/47] target/ppc: implement vslq

Commit Message

Comments

Patch