From patchwork Tue Aug 23 13:48:25 2016 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Pranith Kumar X-Patchwork-Id: 9295671 Return-Path: Received: from mail.wl.linuxfoundation.org (pdx-wl-mail.web.codeaurora.org [172.30.200.125]) by pdx-korg-patchwork.web.codeaurora.org (Postfix) with ESMTP id 65F07607F0 for ; Tue, 23 Aug 2016 13:49:25 +0000 (UTC) Received: from mail.wl.linuxfoundation.org (localhost [127.0.0.1]) by mail.wl.linuxfoundation.org (Postfix) with ESMTP id 570DF28790 for ; Tue, 23 Aug 2016 13:49:25 +0000 (UTC) Received: by mail.wl.linuxfoundation.org (Postfix, from userid 486) id 49B092884D; Tue, 23 Aug 2016 13:49:25 +0000 (UTC) X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on pdx-wl-mail.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-6.8 required=2.0 tests=BAYES_00, DKIM_ADSP_CUSTOM_MED, DKIM_SIGNED, FREEMAIL_FROM, RCVD_IN_DNSWL_HI, T_DKIM_INVALID autolearn=ham version=3.3.1 Received: from lists.gnu.org (lists.gnu.org [208.118.235.17]) (using TLSv1 with cipher AES256-SHA (256/256 bits)) (No client certificate requested) by mail.wl.linuxfoundation.org (Postfix) with ESMTPS id C228F28790 for ; Tue, 23 Aug 2016 13:49:24 +0000 (UTC) Received: from localhost ([::1]:46682 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1bcC4x-0005Oa-AS for patchwork-qemu-devel@patchwork.kernel.org; Tue, 23 Aug 2016 09:49:23 -0400 Received: from eggs.gnu.org ([2001:4830:134:3::10]:60426) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1bcC4c-0005Jk-RE for qemu-devel@nongnu.org; Tue, 23 Aug 2016 09:49:03 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1bcC4W-0002ja-R7 for qemu-devel@nongnu.org; Tue, 23 Aug 2016 09:49:02 -0400 Received: from mail-lf0-x244.google.com ([2a00:1450:4010:c07::244]:36164) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1bcC4V-0002jS-VO for qemu-devel@nongnu.org; Tue, 23 Aug 2016 09:48:56 -0400 Received: by mail-lf0-x244.google.com with SMTP id 33so7819654lfw.3 for ; Tue, 23 Aug 2016 06:48:55 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=from:to:cc:subject:date:message-id; bh=/58X8CulXvIKxIbkg/1LyogTO0A2bSCQowzuA3Q7h9M=; b=TiaKw9gQdW9NYEH3pWY4lOqd+XKqk5vdnFkdY2XNaL86eKsThIWpVlXMjlUeinkE9J B5ucSzJrVApia2L4tz7U7kfXShTAR53fmFhdrYSRlrmID1imMu0QwrwturLtpwdgMeei aB36/yLlVgUS28ZCy53Z2rLJPpHt+d4trrvokX3ZwIfp7d0kapXrOsrxT4shM+JDA84+ w2UnJwswRSrPD7D9BoILBeokTaa/dO/CMRoRVhIMMz1RBklfB2tEAnCep8ulUVmNlZgH CtAtnKiWhV4DuSzHxzC9XzescL/yC3WFYIiqKbV2/pdsKa+S6gC9cReJrucDQxgDmn8p bo5w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:from:to:cc:subject:date:message-id; bh=/58X8CulXvIKxIbkg/1LyogTO0A2bSCQowzuA3Q7h9M=; b=A6QroV9sfp8htBWgux2g1jLDdPsckM2o0SCTC6knWLWFE/e7zE8ldV54QkkuYZTuwm bfTLhDuosh4jOq3xhXvjgoZQ7CcT+wMe9ZSWmVuhYwgXAmS8FDAmVSl670sUzBBKdVDe twwFDMiAr4+yHWq7t2fNbXS/q97iAUlLoC1Hd7renIAucIb44vPX6lJg2oiurV53aEYC iExR2HpgRtL5fEVbOstAZyvPYDFPJ/UQ9b6A6G7vSBK0ReOG6vcnC34Dk7Qeh0pnVJ16 1fM8XKC4/8BWLWKOt/9wca+KJeMLKr7fxMyYTktWQmKUsGZ0W+3W7QuDupvhVNXpoQpf EfNA== X-Gm-Message-State: AEkooutXF/fnUnQ2wVIqP54rHIslP5hHkFxW6+BWautcLG1AUFzp6RulzJ7rInksMONheA== X-Received: by 10.25.16.212 with SMTP id 81mr6297928lfq.174.1471960134497; Tue, 23 Aug 2016 06:48:54 -0700 (PDT) Received: from evgadesktop.attlocal.net (se2x.mullvad.net. [185.65.132.111]) by smtp.gmail.com with ESMTPSA id r190sm844372lfg.49.2016.08.23.06.48.51 (version=TLS1_2 cipher=ECDHE-RSA-AES128-SHA bits=128/128); Tue, 23 Aug 2016 06:48:53 -0700 (PDT) From: Pranith Kumar To: qemu-devel@nongnu.org, Richard Henderson Date: Tue, 23 Aug 2016 09:48:25 -0400 Message-Id: <20160823134825.32578-1-bobby.prani@gmail.com> X-Mailer: git-send-email 2.9.3 X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 2a00:1450:4010:c07::244 Subject: [Qemu-devel] [PATCH v3] tcg: Optimize fence instructions X-BeenThere: qemu-devel@nongnu.org X-Mailman-Version: 2.1.21 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Cc: pbonzini@redhat.com, alex.bennee@linaro.org Errors-To: qemu-devel-bounces+patchwork-qemu-devel=patchwork.kernel.org@nongnu.org Sender: "Qemu-devel" X-Virus-Scanned: ClamAV using ClamSMTP This commit optimizes fence instructions. Two optimizations are currently implemented. These are: 1. Unnecessary duplicate fence instructions If the same fence instruction is detected consecutively, we remove one instance of it. ex: mb; mb => mb, strl; strl => strl 2. Merging weaker fence with subsequent/previous stronger fence load-acquire/store-release fence can be combined with a full fence without relaxing the ordering constraint. ex: a) ld; ldaq; mb => ld; mb b) mb; strl; st => mb; st Signed-off-by: Pranith Kumar --- tcg/optimize.c | 74 ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ tcg/tcg.c | 4 ++++ tcg/tcg.h | 1 + 3 files changed, 79 insertions(+) v3: - Update with rth's feedback v2: - skip non-memory instructions while merging fences diff --git a/tcg/optimize.c b/tcg/optimize.c index cffe89b..827f2d0 100644 --- a/tcg/optimize.c +++ b/tcg/optimize.c @@ -538,6 +538,80 @@ static bool swap_commutative2(TCGArg *p1, TCGArg *p2) return false; } +/* Eliminate duplicate and unnecessary fence instructions */ +void tcg_optimize_mb(TCGContext *s) +{ + int oi, oi_next; + TCGArg prev_op_mb = -1; + TCGOp *prev_op = NULL; + + for (oi = s->gen_op_buf[0].next; oi != 0; oi = oi_next) { + TCGOp *op = &s->gen_op_buf[oi]; + TCGArg *args = &s->gen_opparam_buf[op->args]; + TCGOpcode opc = op->opc; + + switch (opc) { + case INDEX_op_mb: + { + if (prev_op_mb == -1) { + goto done; + } + + TCGBar curr_mb_type = args[0] & 0xF0; + TCGBar prev_mb_type = prev_op_mb & 0xF0; + + if (curr_mb_type == prev_mb_type || + (curr_mb_type == TCG_BAR_STRL && prev_mb_type == TCG_BAR_SC)) { + /* Remove the current weaker barrier op. The previous + * barrier is stronger and sufficient. + * mb; strl => mb; st + */ + tcg_op_remove(s, op); + op = prev_op; + args[0] = prev_op_mb; + break; + } else if (curr_mb_type == TCG_BAR_SC && + prev_mb_type == TCG_BAR_LDAQ) { + /* Remove the previous weaker barrier op. The current + * barrier is stronger and sufficient. + * ldaq; mb => ld; mb + */ + tcg_op_remove(s, prev_op); + } else if (curr_mb_type == TCG_BAR_STRL && + prev_mb_type == TCG_BAR_LDAQ) { + /* Consecutive load-acquire and store-release barriers + * can be merged into one stronger SC barrier + * ldaq; strl => ld; mb; st + */ + args[0] = TCG_BAR_SC | TCG_MO_ALL; + tcg_op_remove(s, prev_op); + } + + done: + prev_op_mb = args[0]; + prev_op = op; + break; + } + case INDEX_op_qemu_ld_i32: + case INDEX_op_qemu_ld_i64: + case INDEX_op_qemu_st_i32: + case INDEX_op_qemu_st_i64: + case INDEX_op_call: + prev_op_mb = -1; + prev_op = NULL; + break; + default: + if (tcg_op_defs[opc].flags & TCG_OPF_BB_END) { + prev_op_mb = -1; + prev_op = NULL; + } + break; + } + + oi_next = op->next; + } +} + /* Propagate constants and copies, fold constant expressions. */ void tcg_optimize(TCGContext *s) { diff --git a/tcg/tcg.c b/tcg/tcg.c index 42417bd..1db319e 100644 --- a/tcg/tcg.c +++ b/tcg/tcg.c @@ -2587,6 +2587,10 @@ int tcg_gen_code(TCGContext *s, TranslationBlock *tb) } } +#ifdef USE_TCG_OPTIMIZATIONS + tcg_optimize_mb(s); +#endif + #ifdef CONFIG_PROFILER s->la_time += profile_getclock(); #endif diff --git a/tcg/tcg.h b/tcg/tcg.h index 9ed78dc..79bb5bb 100644 --- a/tcg/tcg.h +++ b/tcg/tcg.h @@ -921,6 +921,7 @@ void tcg_op_remove(TCGContext *s, TCGOp *op); TCGOp *tcg_op_insert_before(TCGContext *s, TCGOp *op, TCGOpcode opc, int narg); TCGOp *tcg_op_insert_after(TCGContext *s, TCGOp *op, TCGOpcode opc, int narg); +void tcg_optimize_mb(TCGContext *s); void tcg_optimize(TCGContext *s); /* only used for debugging purposes */