[RFC,bpf-next,2/2] proof for the safe usage of tnum_in()

Message ID	20220831031907.16133-3-shung-hsi.yu@suse.com (mailing list archive)
State	RFC
Delegated to:	BPF
Headers	show Return-Path: <bpf-owner@kernel.org> From: Shung-Hsi Yu <shung-hsi.yu@suse.com> To: bpf@vger.kernel.org, linux-kernel@vger.kernel.org Cc: "Alexei Starovoitov" <ast@kernel.org>, "Daniel Borkmann" <daniel@iogearbox.net>, "John Fastabend" <john.fastabend@gmail.com>, Shung-Hsi Yu <shung-hsi.yu@suse.com> Subject: [RFC bpf-next 2/2] proof for the safe usage of tnum_in() Date: Wed, 31 Aug 2022 11:19:07 +0800 Message-Id: <20220831031907.16133-3-shung-hsi.yu@suse.com> In-Reply-To: <20220831031907.16133-1-shung-hsi.yu@suse.com> References: <20220831031907.16133-1-shung-hsi.yu@suse.com> Content-Transfer-Encoding: 8bit Content-Type: text/plain MIME-Version: 1.0 Precedence: bulk
Series	bpf: tnums: warn against the usage of tnum_in(tnum_range(), ...) \| expand [RFC,bpf-next,0/2] bpf: tnums: warn against the usage of tnum_in(tnum_range(), ...) [RFC,bpf-next,1/2] bpf: tnums: warn against the usage of tnum_in(tnum_range(), ...) [RFC,bpf-next,2/2] proof for the safe usage of tnum_in()

Message ID

20220831031907.16133-3-shung-hsi.yu@suse.com (mailing list archive)

State

RFC

Delegated to:

BPF

Headers

From: Shung-Hsi Yu <shung-hsi.yu@suse.com>
To: bpf@vger.kernel.org, linux-kernel@vger.kernel.org
Cc: "Alexei Starovoitov" <ast@kernel.org>,
        "Daniel Borkmann" <daniel@iogearbox.net>,
        "John Fastabend" <john.fastabend@gmail.com>,
        Shung-Hsi Yu <shung-hsi.yu@suse.com>
Subject: [RFC bpf-next 2/2] proof for the safe usage of tnum_in()
Date: Wed, 31 Aug 2022 11:19:07 +0800
Message-Id: <20220831031907.16133-3-shung-hsi.yu@suse.com>
In-Reply-To: <20220831031907.16133-1-shung-hsi.yu@suse.com>
References: <20220831031907.16133-1-shung-hsi.yu@suse.com>
Content-Transfer-Encoding: 8bit
Content-Type: text/plain
MIME-Version: 1.0
X-MS-Exchange-AntiSpam-MessageData-ChunkCount: 1
X-MS-Exchange-AntiSpam-MessageData-0: 
 kne0H212/9lpABBMtrVvbUb1WEwKyRthdI4PdperBVh/pLAtexcUwbiscAGYw9E7viGdVAXPvJMH2a04OTX9jWnIzHxV5i49/vd3ZJitsZm2BBP9vu5K9GtgGG34UR+Sd51cE3l5aJG9iHWEoxutsph1CpHIkVQ6NhPAXYmWavvMe7dKza8ncNQDkViYiSK03XGPyyj8TIMfBxxLFIpjUWJsxv1TfJA82QEv87i3drXBqPiurEIas7cVa3g+wdgmrxI4U3Hp/8jTTrFDougN1dDqKVSNhHBxZITnHdiR+5lKJAqC81PWrmmDfZozXojzDXmrdEzhH3p+ik0qXyyVxZ7XUNkRyZ4oBdIHcVFoRuNxD7HUL1f8Gtrk74odjLo3/UE3IpZE1lDS7OSjfhrY2V35iL4fSv3Q4fCxmiV0SxUH2S7VyT/Vr/wVuHR+N5xNLBx9o1PbaQjJdBIVsVVd2OQ6LaM6AujGZEQhZE5uxUp/yd52SZtaqCSxjHXZLOrlZUomx64uynNg0qwZJYTxXx13OOVuZOGB+nQisvZOptfJg8pzL9LWR03sJFfuYBUl8cGBw7/V6zAbSE46Yq3oYONwHBD0MH9tsPcG0oHVsr00fImN8xl998V0sJ6qH9SZgyPph6oHCXRl7Afwtj5pbxGGSyBv8G1IvWrTSyLq6OQqA6ArbUmglLkoU2RYG+ukfW3TuDzJvp26RZdyo4leUQf2oonPnTLe76phS43N6UsY46oqqCTzFZiG4hiHlW61Hx7hcNqCVk7T1CTWCJGlWFNUmRxWszbwdDYfA7EuVrxE9Pzyii04t0kHpIly5iv0zFUO2q/NEHNgfFM3bhjzkbfWnUdkC4fNJIFTbtKvuC3bio9hrJdLdDj9SWaWqtk8ErJXDEVBboyn4hP7gdFeSXVJxFW90W1k7Sy3tR5RK/ln1JOmZ4ShurLaxQEt2NWCWrvih/VhpPiaTKutfekSObuiKlo14k1FRMLGHLuMJhonbZEjozqkRq1hHyQ9yT0CObM3hxHMKSTOJBw0IhWvIeK+HJbmL7ZcU++ovz0ctFTII/7nRRPoJVtepLBgORc4GQR7NSVOlN9b7XWsv8V/5KxiPYz7ffe6Qts5D2DxuxM2tnqfLZSYAHoIwKsQVJREimrn6LqqUPIJz0qLOeF5X0Bpne3QMi24NdfAQps1UvFnwr8ptVAEHUEMIxv8OAsaidmxvAR0SlcOWS3kl6nrCMeKdMMZyiBsW7Kk/PIWrssnbRQbDGVI83i1pBDWLof9k2sNK1A/QWuIkQ7be56cW6obP95X6g58kldPBBSNMjRaiKclzWBZEAZeoJJtxqoOa/jLxMHv5JlwpUjRxIyX67gK8d0WAyANSmK01Lwb/OVwixIwsq7c4tTXEgcyAfmQPEf0M5K/IUnuWNzMeQbuAZ7wXg6xlNQHtdxUNidMFur/2wIx47hivzMF9JOmYbMVIMwy+zu76EpshasVQ7xIV9QGN26lVu5KlZDkcAHveTWsf9Db06gwgCqPVP9RFdBPGH6EbTiqR9QsxEmmaOWh0pBRNuUKYusjr13sovy0C4H3FfEXX8JoX0/PUtnQZ99P
X-OriginatorOrg: suse.com
X-MS-Exchange-CrossTenant-Network-Message-Id: 
 4e87b229-89f6-4ed7-6eac-08da8affa3d3
X-MS-Exchange-CrossTenant-AuthSource: DB9PR04MB8107.eurprd04.prod.outlook.com
X-MS-Exchange-CrossTenant-AuthAs: Internal
X-MS-Exchange-CrossTenant-OriginalArrivalTime: 31 Aug 2022 03:19:40.0088
 (UTC)
X-MS-Exchange-CrossTenant-FromEntityHeader: Hosted
X-MS-Exchange-CrossTenant-Id: f7a17af6-1c5c-4a36-aa8b-f5be247aa4ba
X-MS-Exchange-CrossTenant-MailboxType: HOSTED
X-MS-Exchange-CrossTenant-UserPrincipalName: 
 3CRjsrHSiobIxYf0amfklHH5gye45cecfSck2We3Ufx8L4enX5nUASmwwUvr/GoJaW2J4qgYuhCAFqK2urwgjA==
X-MS-Exchange-Transport-CrossTenantHeadersStamped: AM5PR04MB3009
Precedence: bulk
List-ID: <bpf.vger.kernel.org>
X-Mailing-List: bpf@vger.kernel.org
X-Patchwork-Delegate: bpf@iogearbox.net
X-Patchwork-State: RFC

Series

bpf: tnums: warn against the usage of tnum_in(tnum_range(), ...) | expand

Context	Check	Description
bpf/vmtest-bpf-next-PR	pending	PR summary
bpf/vmtest-bpf-next-VM_Test-4	success	Logs for llvm-toolchain
bpf/vmtest-bpf-next-VM_Test-5	success	Logs for set-matrix
netdev/tree_selection	success	Clearly marked for bpf-next
netdev/fixes_present	success	Fixes tag not required for -next series
netdev/subject_prefix	success	Link
netdev/cover_letter	success	Series has a cover letter
netdev/patch_count	success	Link
netdev/header_inline	success	No static functions without inline keyword in header files
netdev/build_32bit	success	Errors and warnings before: 0 this patch: 0
netdev/cc_maintainers	success	CCed 2 of 2 maintainers
netdev/build_clang	success	Errors and warnings before: 0 this patch: 0
netdev/module_param	success	Was 0 now: 0
netdev/verify_signedoff	success	Signed-off-by tag matches author and committer
netdev/check_selftest	success	No net selftest shell script
netdev/verify_fixes	success	No Fixes tag
netdev/build_allmodconfig_warn	success	Errors and warnings before: 0 this patch: 0
netdev/checkpatch	warning	WARNING: Missing or malformed SPDX-License-Identifier tag in line 2 WARNING: added, moved or deleted file(s), does MAINTAINERS need updating?
netdev/kdoc	success	Errors and warnings before: 0 this patch: 0
netdev/source_inline	success	Was 0 now: 0
bpf/vmtest-bpf-next-VM_Test-2	success	Logs for build for x86_64 with gcc
bpf/vmtest-bpf-next-VM_Test-3	success	Logs for build for x86_64 with llvm-16
bpf/vmtest-bpf-next-VM_Test-1	success	Logs for build for s390x with gcc
bpf/vmtest-bpf-next-VM_Test-6	success	Logs for test_maps on s390x with gcc
bpf/vmtest-bpf-next-VM_Test-7	success	Logs for test_maps on x86_64 with gcc
bpf/vmtest-bpf-next-VM_Test-8	success	Logs for test_maps on x86_64 with llvm-16
bpf/vmtest-bpf-next-VM_Test-9	pending	Logs for test_progs on s390x with gcc
bpf/vmtest-bpf-next-VM_Test-10	pending	Logs for test_progs on x86_64 with gcc
bpf/vmtest-bpf-next-VM_Test-11	pending	Logs for test_progs on x86_64 with llvm-16
bpf/vmtest-bpf-next-VM_Test-12	pending	Logs for test_progs_no_alu32 on s390x with gcc
bpf/vmtest-bpf-next-VM_Test-13	pending	Logs for test_progs_no_alu32 on x86_64 with gcc
bpf/vmtest-bpf-next-VM_Test-14	pending	Logs for test_progs_no_alu32 on x86_64 with llvm-16
bpf/vmtest-bpf-next-VM_Test-15	success	Logs for test_verifier on s390x with gcc
bpf/vmtest-bpf-next-VM_Test-16	success	Logs for test_verifier on x86_64 with gcc
bpf/vmtest-bpf-next-VM_Test-17	success	Logs for test_verifier on x86_64 with llvm-16

Context

Check

Description

bpf/vmtest-bpf-next-PR

pending

PR summary

bpf/vmtest-bpf-next-VM_Test-4

success

Logs for llvm-toolchain

bpf/vmtest-bpf-next-VM_Test-5

success

Logs for set-matrix

netdev/tree_selection

success

Clearly marked for bpf-next

netdev/fixes_present

success

Fixes tag not required for -next series

netdev/subject_prefix

success

Link

netdev/cover_letter

success

Series has a cover letter

netdev/patch_count

success

Link

netdev/header_inline

success

No static functions without inline keyword in header files

netdev/build_32bit

success

Errors and warnings before: 0 this patch: 0

netdev/cc_maintainers

success

CCed 2 of 2 maintainers

netdev/build_clang

success

Errors and warnings before: 0 this patch: 0

netdev/module_param

success

Was 0 now: 0

netdev/verify_signedoff

success

Signed-off-by tag matches author and committer

netdev/check_selftest

success

No net selftest shell script

netdev/verify_fixes

success

No Fixes tag

netdev/build_allmodconfig_warn

success

Errors and warnings before: 0 this patch: 0

netdev/checkpatch

warning

WARNING: Missing or malformed SPDX-License-Identifier tag in line 2 WARNING: added, moved or deleted file(s), does MAINTAINERS need updating?

netdev/kdoc

success

Errors and warnings before: 0 this patch: 0

netdev/source_inline

success

Was 0 now: 0

bpf/vmtest-bpf-next-VM_Test-2

success

Logs for build for x86_64 with gcc

bpf/vmtest-bpf-next-VM_Test-3

success

Logs for build for x86_64 with llvm-16

bpf/vmtest-bpf-next-VM_Test-1

success

Logs for build for s390x with gcc

bpf/vmtest-bpf-next-VM_Test-6

success

Logs for test_maps on s390x with gcc

bpf/vmtest-bpf-next-VM_Test-7

success

Logs for test_maps on x86_64 with gcc

bpf/vmtest-bpf-next-VM_Test-8

success

Logs for test_maps on x86_64 with llvm-16

bpf/vmtest-bpf-next-VM_Test-9

pending

Logs for test_progs on s390x with gcc

bpf/vmtest-bpf-next-VM_Test-10

pending

Logs for test_progs on x86_64 with gcc

bpf/vmtest-bpf-next-VM_Test-11

pending

Logs for test_progs on x86_64 with llvm-16

bpf/vmtest-bpf-next-VM_Test-12

pending

Logs for test_progs_no_alu32 on s390x with gcc

bpf/vmtest-bpf-next-VM_Test-13

pending

Logs for test_progs_no_alu32 on x86_64 with gcc

bpf/vmtest-bpf-next-VM_Test-14

pending

Logs for test_progs_no_alu32 on x86_64 with llvm-16

bpf/vmtest-bpf-next-VM_Test-15

success

Logs for test_verifier on s390x with gcc

bpf/vmtest-bpf-next-VM_Test-16

success

Logs for test_verifier on x86_64 with gcc

bpf/vmtest-bpf-next-VM_Test-17

success

Logs for test_verifier on x86_64 with llvm-16

Commit Message

Shung-Hsi Yu Aug. 31, 2022, 3:19 a.m. UTC

This commit is not meant to be merged, merely as a display of proof
about the claims in previous commit that tnum_in() can be trusted when
used in the following form:

- tnum_in(tnum_const(), ...)
- tnum_in(tnum_range(0, 2**n - 1), ...)
- tnum_in(tnum_range(2**n, 2**(n+1) - 1), ...)

Note that this only proves that tnum_in() can be trusted when it returns
true, and proof nothing about whether it's trustworthy or not when it
returns false; the latter is still being worked on.

Signed-off-by: Shung-Hsi Yu <shung-hsi.yu@suse.com>
---
 tnum_in.py | 158 +++++++++++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 158 insertions(+)
 create mode 100755 tnum_in.py

diff --git a/tnum_in.py b/tnum_in.py
new file mode 100755
index 000000000000..e4567bda51c4
--- /dev/null
+++ b/tnum_in.py
@@ -0,0 +1,158 @@ 
+#!/usr/bin/env python3
+#
+# A proof on the property of tnum_in(tnum_range(a, b), ...) using the Z3
+# theorem prover
+#
+# Requires the z3 Python module (aka Z3Py), which can be installed with the
+# command `pip3 install z3-solver`
+#
+from uuid import uuid4
+from z3 import And, BitVec, BitVecs, BitVecVal, Extract, If, Implies, Or, ULE, UGT, ZeroExt, prove
+
+
+class Tnum:
+    """A model of tristate number use in Linux kernel's BPF verifier.
+
+    Largely based on the "Sound, Precise, and Fast Abstract Interpretation with
+    Tristate Numbers" paper <https://arxiv.org/abs/2105.05398>.
+    """
+    SIZE = 64
+    def __init__(self, val=None, mask=None):
+        uid = uuid4() # Ensure that the BitVec are uniq, required by the Z3 solver
+        self.val = BitVec(f'Tnum-val-{uid}', bv=Tnum.SIZE) if val is None else val
+        self.mask = BitVec(f'Tnum-mask-{uid}', bv=Tnum.SIZE) if mask is None else mask
+
+    def contains(self, bitvec):
+        # Mask out the unknown bits, if what left is that same as value, then
+        # this that integer is represented by this tnum
+        return (~self.mask & bitvec) == self.val
+
+    def wellformed(self):
+        # Bit cannot be set in both val and mask, such tnum is not valid
+        return self.val & self.mask == BitVecVal(0, bv=Tnum.SIZE)
+
+
+def is_power_of_2(n):
+    return And(n != 0, n & (n-1) == 0)
+
+
+def fls64(bv):
+    size = Tnum.SIZE
+    num = BitVecVal(0, bv=Tnum.SIZE)
+    while size > 1:
+        half_size = size // 2
+        h = Extract(size - 1, half_size, bv)
+        bv = If(
+            h != 0,
+            h,
+            Extract(half_size - 1, 0, bv),
+        )
+        num += If(h != 0, BitVecVal(half_size, bv=Tnum.SIZE), BitVecVal(0, bv=Tnum.SIZE))
+        size = half_size
+
+    assert(size == 1) # Size is now 1
+    num += If(bv != 0, BitVecVal(1, bv=Tnum.SIZE), BitVecVal(0, bv=Tnum.SIZE))
+    return num
+
+
+def tnum_range(min_, max_): # Don't shadow built-in min & max
+    """tnum_range() implementation modeling what's found in the Linux Kernel"""
+    chi = min_ ^ max_
+    bits = fls64(chi)
+    delta = (BitVecVal(1, bv=Tnum.SIZE) << bits) - 1
+    too_large = UGT(bits, BitVecVal(Tnum.SIZE - 1, bv=Tnum.SIZE))
+
+    val = If(
+        too_large,
+        BitVecVal(0, bv=Tnum.SIZE),
+        min_ & ~delta,
+    )
+    mask = If(
+        too_large,
+        BitVecVal(-1, bv=Tnum.SIZE),
+        delta,
+    )
+    return Tnum(val=val, mask=mask)
+
+
+def tnum_in(a, b):
+    """tnum_in() implementation modeling what's found in the Linux Kernel"""
+    return If(
+        (b.mask & ~a.mask) != 0,
+        False,
+        a.val == (b.val & ~a.mask),
+    )
+
+
+# a, b, and x are integers which could be of any value
+a, b, x = BitVecs('a b x', bv=Tnum.SIZE)
+assumptions = []
+
+t = tnum_range(a, b) # Any possible range we could get out of tnum_range()
+assumptions += [
+    ULE(a, b), # a <= b
+]
+
+st = Tnum() # The second argument can be any tnum
+assumptions += [
+    st.wellformed(), # As long as it is a valid one
+    st.contains(x), # And contains the number x (that could be any integers)
+]
+
+condition = [
+    # When tnum_in() returns true
+    tnum_in(t, st) == True,
+]
+
+print("""\
+Trying to proof that tnum_in(tnum_range(a,b), ...) can always be trusted when
+it returns true...
+""")
+prove(
+    Implies(
+        # When using tnum_in(tnum_range(a, b), ...)
+        And(assumptions + condition),
+        # Try to prove that we can always trust it when it returns true
+        # That is, all number that the second argument can represent (i.e. x) is
+        # inclusively between a and b
+        And(ULE(a, x), ULE(x, b)),
+    )
+)
+print("")
+
+# Additional constrains, namely that the first argument need to be in the form of either
+#   tnum_const()
+# or
+#   tnum_range(0, 2**n - 1)
+# or
+#   tnum_range(2**n, 2**(n+1) - 1)
+additional_assumptions = [
+    Or(
+        a == b, # since a == b, tnum_range(a, b) == tnum_const()
+        And(a == 0, is_power_of_2(b + 1)), # b is 2**n - 1
+        And(is_power_of_2(a), b == (a << 1) - 1) # a is 2**n and b is 2**(n+1) - 1
+    ),
+]
+
+print("""\
+Trying to proof that tnum_in(tnum_range(a,b), ...) can always be trusted when
+it returns true, again, but with constrains on a and b, namely the first
+argument of tnum_in() must be in one of the following forms:
+- tnum_in(tnum_const(), ...)
+- tnum_in(tnum_range(0, 2**n - 1), ...)
+- tnum_in(tnum_range(2**n, 2**(n+1) - 1), ...)
+""")
+prove(
+    Implies(
+        # When tnum_in() is used in the form of
+        #   tnum_in(tnum_const(), ...)
+        # or
+        #   tnum_in(tnum_range(0, 2**n - 1), ...)
+        # or
+        #   tnum_in(tnum_range(2**n, 2**(n+1) - 1), ...)
+        And(assumptions + additional_assumptions + condition),
+        # Try to prove that we can always trust it when it returns true when the additional
+        # contrains above is inplace
+        And(ULE(a, x), ULE(x, b)),
+    )
+)

[RFC,bpf-next,2/2] proof for the safe usage of tnum_in()

Checks

Commit Message

Patch