
[bpf-next] bpf: Relax precision marking in open coded iters and may_goto loop.

Message ID 20240522024713.59136-1-alexei.starovoitov@gmail.com (mailing list archive)
State Superseded
Delegated to: BPF

Checks

Context Check Description
bpf/vmtest-bpf-next-PR fail PR summary
bpf/vmtest-bpf-next-VM_Test-3 success Logs for Validate matrix.py
bpf/vmtest-bpf-next-VM_Test-2 success Logs for Unittests
bpf/vmtest-bpf-next-VM_Test-0 success Logs for Lint
bpf/vmtest-bpf-next-VM_Test-5 success Logs for aarch64-gcc / build-release
bpf/vmtest-bpf-next-VM_Test-1 success Logs for ShellCheck
bpf/vmtest-bpf-next-VM_Test-10 success Logs for aarch64-gcc / veristat
bpf/vmtest-bpf-next-VM_Test-12 success Logs for s390x-gcc / build-release
bpf/vmtest-bpf-next-VM_Test-4 success Logs for aarch64-gcc / build / build for aarch64 with gcc
bpf/vmtest-bpf-next-VM_Test-14 success Logs for s390x-gcc / test (test_progs, false, 360) / test_progs on s390x with gcc
bpf/vmtest-bpf-next-VM_Test-8 success Logs for aarch64-gcc / test (test_progs_no_alu32, false, 360) / test_progs_no_alu32 on aarch64 with gcc
bpf/vmtest-bpf-next-VM_Test-11 success Logs for s390x-gcc / build / build for s390x with gcc
bpf/vmtest-bpf-next-VM_Test-9 success Logs for aarch64-gcc / test (test_verifier, false, 360) / test_verifier on aarch64 with gcc
bpf/vmtest-bpf-next-VM_Test-7 success Logs for aarch64-gcc / test (test_progs, false, 360) / test_progs on aarch64 with gcc
bpf/vmtest-bpf-next-VM_Test-34 success Logs for x86_64-llvm-17 / veristat
bpf/vmtest-bpf-next-VM_Test-13 success Logs for s390x-gcc / test (test_maps, false, 360) / test_maps on s390x with gcc
bpf/vmtest-bpf-next-VM_Test-17 success Logs for s390x-gcc / veristat
bpf/vmtest-bpf-next-VM_Test-36 success Logs for x86_64-llvm-18 / build-release / build for x86_64 with llvm-18 and -O2 optimization
bpf/vmtest-bpf-next-VM_Test-18 success Logs for set-matrix
bpf/vmtest-bpf-next-VM_Test-28 success Logs for x86_64-llvm-17 / build / build for x86_64 with llvm-17
bpf/vmtest-bpf-next-VM_Test-29 success Logs for x86_64-llvm-17 / build-release / build for x86_64 with llvm-17 and -O2 optimization
bpf/vmtest-bpf-next-VM_Test-16 success Logs for s390x-gcc / test (test_verifier, false, 360) / test_verifier on s390x with gcc
bpf/vmtest-bpf-next-VM_Test-42 success Logs for x86_64-llvm-18 / veristat
bpf/vmtest-bpf-next-VM_Test-15 success Logs for s390x-gcc / test (test_progs_no_alu32, false, 360) / test_progs_no_alu32 on s390x with gcc
bpf/vmtest-bpf-next-VM_Test-35 success Logs for x86_64-llvm-18 / build / build for x86_64 with llvm-18
bpf/vmtest-bpf-next-VM_Test-6 success Logs for aarch64-gcc / test (test_maps, false, 360) / test_maps on aarch64 with gcc
bpf/vmtest-bpf-next-VM_Test-20 success Logs for x86_64-gcc / build-release
bpf/vmtest-bpf-next-VM_Test-19 success Logs for x86_64-gcc / build / build for x86_64 with gcc
bpf/vmtest-bpf-next-VM_Test-21 success Logs for x86_64-gcc / test (test_maps, false, 360) / test_maps on x86_64 with gcc
bpf/vmtest-bpf-next-VM_Test-32 success Logs for x86_64-llvm-17 / test (test_progs_no_alu32, false, 360) / test_progs_no_alu32 on x86_64 with llvm-17
bpf/vmtest-bpf-next-VM_Test-26 success Logs for x86_64-gcc / test (test_verifier, false, 360) / test_verifier on x86_64 with gcc
bpf/vmtest-bpf-next-VM_Test-31 success Logs for x86_64-llvm-17 / test (test_progs, false, 360) / test_progs on x86_64 with llvm-17
bpf/vmtest-bpf-next-VM_Test-41 success Logs for x86_64-llvm-18 / test (test_verifier, false, 360) / test_verifier on x86_64 with llvm-18
bpf/vmtest-bpf-next-VM_Test-33 success Logs for x86_64-llvm-17 / test (test_verifier, false, 360) / test_verifier on x86_64 with llvm-17
bpf/vmtest-bpf-next-VM_Test-30 success Logs for x86_64-llvm-17 / test (test_maps, false, 360) / test_maps on x86_64 with llvm-17
bpf/vmtest-bpf-next-VM_Test-23 success Logs for x86_64-gcc / test (test_progs_no_alu32, false, 360) / test_progs_no_alu32 on x86_64 with gcc
bpf/vmtest-bpf-next-VM_Test-24 success Logs for x86_64-gcc / test (test_progs_no_alu32_parallel, true, 30) / test_progs_no_alu32_parallel on x86_64 with gcc
bpf/vmtest-bpf-next-VM_Test-22 success Logs for x86_64-gcc / test (test_progs, false, 360) / test_progs on x86_64 with gcc
bpf/vmtest-bpf-next-VM_Test-27 fail Logs for x86_64-gcc / veristat / veristat on x86_64 with gcc
bpf/vmtest-bpf-next-VM_Test-25 success Logs for x86_64-gcc / test (test_progs_parallel, true, 30) / test_progs_parallel on x86_64 with gcc
bpf/vmtest-bpf-next-VM_Test-39 success Logs for x86_64-llvm-18 / test (test_progs_cpuv4, false, 360) / test_progs_cpuv4 on x86_64 with llvm-18
bpf/vmtest-bpf-next-VM_Test-37 success Logs for x86_64-llvm-18 / test (test_maps, false, 360) / test_maps on x86_64 with llvm-18
bpf/vmtest-bpf-next-VM_Test-40 success Logs for x86_64-llvm-18 / test (test_progs_no_alu32, false, 360) / test_progs_no_alu32 on x86_64 with llvm-18
bpf/vmtest-bpf-next-VM_Test-38 success Logs for x86_64-llvm-18 / test (test_progs, false, 360) / test_progs on x86_64 with llvm-18

Commit Message

Alexei Starovoitov May 22, 2024, 2:47 a.m. UTC
From: Alexei Starovoitov <ast@kernel.org>

Motivation for the patch
------------------------
Open coded iterators and may_goto are a great mechanism to implement loops,
but counted loops are problematic. For example:
  for (i = 0; i < 100 && can_loop; i++)
is verified as a bounded loop, since the i < 100 condition forces the verifier
to mark 'i' as precise, and loop states at different iterations are not equivalent.
That removes the benefit of open coded iterators and may_goto.
The workaround is to do:
  int zero = 0; /* global or volatile variable */
  for (i = zero; i < 100 && can_loop; i++)
to hide the value of 'i' from the verifier.
It's unnatural, and so far users haven't learned such an odd programming pattern.

This patch aims to improve the verifier to support
  for (i = 0; i < 100000 && can_loop; i++)
as an open coded iter loop (when 'i' doesn't need to be precise).
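
For illustration, a minimal sketch of the pattern this patch wants to accept
(hypothetical program; assumes the can_loop macro from the selftests'
bpf_experimental.h):

  SEC("syscall")
  int counted_loop(void *ctx)
  {
          __u64 i, sum = 0;

          /* plain 'i = 0', no 'i = zero' obfuscation needed */
          for (i = 0; i < 100000 && can_loop; i++)
                  sum += i; /* 'i' is never used as a memory index */
          return 0;
  }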

Algorithm
---------
First of all:
   if (is_may_goto_insn_at(env, insn_idx)) {
+          update_loop_entry(cur, &sl->state);
           if (states_equal(env, &sl->state, cur, RANGE_WITHIN)) {
-                  update_loop_entry(cur, &sl->state);

This should be correct, since reaching the same insn should
satisfy the "if h1 in path" requirement of the update_loop_entry() algorithm.
Updating loop_entry only on a state match is too conservative.

With that, get_loop_entry() can be used to gate the is_branch_taken() logic:
when 'if (i < 1000)' is done within an open coded iterator or in a loop with
may_goto, don't invoke is_branch_taken().
When it's skipped, don't do reg_bounds_sanity_check() either, since it will
surely see range violations.

Now, consider progs/iters_task_vma.c that has the following logic:
    bpf_for_each(...) {
       if (i > 1000)
          break;

       arr[i] = ..;
    }

Skipping the precision mark at if (i > 1000) keeps 'i' imprecise,
but arr[i] will mark 'i' as precise anyway, because 'arr' is a map.
On the next iteration of the loop the patch does copy_precision(),
which copies precision markings from the top of the loop into the next
state of the loop. So on the next iteration 'i' will be seen as precise.

Hence the key part of the patch:
-       pred = is_branch_taken(dst_reg, src_reg, opcode, is_jmp32);
+       if (!get_loop_entry(this_branch) || src_reg->precise || dst_reg->precise ||
+           (BPF_SRC(insn->code) == BPF_K && insn->imm == 0))
+               pred = is_branch_taken(dst_reg, src_reg, opcode, is_jmp32);

!get_loop_entry(this_branch) -> if not inside an open coded iter, keep the
  existing is_branch_taken() logic, since bounded loops rely on it.

src_reg->precise || dst_reg->precise -> if later inside the loop 'i' was
  actually marked as precise, then we have to do is_branch_taken(), and the
  above bpf_for_each() will be verified as a bounded loop checking all 1000
  iterations. Otherwise we would keep incrementing 'i', it would eventually
  get out of bounds in arr[i], and the verifier would reject such a memory
  access.

BPF_SRC(insn->code) == BPF_K && insn->imm == 0 -> if it's a check for
  an exit condition of an open coded iterator, then do is_branch_taken() as
  well. Otherwise open coded iterators won't work at all.

Now consider the same example:
    bpf_for_each(...) {
       if (i > 1000)
          break;

       arr[i] = ..;
    }
but 'arr' is an arena pointer. In this case 'i > 1000' will keep 'i'
imprecise, and arr[i] will keep it imprecise as well, so the whole loop
will be verified with open coded iterator logic.

Now the following works:
-       for (i = zero; i < 1000; i++)
+       for (i = 0; i < 100000 && can_loop; i++) {
                htab_update_elem(htab, i, i);
+               arr[i] = i; // either arr1 or arr2
+       }
+char __arena arr1[100000]; /* works */
+char arr2[100000]; /* runs into 1M limit */

So users can now use the 'for (i = 0; ...' pattern everywhere, and the
verifier will fall back to bounded loop logic and a precise 'i' when 'i'
is used in a map-style memory access.
For arena based algorithms 'i' will stay imprecise.
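
To make the contrast concrete, a hypothetical back-to-back example
(arr_map/arr_arena are made-up names; assumes can_loop and the __arena
attribute from the selftests):

  char arr_map[1000];           /* .bss, i.e. map-backed memory */
  char __arena arr_arena[1000]; /* arena memory */

  for (i = 0; i < 1000 && can_loop; i++)
          arr_map[i] = 1;   /* access marks 'i' precise -> bounded loop logic */

  for (i = 0; i < 1000 && can_loop; i++)
          arr_arena[i] = 1; /* 'i' stays imprecise -> converges like may_goto */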

-       for (i = zero; i < ARR_SZ && can_loop; i++)
+       /* i = 0 is ok here, since i is not used in memory access */
+       for (i = 0; i < ARR_SZ && can_loop; i++)
                sum += i;
+
+       /* have to use i = zero due to arr[i] where arr is not an arena */
        for (i = zero; i < ARR_SZ; i++) {
                barrier_var(i);
                sum += i + arr[i];

and the i = zero workaround in iter_obfuscate_counter() can be removed.

copy_precision() is a hack, of course, to demonstrate an idea.

Signed-off-by: Alexei Starovoitov <ast@kernel.org>
---
 kernel/bpf/verifier.c                         | 94 +++++++++++++++++--
 .../testing/selftests/bpf/progs/arena_htab.c  | 11 ++-
 tools/testing/selftests/bpf/progs/iters.c     | 18 +---
 .../bpf/progs/verifier_iterating_callbacks.c  | 17 ++--
 4 files changed, 112 insertions(+), 28 deletions(-)

Comments

Andrii Nakryiko May 22, 2024, 5:33 p.m. UTC | #1
On Tue, May 21, 2024 at 7:47 PM Alexei Starovoitov
<alexei.starovoitov@gmail.com> wrote:
>
> From: Alexei Starovoitov <ast@kernel.org>
>
[...]

> Now, consider progs/iters_task_vma.c that has the following logic:
>     bpf_for_each(...) {
>        if (i > 1000)

I'm wondering, maybe we should change the rules around handling inequality
(>, >=, <, <=) comparisons for register(s) that have a constant value
(or maybe actually any value).

My reasoning is the following. When we have something like this
`if (i > 1000)` condition, that means that for the fallthrough branch
whether i is 0, or 1, or 2, or whatever doesn't really matter, because
the code presumably works for any value in [0, 999] range, right? So
maybe in addition to marking it precise and keeping i's range estimate
the same, we should extend this range according to the inequality condition?

That is, even if we know on the first iteration that i is 0 (!precise),
when we simulate this conditional jump instruction, adjust i's range
to be [0, 999] (precise) in the fallthrough branch, and [1000,
U64_MAX] in the true branch?

I.e., make conditional jumps into "range widening" instructions?

Have you thought about this approach? Do you think it will work in
practice? I'm sure it can't be as simple as that, but still, it's worth
considering. Curious to hear Eduard's opinion as well; he's dealt
with this a lot in the past.
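
For concreteness, a hypothetical sketch for an unsigned `if (rX >= bound)`
check (widen_regs() is a made-up helper, not actual verifier code, and the
signed and var_off bounds would need the same treatment):

    static void widen_regs(struct bpf_reg_state *fall_reg,
                           struct bpf_reg_state *taken_reg, u64 bound)
    {
            /* fallthrough: the code must work for any value below bound */
            fall_reg->umin_value = 0;
            fall_reg->umax_value = bound - 1;
            /* taken branch: any value at or above bound */
            taken_reg->umin_value = bound;
            taken_reg->umax_value = U64_MAX;
    }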

>           break;
>
>        arr[i] = ..;
>     }

[...]

There is a lot to think about here, I'll try to get to this
today/tomorrow. But for now veristat is concerned about this change
([0]):

|File                              |Program                          |Verdict                |States Diff (%)|
|----------------------------------|---------------------------------|-----------------------|---------------|
|arena_htab_asm.bpf.o              |arena_htab_asm                   |success                |-80.91 %       |
|core_kern.bpf.o                   |balancer_ingress                 |success -> failure (!!)|+0.00 %        |
|dynptr_success.bpf.o              |test_read_write                  |success -> failure (!!)|+0.00 %        |
|iters.bpf.o                       |checkpoint_states_deletion       |success -> failure (!!)|+0.00 %        |
|iters.bpf.o                       |iter_multiple_sequential_loops   |success                |-11.43 %       |
|iters.bpf.o                       |iter_obfuscate_counter           |success                |+30.00 %       |
|iters.bpf.o                       |iter_pragma_unroll_loop          |success                |-23.08 %       |
|iters.bpf.o                       |iter_subprog_iters               |success                |+1.14 %        |
|iters.bpf.o                       |loop_state_deps1                 |failure                |+7.14 %        |
|iters.bpf.o                       |loop_state_deps2                 |failure                |-2.17 %        |
|iters_task_vma.bpf.o              |iter_task_vma_for_each           |success -> failure (!!)|+99.20 %       |
|linked_list.bpf.o                 |global_list_push_pop_multiple    |success -> failure (!!)|+0.00 %        |
|linked_list.bpf.o                 |inner_map_list_push_pop_multiple |success -> failure (!!)|+0.00 %        |
|linked_list.bpf.o                 |map_list_push_pop_multiple       |success -> failure (!!)|+0.00 %        |
|test_seg6_loop.bpf.o              |__add_egr_x                      |success -> failure (!!)|+0.00 %        |
|test_sysctl_loop1.bpf.o           |sysctl_tcp_mem                   |success -> failure (!!)|+0.00 %        |
|test_sysctl_loop2.bpf.o           |sysctl_tcp_mem                   |success -> failure (!!)|+0.00 %        |
|test_verif_scale2.bpf.o           |balancer_ingress                 |success -> failure (!!)|+0.00 %        |
|verifier_bounds.bpf.o             |bound_greater_than_u32_max       |success -> failure (!!)|+0.00 %        |
|verifier_bounds.bpf.o             |crossing_32_bit_signed_boundary_2|success -> failure (!!)|+0.00 %        |
|verifier_bounds.bpf.o             |crossing_64_bit_signed_boundary_2|success -> failure (!!)|+0.00 %        |
|verifier_iterating_callbacks.bpf.o|cond_break2                      |success                |+75.00 %       |
|verifier_iterating_callbacks.bpf.o|cond_break3                      |success                |+66.67 %       |
|verifier_iterating_callbacks.bpf.o|cond_break4                      |success                |+300.00 %      |
|verifier_iterating_callbacks.bpf.o|cond_break5                      |success                |+266.67 %      |

  [0] https://github.com/kernel-patches/bpf/actions/runs/9184700207/job/25257587541

[...]
Eduard Zingerman May 22, 2024, 8:18 p.m. UTC | #2
On Tue, 2024-05-21 at 19:47 -0700, Alexei Starovoitov wrote:

[...]

Regarding this part, since we discussed it off-list
(I'll continue with the rest of the patch a bit later).

> First of all:
>    if (is_may_goto_insn_at(env, insn_idx)) {
> +          update_loop_entry(cur, &sl->state);
>            if (states_equal(env, &sl->state, cur, RANGE_WITHIN)) {
> -                  update_loop_entry(cur, &sl->state);
> 
> This should be correct, since reaching the same insn should
> satisfy "if h1 in path" requirement of update_loop_entry() algorithm.
> It's too conservative to update loop_entry only on a state match.

So, this basically changes the definition of the verifier states loop.
Previously, we considered a state loop to be such a sequence of states
Si -> ... -> Sj -> ... -> Sk that states_equal(Si, Sk, RANGE_WITHIN)
is true.

With this change Si -> ... -> Sj -> ... -> Sk is a loop if call sites and
instruction pointers for Si and Sk match.

Whether or not Si and Sk are in the loop influences two things:
(a) if exact comparison is needed for states cache;
(b) if widening transformation could be applied to some scalars.

As far as I understand, all pairs (Si, Sk) marked as a loop using old
definition would be marked as such using new definition
(in addition to some new pairs).

I think that it is safe to apply (a) and (b) in strictly more cases.
(Although it is probably possible to conjure a program where such
 change would hinder verification convergence).

[...]
Alexei Starovoitov May 22, 2024, 9:09 p.m. UTC | #3
On Wed, May 22, 2024 at 10:33 AM Andrii Nakryiko
<andrii.nakryiko@gmail.com> wrote:
>
> On Tue, May 21, 2024 at 7:47 PM Alexei Starovoitov
> <alexei.starovoitov@gmail.com> wrote:
[...]

> >     bpf_for_each(...) {
> >        if (i > 1000)
>
> I'm wondering, maybe we should change the rules around handling inequality
> (>, >=, <, <=) comparisons for register(s) that have a constant value
> (or maybe actually any value).
>
> My reasoning is the following. When we have something like this
> `if (i > 1000)` condition, that means that for the fallthrough branch
> whether i is 0, or 1, or 2, or whatever doesn't really matter, because
> the code presumably works for any value in [0, 999] range, right? So
> maybe in addition to marking it precise and keeping i's range estimate
> the same, we should extend this range according to the inequality condition?
>
> That is, even if we know on the first iteration that i is 0 (!precise),
> when we simulate this conditional jump instruction, adjust i's range
> to be [0, 999] (precise) in the fallthrough branch, and [1000,
> U64_MAX] in the true branch?
>
> I.e., make conditional jumps into "range widening" instructions?
>
> Have you thought about this approach? Do you think it will work in
> practice? I'm sure it can't be as simple as that, but still, it's worth
> considering. Curious to hear Eduard's opinion as well; he's dealt
> with this a lot in the past.

I looked into doing exactly that [0, 999] and [1000, max],
then on the next iteration i+=1 insn will adjust it to
[1, 1000], but the next i < 1000 will widen it back to
[0, 999] and the state equivalence will be happy.
But my excitement was short lived, since both gcc and llvm
optimize the loop exit condition to !=
and they do it in the middle end.
Backends cannot influence this optimization.
I don't think it's practical to undo it in the backend.
So most of the loops written as:
for (i = 0; i < 1000; i++)
are compiled as
for (i = 0; i != 1000; i++)
for x86, arm, bpf, etc.

so if there is arr[i] inside the loop, the verifier has to rely on
bounded loop logic and check i=0, 1, 2, ... 999 one by one, since
nothing else inside the loop makes the array index bounded.

Another small obstacle is that we don't have [!=const] range,
so i != 100 cannot be widened into [100] and [!=100].
We can add that without too much trouble.
But it won't help this arr[i] case anyway.

We can make i != 100 to be [unknown] and [unknown].
It's bad for arr[i] too, but fine when arr is an arena pointer.
Unfortunately at the time of the 'if' we don't know what comes later.
If the verifier knew that it's only dealing with arena pointers
it could disable precision altogether.

So I went with conditional disable of is_branch_taken + mark_precise and
surprisingly it didn't break any tests.

...

> > copy_precision() is a hack, of course, to demonstrate an idea.

btw I think I know of a better way of doing copy_precision().
So don't pay much attention to it.

> >
> > Signed-off-by: Alexei Starovoitov <ast@kernel.org>
> > ---
>
> There is a lot to think about here, I'll try to get to this
> today/tomorrow. But for now veristat is concerned about this change
> ([0]):
>
> |File                              |Program                          |Verdict                |States Diff (%)|
> |----------------------------------|---------------------------------|-----------------------|---------------|
> |arena_htab_asm.bpf.o              |arena_htab_asm                   |success                |-80.91 %       |
> |core_kern.bpf.o                   |balancer_ingress                 |success -> failure (!!)|+0.00 %        |
> |dynptr_success.bpf.o              |test_read_write                  |success -> failure (!!)|+0.00 %        |
> |iters.bpf.o                       |checkpoint_states_deletion       |success -> failure (!!)|+0.00 %        |
> |iters.bpf.o                       |iter_multiple_sequential_loops   |success                |-11.43 %       |
> |iters.bpf.o                       |iter_obfuscate_counter           |success                |+30.00 %       |
> |iters.bpf.o                       |iter_pragma_unroll_loop          |success                |-23.08 %       |
> |iters.bpf.o                       |iter_subprog_iters               |success                |+1.14 %        |
> |iters.bpf.o                       |loop_state_deps1                 |failure                |+7.14 %        |
> |iters.bpf.o                       |loop_state_deps2                 |failure                |-2.17 %        |
> |iters_task_vma.bpf.o              |iter_task_vma_for_each           |success -> failure (!!)|+99.20 %       |
> |linked_list.bpf.o                 |global_list_push_pop_multiple    |success -> failure (!!)|+0.00 %        |
> |linked_list.bpf.o                 |inner_map_list_push_pop_multiple |success -> failure (!!)|+0.00 %        |
> |linked_list.bpf.o                 |map_list_push_pop_multiple       |success -> failure (!!)|+0.00 %        |
> |test_seg6_loop.bpf.o              |__add_egr_x                      |success -> failure (!!)|+0.00 %        |
> |test_sysctl_loop1.bpf.o           |sysctl_tcp_mem                   |success -> failure (!!)|+0.00 %        |
> |test_sysctl_loop2.bpf.o           |sysctl_tcp_mem                   |success -> failure (!!)|+0.00 %        |
> |test_verif_scale2.bpf.o           |balancer_ingress                 |success -> failure (!!)|+0.00 %        |
> |verifier_bounds.bpf.o             |bound_greater_than_u32_max       |success -> failure (!!)|+0.00 %        |

That was due to veristat being picky ;)
Extra verbose() output in the verifier, not gated by log_level,
didn't fit in the 64k veristat log buffer, and the ENOSPC turned into a failure.

> |verifier_bounds.bpf.o             |crossing_32_bit_signed_boundary_2|success -> failure (!!)|+0.00 %        |
> |verifier_bounds.bpf.o             |crossing_64_bit_signed_boundary_2|success -> failure (!!)|+0.00 %        |
> |verifier_iterating_callbacks.bpf.o|cond_break2                      |success                |+75.00 %       |
> |verifier_iterating_callbacks.bpf.o|cond_break3                      |success                |+66.67 %       |
> |verifier_iterating_callbacks.bpf.o|cond_break4                      |success                |+300.00 %      |
> |verifier_iterating_callbacks.bpf.o|cond_break5                      |success                |+266.67 %      |

This is expected, since the tests changed.
In this case the 300% regression is from 1 state to 3 states,
and from 10 to 21 states, ...
We should probably print absolute state values in veristat CI.
Alexei Starovoitov May 22, 2024, 9:13 p.m. UTC | #4
On Wed, May 22, 2024 at 1:18 PM Eduard Zingerman <eddyz87@gmail.com> wrote:
>
> On Tue, 2024-05-21 at 19:47 -0700, Alexei Starovoitov wrote:
>
> [...]
>
> Regarding this part, since we discussed it off-list
> (I'll continue with the rest of the patch a bit later).
>
> > First of all:
> >    if (is_may_goto_insn_at(env, insn_idx)) {
> > +          update_loop_entry(cur, &sl->state);
> >            if (states_equal(env, &sl->state, cur, RANGE_WITHIN)) {
> > -                  update_loop_entry(cur, &sl->state);
> >
> > This should be correct, since reaching the same insn should
> > satisfy "if h1 in path" requirement of update_loop_entry() algorithm.
> > It's too conservative to update loop_entry only on a state match.
>
> So, this basically changes the definition of the verifier states loop.
> Previously, we considered a state loop to be such a sequence of states
> Si -> ... -> Sj -> ... -> Sk that states_equal(Si, Sk, RANGE_WITHIN)
> is true.
>
> With this change Si -> ... -> Sj -> ... -> Sk is a loop if call sites and
> instruction pointers for Si and Sk match.
>
> Whether or not Si and Sk are in the loop influences two things:
> (a) if exact comparison is needed for states cache;
> (b) if widening transformation could be applied to some scalars.
>
> As far as I understand, all pairs (Si, Sk) marked as a loop using old
> definition would be marked as such using new definition
> (in addition to some new pairs).
>
> I think that it is safe to apply (a) and (b) in strictly more cases.

Agree with this conclusion.
As discussed offlist we can add a check that
Si->parent->parent...->parent == Sk
to make the algorithm "by the book".
I'll play with that.
Eduard Zingerman May 22, 2024, 9:54 p.m. UTC | #5
On Wed, 2024-05-22 at 14:13 -0700, Alexei Starovoitov wrote:

[...]

> Agree with this conclusion.
> As discussed offlist we can add a check that
> Si->parent->parent...->parent == Sk
> to make the algorithm "by the book".
> I'll play with that.

Actually, I don't think this is necessary, here is the code for
update_loop_entry():

    static void update_loop_entry(struct bpf_verifier_state *cur,
                                  struct bpf_verifier_state *hdr)
    {
            struct bpf_verifier_state *cur1, *hdr1;

            cur1 = get_loop_entry(cur) ?: cur;
            hdr1 = get_loop_entry(hdr) ?: hdr;
            if (hdr1->branches && hdr1->dfs_depth <= cur1->dfs_depth) {
                    cur->loop_entry = hdr;
                    hdr->used_as_loop_entry = true;
            }
    }
    
It relies on the following properties:
- every state in the current DFS path (except current)
  has branches > 0;
- states not in the DFS path are either:
  - in explored_states, are fully explored and have branches == 0;
  - in env->stack, are not yet explored and have branches == 0
    (and also not reachable from is_state_visited()).

So, I don't think there is a need to check that hdr1 is in the parent
chain for cur1.
Eduard Zingerman May 23, 2024, 12:02 a.m. UTC | #6
On Tue, 2024-05-21 at 19:47 -0700, Alexei Starovoitov wrote:

[...]

> Skipping the precision mark at if (i > 1000) keeps 'i' imprecise,
> but arr[i] will mark 'i' as precise anyway, because 'arr' is a map.
> On the next iteration of the loop the patch does copy_precision(),
> which copies precision markings from the top of the loop into the next
> state of the loop. So on the next iteration 'i' will be seen as precise.

Could you please elaborate a bit on why copy_precision() is necessary?
In general, the main idea of the patch is to skip precision marks in
certain cases, meaning that strictly more branches would be explored,
and it does not seem that copy_precision() is needed for safety reasons.

I tried turning copy_precision() off and saw a single test failing:

    $ ./test_progs -vvv -a iters/task_vma
    ...
    ; bpf_for_each(task_vma, vma, task, 0) { @ iters_task_vma.c:30
    35: (55) if r0 != 0x0 goto pc-15 21: R0_w=ptr_vm_area_struct(id=1002) R6=1000 R10=fp0 fp-8=iter_task_vma(ref_id=1,state=active,depth=1001) refs=1
    ; if (bpf_cmp_unlikely(seen, >=, 1000)) @ iters_task_vma.c:31
    21: (35) if r6 >= 0x3e8 goto pc+14    ; R6=1000 refs=1
    ; vm_ranges[seen].vm_start = vma->vm_start; @ iters_task_vma.c:34
    22: (bf) r1 = r6
    REG INVARIANTS VIOLATION (alu): range bounds violation u64=[0x3e8, 0x3e7] s64=[0x3e8, 0x3e7] u32=[0x3e8, 0x3e7] s32=[0x3e8, 0x3e8] var_off=(0x3e8, 0x0)
    23: R1_w=1000 R6=1000 refs=1
    23: (67) r1 <<= 4                     ; R1_w=16000 refs=1
    24: (18) r2 = 0xffffc90000342008      ; R2_w=map_value(map=iters_ta.bss,ks=4,vs=16008,off=8) refs=1
    26: (0f) r2 += r1                     ; R1_w=16000 R2_w=map_value(map=iters_ta.bss,ks=4,vs=16008,off=16008) refs=1
    27: (79) r1 = *(u64 *)(r0 +0)         ; R0_w=ptr_vm_area_struct(id=1002) R1_w=scalar() refs=1
    28: (7b) *(u64 *)(r2 +0) = r1
    invalid access to map value, value_size=16008 off=16008 size=8
    R2 min value is outside of the allowed memory range
    processed 27035 insns (limit 1000000) max_states_per_insn 65 total_states 2003 peak_states 1008 mark_read 2

I wonder, what if we forgo the 'ignore_bad_range' flag and instead
consider branches with invalid range as impossible?
E.g. when pred == -2. Or when prediction says that branch would be
taken and another branch leads to invalid range.
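
A rough sketch of what I mean (hypothetical, untested):

    err = reg_bounds_sanity_check(env, true_reg1, "true_reg1");
    if (err)
            /* contradictory range => the true branch is impossible,
             * keep only the fallthrough branch
             */
            pred = 0;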

I'll give it a try later this evening, but still curious about the
reasoning behind copy_precision().

[...]
Alexei Starovoitov May 23, 2024, 2:31 a.m. UTC | #7
On Wed, May 22, 2024 at 5:02 PM Eduard Zingerman <eddyz87@gmail.com> wrote:
>
> On Tue, 2024-05-21 at 19:47 -0700, Alexei Starovoitov wrote:
>
> [...]
>
> > Skipping the precision mark at if (i > 1000) keeps 'i' imprecise,
> > but arr[i] will mark 'i' as precise anyway, because 'arr' is a map.
> > On the next iteration of the loop the patch does copy_precision(),
> > which copies precision markings from the top of the loop into the next
> > state of the loop. So on the next iteration 'i' will be seen as precise.
>
> Could you please elaborate a bit on why copy_precision() is necessary?
> In general, the main idea of the patch is to skip precision marks in
> certain cases, meaning that strictly more branches would be explored,
> and it does not seem that copy_precision() is needed for safety reasons.
>
> I tried turning copy_precision() off and saw a single test failing:
>
[...]

Exactly. It's failing without copy_precision().
I didn't add an additional selftest, but I tried to explain
the idea in the commit log.

That's the case where the verifier processes an open coded iter
with bounded loop logic.
Try:
-               if (bpf_cmp_unlikely(seen, >=, 1000))
+               if (bpf_cmp_unlikely(seen, ==, 1000))

and it will also pass.
In both cases there will be 27038 processed insn.
All 1k iterations will be checked by the verifier.

That's the main idea.
Let users write normal loops: for (i=0; i < 1000 && can_loop; ...)
and if there is no map value access through 'i' the loop
will converge quickly as may_goto.
If not, bounded loop logic will kick in for arr[i],
just like it did for vm_ranges[seen] in this test.

> I wonder, what if we forgo the 'ignore_bad_range' flag and instead
> consider branches with invalid range as impossible?
> E.g. when pred == -2. Or when prediction says that branch would be
> taken and another branch leads to invalid range.

That sounds pretty much like is_branch_taken().
If we don't explore the other branch, we have to mark the register as precise.

> I'll give it a try later this evening, but still curious about the
> reasoning behind copy_precision().

copy_precision() is kinda the main idea.
Once a loop iteration copies precision from the loop header,
the if (dst_reg->precise || src_reg->precise) logic kicks in,
is_branch_taken() starts to work, and it becomes bounded-loop-like,
where every iteration's states are different.
Alexei Starovoitov May 24, 2024, 4:19 a.m. UTC | #8
On Wed, May 22, 2024 at 2:09 PM Alexei Starovoitov
<alexei.starovoitov@gmail.com> wrote:
>
> On Wed, May 22, 2024 at 10:33 AM Andrii Nakryiko
> <andrii.nakryiko@gmail.com> wrote:
[...]
>
> I looked into doing exactly that [0, 999] and [1000, max],
> then on the next iteration i+=1 insn will adjust it to
> [1, 1000], but the next i < 1000 will widen it back to
> [0, 999] and the state equivalence will be happy.
> But my excitement was short lived, since both gcc and llvm
> optimize the loop exit condition to !=
> and they do it in the middle end.
> Backends cannot influence this optimization.
> I don't think it's practical to undo it in the backend.
> So most of the loops written as:
> for (i = 0; i < 1000; i++)
> are compiled as
> for (i = 0; i != 1000; i++)
> for x86, arm, bpf, etc.
>
> so if there is arr[i] inside the loop, the verifier has to rely on
> bounded loop logic and check i=0, 1, 2, ... 999 one by one, since
> nothing else inside the loop makes the array index bounded.

So I've decided to implement this idea anyway to see whether
it can be salvaged for some cases.
It turned out that it's not that bad. With an extra heuristic
it's almost working. Stay tuned.

Patch

diff --git a/kernel/bpf/verifier.c b/kernel/bpf/verifier.c
index 77da1f438bec..7a1606ccf692 100644
--- a/kernel/bpf/verifier.c
+++ b/kernel/bpf/verifier.c
@@ -14882,7 +14882,7 @@  static int reg_set_min_max(struct bpf_verifier_env *env,
 			   struct bpf_reg_state *true_reg2,
 			   struct bpf_reg_state *false_reg1,
 			   struct bpf_reg_state *false_reg2,
-			   u8 opcode, bool is_jmp32)
+			   u8 opcode, bool is_jmp32, bool ignore_bad_range)
 {
 	int err;
 
@@ -14903,6 +14903,8 @@  static int reg_set_min_max(struct bpf_verifier_env *env,
 	reg_bounds_sync(true_reg1);
 	reg_bounds_sync(true_reg2);
 
+	if (ignore_bad_range)
+		return 0;
 	err = reg_bounds_sanity_check(env, true_reg1, "true_reg1");
 	err = err ?: reg_bounds_sanity_check(env, true_reg2, "true_reg2");
 	err = err ?: reg_bounds_sanity_check(env, false_reg1, "false_reg1");
@@ -15177,7 +15179,11 @@  static int check_cond_jmp_op(struct bpf_verifier_env *env,
 	}
 
 	is_jmp32 = BPF_CLASS(insn->code) == BPF_JMP32;
-	pred = is_branch_taken(dst_reg, src_reg, opcode, is_jmp32);
+	if (!get_loop_entry(this_branch) || src_reg->precise || dst_reg->precise ||
+	    (BPF_SRC(insn->code) == BPF_K && insn->imm == 0))
+		pred = is_branch_taken(dst_reg, src_reg, opcode, is_jmp32);
+	else
+		pred = -2;
 	if (pred >= 0) {
 		/* If we get here with a dst_reg pointer type it is because
 		 * above is_branch_taken() special cased the 0 comparison.
@@ -15229,13 +15235,13 @@  static int check_cond_jmp_op(struct bpf_verifier_env *env,
 		err = reg_set_min_max(env,
 				      &other_branch_regs[insn->dst_reg],
 				      &other_branch_regs[insn->src_reg],
-				      dst_reg, src_reg, opcode, is_jmp32);
+				      dst_reg, src_reg, opcode, is_jmp32, pred == -2);
 	} else /* BPF_SRC(insn->code) == BPF_K */ {
 		err = reg_set_min_max(env,
 				      &other_branch_regs[insn->dst_reg],
 				      src_reg /* fake one */,
 				      dst_reg, src_reg /* same fake one */,
-				      opcode, is_jmp32);
+				      opcode, is_jmp32, pred == -2);
 	}
 	if (err)
 		return err;
@@ -17217,6 +17223,81 @@  static int propagate_precision(struct bpf_verifier_env *env,
 	return 0;
 }
 
+static void __copy_precision(struct bpf_verifier_env *env,
+			     struct bpf_verifier_state *cur,
+			     const struct bpf_verifier_state *old)
+{
+	struct bpf_reg_state *state_reg;
+	struct bpf_func_state *state, *cur_fr;
+	int i, fr;
+	bool first;
+
+	for (fr = min(cur->curframe, old->curframe); fr >= 0; fr--) {
+		state = old->frame[fr];
+		cur_fr = cur->frame[fr];
+		state_reg = state->regs;
+		first = true;
+		verbose(env, "XX old state:");
+		print_verifier_state(env, state, true);
+		for (i = 0; i < BPF_REG_FP; i++, state_reg++) {
+			if (state_reg->type != SCALAR_VALUE ||
+			    !state_reg->precise ||
+			    !(state_reg->live & REG_LIVE_READ))
+				continue;
+			if (env->log.level & BPF_LOG_LEVEL2) {
+				if (first)
+					verbose(env, "XX frame %d: propagating r%d", fr, i);
+				else
+					verbose(env, ",r%d", i);
+			}
+			cur_fr->regs[i].precise = true;
+			first = false;
+		}
+
+		for (i = 0; i < min(cur_fr->allocated_stack, state->allocated_stack) / BPF_REG_SIZE; i++) {
+			if (!is_spilled_reg(&state->stack[i]))
+				continue;
+			state_reg = &state->stack[i].spilled_ptr;
+			if (state_reg->type != SCALAR_VALUE ||
+			    !state_reg->precise ||
+			    !(state_reg->live & REG_LIVE_READ))
+				continue;
+			if (env->log.level & BPF_LOG_LEVEL2) {
+				if (first)
+					verbose(env, "XX frame %d: propagating fp%d",
+						fr, (-i - 1) * BPF_REG_SIZE);
+				else
+					verbose(env, ",fp%d", (-i - 1) * BPF_REG_SIZE);
+			}
+			cur_fr->stack[i].spilled_ptr.precise = true;
+			first = false;
+		}
+		if (!first)
+			verbose(env, "\n");
+	}
+}
+
+static void copy_precision(struct bpf_verifier_env *env,
+			   struct bpf_verifier_state *cur,
+			   const struct bpf_verifier_state *old)
+{
+	if (!old)
+		return;
+	/*
+	 * parent state unlikely to have precise registers
+	 * due to mark_all_scalars_imprecise(), but let's try anyway.
+	 */
+	__copy_precision(env, cur, old);
+	old = old->parent;
+	if (!old)
+		return;
+	/*
+	 * This one might have precise scalars, since precision propagation
+	 * from array access will mark them in the parent.
+	 */
+	__copy_precision(env, cur, old);
+}
+
 static bool states_maybe_looping(struct bpf_verifier_state *old,
 				 struct bpf_verifier_state *cur)
 {
@@ -17409,6 +17490,7 @@  static int is_state_visited(struct bpf_verifier_env *env, int insn_idx)
 			 * => unsafe memory access at 11 would not be caught.
 			 */
 			if (is_iter_next_insn(env, insn_idx)) {
+				update_loop_entry(cur, &sl->state);
 				if (states_equal(env, &sl->state, cur, RANGE_WITHIN)) {
 					struct bpf_func_state *cur_frame;
 					struct bpf_reg_state *iter_state, *iter_reg;
@@ -17426,15 +17508,14 @@  static int is_state_visited(struct bpf_verifier_env *env, int insn_idx)
 					spi = __get_spi(iter_reg->off + iter_reg->var_off.value);
 					iter_state = &func(env, iter_reg)->stack[spi].spilled_ptr;
 					if (iter_state->iter.state == BPF_ITER_STATE_ACTIVE) {
-						update_loop_entry(cur, &sl->state);
 						goto hit;
 					}
 				}
 				goto skip_inf_loop_check;
 			}
 			if (is_may_goto_insn_at(env, insn_idx)) {
+				update_loop_entry(cur, &sl->state);
 				if (states_equal(env, &sl->state, cur, RANGE_WITHIN)) {
-					update_loop_entry(cur, &sl->state);
 					goto hit;
 				}
 				goto skip_inf_loop_check;
@@ -18066,6 +18147,7 @@  static int do_check(struct bpf_verifier_env *env)
 						return err;
 					break;
 				} else {
+					copy_precision(env, env->cur_state, env->cur_state->parent);
 					do_print_state = true;
 					continue;
 				}
diff --git a/tools/testing/selftests/bpf/progs/arena_htab.c b/tools/testing/selftests/bpf/progs/arena_htab.c
index 1e6ac187a6a0..ac45700ca5ad 100644
--- a/tools/testing/selftests/bpf/progs/arena_htab.c
+++ b/tools/testing/selftests/bpf/progs/arena_htab.c
@@ -18,24 +18,31 @@  void __arena *htab_for_user;
 bool skip = false;
 
 int zero = 0;
+char __arena arr1[100000]; /* works */
+char arr2[100000]; /* runs into 1M limit */
 
 SEC("syscall")
 int arena_htab_llvm(void *ctx)
 {
 #if defined(__BPF_FEATURE_ADDR_SPACE_CAST) || defined(BPF_ARENA_FORCE_ASM)
 	struct htab __arena *htab;
+	char __arena *arr = arr1;
 	__u64 i;
 
 	htab = bpf_alloc(sizeof(*htab));
 	cast_kern(htab);
 	htab_init(htab);
 
+	cast_kern(arr);
+
 	/* first run. No old elems in the table */
-	for (i = zero; i < 1000; i++)
+	for (i = 0; i < 100000 && can_loop; i++) {
 		htab_update_elem(htab, i, i);
+		arr[i] = i;
+	}
 
 	/* should replace all elems with new ones */
-	for (i = zero; i < 1000; i++)
+	for (i = 0; i < 100000 && can_loop; i++)
 		htab_update_elem(htab, i, i);
 	cast_user(htab);
 	htab_for_user = htab;
diff --git a/tools/testing/selftests/bpf/progs/iters.c b/tools/testing/selftests/bpf/progs/iters.c
index fe65e0952a1e..dfc2c9cc0529 100644
--- a/tools/testing/selftests/bpf/progs/iters.c
+++ b/tools/testing/selftests/bpf/progs/iters.c
@@ -188,6 +188,8 @@  int iter_pragma_unroll_loop(const void *ctx)
 	for (i = 0; i < 3; i++) {
 		v = bpf_iter_num_next(&it);
 		bpf_printk("ITER_BASIC: E3 VAL: i=%d v=%d", i, v ? *v : -1);
+		if (!v)
+			break;
 	}
 	bpf_iter_num_destroy(&it);
 
@@ -243,6 +245,8 @@  int iter_multiple_sequential_loops(const void *ctx)
 	for (i = 0; i < 3; i++) {
 		v = bpf_iter_num_next(&it);
 		bpf_printk("ITER_BASIC: E3 VAL: i=%d v=%d", i, v ? *v : -1);
+		if (!v)
+			break;
 	}
 	bpf_iter_num_destroy(&it);
 
@@ -291,10 +295,7 @@  int iter_obfuscate_counter(const void *ctx)
 {
 	struct bpf_iter_num it;
 	int *v, sum = 0;
-	/* Make i's initial value unknowable for verifier to prevent it from
-	 * pruning if/else branch inside the loop body and marking i as precise.
-	 */
-	int i = zero;
+	int i = 0;
 
 	MY_PID_GUARD();
 
@@ -304,15 +305,6 @@  int iter_obfuscate_counter(const void *ctx)
 
 		i += 1;
 
-		/* If we initialized i as `int i = 0;` above, verifier would
-		 * track that i becomes 1 on first iteration after increment
-		 * above, and here verifier would eagerly prune else branch
-		 * and mark i as precise, ruining open-coded iterator logic
-		 * completely, as each next iteration would have a different
-		 * *precise* value of i, and thus there would be no
-		 * convergence of state. This would result in reaching maximum
-		 * instruction limit, no matter what the limit is.
-		 */
 		if (i == 1)
 			x = 123;
 		else
diff --git a/tools/testing/selftests/bpf/progs/verifier_iterating_callbacks.c b/tools/testing/selftests/bpf/progs/verifier_iterating_callbacks.c
index bd676d7e615f..bd45a328fa85 100644
--- a/tools/testing/selftests/bpf/progs/verifier_iterating_callbacks.c
+++ b/tools/testing/selftests/bpf/progs/verifier_iterating_callbacks.c
@@ -318,8 +318,11 @@  int cond_break1(const void *ctx)
 	unsigned long i;
 	unsigned int sum = 0;
 
-	for (i = zero; i < ARR_SZ && can_loop; i++)
+	/* i = 0 is ok here, since i is not used in memory access */
+	for (i = 0; i < ARR_SZ && can_loop; i++)
 		sum += i;
+
+	/* have to use i = zero due to arr[i] where arr is not an arena */
 	for (i = zero; i < ARR_SZ; i++) {
 		barrier_var(i);
 		sum += i + arr[i];
@@ -336,8 +339,8 @@  int cond_break2(const void *ctx)
 	int i, j;
 	int sum = 0;
 
-	for (i = zero; i < 1000 && can_loop; i++)
-		for (j = zero; j < 1000; j++) {
+	for (i = 0; i < 1000 && can_loop; i++)
+		for (j = 0; j < 1000; j++) {
 			sum += i + j;
 			cond_break;
 	}
@@ -348,7 +351,7 @@  static __noinline int loop(void)
 {
 	int i, sum = 0;
 
-	for (i = zero; i <= 1000000 && can_loop; i++)
+	for (i = 0; i <= 1000000 && can_loop; i++)
 		sum += i;
 
 	return sum;
@@ -365,7 +368,7 @@  SEC("socket")
 __success __retval(1)
 int cond_break4(const void *ctx)
 {
-	int cnt = zero;
+	int cnt = 0;
 
 	for (;;) {
 		/* should eventually break out of the loop */
@@ -378,7 +381,7 @@  int cond_break4(const void *ctx)
 
 static __noinline int static_subprog(void)
 {
-	int cnt = zero;
+	int cnt = 0;
 
 	for (;;) {
 		cond_break;
@@ -392,7 +395,7 @@  SEC("socket")
 __success __retval(1)
 int cond_break5(const void *ctx)
 {
-	int cnt1 = zero, cnt2;
+	int cnt1 = 0, cnt2;
 
 	for (;;) {
 		cond_break;