
[bpf-next,04/10] bpf: remember if bpf_map was unprivileged and use that consistently

Message ID: 20230502230619.2592406-5-andrii@kernel.org (mailing list archive)
State: Changes Requested
Delegated to: BPF
Series: Centralize BPF permission checks

Checks

Context Check Description
netdev/series_format success Posting correctly formatted
netdev/tree_selection success Clearly marked for bpf-next, async
netdev/fixes_present success Fixes tag not required for -next series
netdev/header_inline success No static functions without inline keyword in header files
netdev/build_32bit success Errors and warnings before: 1471 this patch: 1471
netdev/cc_maintainers warning 8 maintainers not CCed: yhs@fb.com kpsingh@kernel.org martin.lau@linux.dev john.fastabend@gmail.com song@kernel.org sdf@google.com jolsa@kernel.org haoluo@google.com
netdev/build_clang success Errors and warnings before: 175 this patch: 175
netdev/verify_signedoff success Signed-off-by tag matches author and committer
netdev/deprecated_api success None detected
netdev/check_selftest success No net selftest shell script
netdev/verify_fixes success No Fixes tag
netdev/build_allmodconfig_warn success Errors and warnings before: 1465 this patch: 1465
netdev/checkpatch warning WARNING: line length of 82 exceeds 80 columns; WARNING: line length of 86 exceeds 80 columns
netdev/kdoc success Errors and warnings before: 0 this patch: 0
netdev/source_inline success Was 0 now: 0
bpf/vmtest-bpf-next-PR success PR summary
bpf/vmtest-bpf-next-VM_Test-1 success Logs for ShellCheck
bpf/vmtest-bpf-next-VM_Test-2 success Logs for build for aarch64 with gcc
bpf/vmtest-bpf-next-VM_Test-3 success Logs for build for aarch64 with llvm-17
bpf/vmtest-bpf-next-VM_Test-5 success Logs for build for x86_64 with gcc
bpf/vmtest-bpf-next-VM_Test-6 success Logs for build for x86_64 with llvm-16
bpf/vmtest-bpf-next-VM_Test-7 success Logs for set-matrix
bpf/vmtest-bpf-next-VM_Test-4 success Logs for build for s390x with gcc
bpf/vmtest-bpf-next-VM_Test-8 success Logs for test_maps on aarch64 with gcc
bpf/vmtest-bpf-next-VM_Test-9 success Logs for test_maps on aarch64 with llvm-17
bpf/vmtest-bpf-next-VM_Test-11 success Logs for test_maps on x86_64 with gcc
bpf/vmtest-bpf-next-VM_Test-12 success Logs for test_maps on x86_64 with llvm-16
bpf/vmtest-bpf-next-VM_Test-13 success Logs for test_progs on aarch64 with gcc
bpf/vmtest-bpf-next-VM_Test-14 success Logs for test_progs on aarch64 with llvm-17
bpf/vmtest-bpf-next-VM_Test-16 success Logs for test_progs on x86_64 with gcc
bpf/vmtest-bpf-next-VM_Test-17 success Logs for test_progs on x86_64 with llvm-16
bpf/vmtest-bpf-next-VM_Test-18 success Logs for test_progs_no_alu32 on aarch64 with gcc
bpf/vmtest-bpf-next-VM_Test-19 success Logs for test_progs_no_alu32 on aarch64 with llvm-17
bpf/vmtest-bpf-next-VM_Test-21 success Logs for test_progs_no_alu32 on x86_64 with gcc
bpf/vmtest-bpf-next-VM_Test-22 success Logs for test_progs_no_alu32 on x86_64 with llvm-16
bpf/vmtest-bpf-next-VM_Test-23 success Logs for test_progs_no_alu32_parallel on aarch64 with gcc
bpf/vmtest-bpf-next-VM_Test-24 success Logs for test_progs_no_alu32_parallel on aarch64 with llvm-17
bpf/vmtest-bpf-next-VM_Test-25 success Logs for test_progs_no_alu32_parallel on x86_64 with gcc
bpf/vmtest-bpf-next-VM_Test-26 success Logs for test_progs_no_alu32_parallel on x86_64 with llvm-16
bpf/vmtest-bpf-next-VM_Test-27 success Logs for test_progs_parallel on aarch64 with gcc
bpf/vmtest-bpf-next-VM_Test-28 success Logs for test_progs_parallel on aarch64 with llvm-17
bpf/vmtest-bpf-next-VM_Test-29 success Logs for test_progs_parallel on x86_64 with gcc
bpf/vmtest-bpf-next-VM_Test-30 success Logs for test_progs_parallel on x86_64 with llvm-16
bpf/vmtest-bpf-next-VM_Test-31 success Logs for test_verifier on aarch64 with gcc
bpf/vmtest-bpf-next-VM_Test-32 success Logs for test_verifier on aarch64 with llvm-17
bpf/vmtest-bpf-next-VM_Test-34 success Logs for test_verifier on x86_64 with gcc
bpf/vmtest-bpf-next-VM_Test-35 success Logs for test_verifier on x86_64 with llvm-16
bpf/vmtest-bpf-next-VM_Test-36 success Logs for veristat
bpf/vmtest-bpf-next-VM_Test-33 success Logs for test_verifier on s390x with gcc
bpf/vmtest-bpf-next-VM_Test-20 fail Logs for test_progs_no_alu32 on s390x with gcc
bpf/vmtest-bpf-next-VM_Test-15 success Logs for test_progs on s390x with gcc
bpf/vmtest-bpf-next-VM_Test-10 success Logs for test_maps on s390x with gcc

Commit Message

Andrii Nakryiko May 2, 2023, 11:06 p.m. UTC
While some maps can be created without extra privileges (like CAP_BPF
and/or CAP_NET_ADMIN), some parts of the BPF verifier logic do care
whether a given map was created with privileges.

One such place is the Spectre v1 mitigation in ARRAY maps. Another is
the extra restrictions on maps with special embedded BTF-defined fields
(like spin locks).
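
For illustration, a map value type carrying such a BTF-defined field
might look like this (an illustrative sketch, not taken from this
patch; struct bpf_spin_lock is the standard UAPI type):

struct map_value {
        struct bpf_spin_lock lock;      /* BTF-defined special field */
        long payload;
};

Creating a map whose value type embeds such a field is what triggers
the extra privilege check in map_check_btf().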

So record whether a map was privileged or not at creation time and
don't recheck bpf_capable() afterwards.

Handling the Spectre v1 mitigation in ARRAY-like maps required a bit of
code acrobatics: the size adjustment is extracted into a separate
bpf_array_adjust_for_spec_v1() helper, which is called explicitly for
ARRAY, PERCPU_ARRAY, PROG_ARRAY, CGROUP_ARRAY, PERF_EVENT_ARRAY and
ARRAY_OF_MAPS to adjust the passed-in bpf_attr, because these
adjustments have to happen before map creation. An alternative would be
to extend the map_ops->map_alloc() callback with an unprivileged flag,
which seemed excessive, as no other map cares, but could be done if preferred.
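
To make the adjustment concrete, here is a minimal userspace model of
the rounding the helper performs (a sketch mirroring the kernel logic
in the patch below, assuming max_entries >= 1 as enforced earlier by
array_map_alloc_check()):

#include <stdint.h>

/* Round max_entries up to the next power of two so that speculative
 * accesses masked with "index & (max_entries - 1)" stay in bounds.
 */
static int adjust_for_spec_v1(uint32_t *max_entries)
{
        uint64_t mask = *max_entries - 1;

        /* smear the top set bit down; same result as
         * roundup_pow_of_two(*max_entries) - 1 for u32 inputs
         */
        mask |= mask >> 1;
        mask |= mask >> 2;
        mask |= mask >> 4;
        mask |= mask >> 8;
        mask |= mask >> 16;

        if ((uint32_t)mask + 1 < *max_entries)
                return -1; /* overflows u32; the kernel returns -E2BIG */

        *max_entries = (uint32_t)mask + 1; /* e.g., 5 -> 8, 1000 -> 1024 */
        return 0;
}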

But once the map->unpriv flag is recorded, handling BTF-defined fields
was a breeze: the bpf_capable() check buried deeper in the
map_check_btf() validation logic is simply dropped.

One extra bit that required consideration was remembering the
unprivileged flag when dealing with map-in-maps and taking it into
account when checking map metadata compatibility. Given that the
unprivileged flag is important for correctness, it should be taken into
account just like key and value sizes.
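
The resulting compatibility rule is sketched below (a condensed model
of the bpf_map_meta_equal() change in the patch; the real check also
compares the maps' BTF records):

static bool inner_maps_compatible(const struct bpf_map *meta0,
                                  const struct bpf_map *meta1)
{
        return meta0->map_type == meta1->map_type &&
               meta0->key_size == meta1->key_size &&
               meta0->value_size == meta1->value_size &&
               meta0->map_flags == meta1->map_flags &&
               meta0->unpriv == meta1->unpriv;
}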

Signed-off-by: Andrii Nakryiko <andrii@kernel.org>
---
 include/linux/bpf.h     |  4 ++-
 kernel/bpf/arraymap.c   | 59 ++++++++++++++++++++++++-----------------
 kernel/bpf/map_in_map.c |  3 ++-
 kernel/bpf/syscall.c    | 25 +++++++++++++++--
 kernel/bpf/verifier.c   |  6 ++---
 5 files changed, 65 insertions(+), 32 deletions(-)

Comments

Alexei Starovoitov May 4, 2023, 8:05 p.m. UTC | #1
On Tue, May 02, 2023 at 04:06:13PM -0700, Andrii Nakryiko wrote:
>  }
>  
> -static struct bpf_map *array_map_alloc(union bpf_attr *attr)
> +static u32 array_index_mask(u32 max_entries)
>  {
> -	bool percpu = attr->map_type == BPF_MAP_TYPE_PERCPU_ARRAY;
> -	int numa_node = bpf_map_attr_numa_node(attr);
> -	u32 elem_size, index_mask, max_entries;
> -	bool bypass_spec_v1 = bpf_bypass_spec_v1();

static inline bool bpf_bypass_spec_v1(void)
{
        return perfmon_capable();
}

> +		/* unprivileged is OK, but we still record if we had CAP_BPF */
> +		unpriv = !bpf_capable();

map->unpriv flag makes sense as !CAP_BPF,
but it's not equivalent to bpf_bypass_spec_v1.
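
For reference, the two helpers expand to different capability checks
per their kernel definitions at this point (bpf_bypass_spec_v1() as
quoted above; bpf_capable() and perfmon_capable() from
include/linux/capability.h):

static inline bool bpf_capable(void)
{
        return capable(CAP_BPF) || capable(CAP_SYS_ADMIN);
}

static inline bool perfmon_capable(void)
{
        return capable(CAP_PERFMON) || capable(CAP_SYS_ADMIN);
}

So a task with only CAP_BPF passes bpf_capable() but not
bpf_bypass_spec_v1(), and must still get the Spectre v1 index masking.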

>  		break;
>  	default:
>  		WARN(1, "unsupported map type %d", map_type);
>  		return -EPERM;
>  	}
>  
> +	/* ARRAY-like maps have special sizing provisions for mitigating Spectre v1 */
> +	if (unpriv) {
> +		switch (map_type) {
> +		case BPF_MAP_TYPE_ARRAY:
> +		case BPF_MAP_TYPE_PERCPU_ARRAY:
> +		case BPF_MAP_TYPE_PROG_ARRAY:
> +		case BPF_MAP_TYPE_PERF_EVENT_ARRAY:
> +		case BPF_MAP_TYPE_CGROUP_ARRAY:
> +		case BPF_MAP_TYPE_ARRAY_OF_MAPS:
> +			err = bpf_array_adjust_for_spec_v1(attr);
> +			if (err)
> +				return err;
> +			break;
> +		}
> +	}
> +
>  	map = ops->map_alloc(attr);
>  	if (IS_ERR(map))
>  		return PTR_ERR(map);
>  	map->ops = ops;
>  	map->map_type = map_type;
> +	map->unpriv = unpriv;
>  
>  	err = bpf_obj_name_cpy(map->name, attr->map_name,
>  			       sizeof(attr->map_name));
> diff --git a/kernel/bpf/verifier.c b/kernel/bpf/verifier.c
> index ff4a8ab99f08..481aaf189183 100644
> --- a/kernel/bpf/verifier.c
> +++ b/kernel/bpf/verifier.c
> @@ -8731,11 +8731,9 @@ record_func_map(struct bpf_verifier_env *env, struct bpf_call_arg_meta *meta,
>  	}
>  
>  	if (!BPF_MAP_PTR(aux->map_ptr_state))
> -		bpf_map_ptr_store(aux, meta->map_ptr,
> -				  !meta->map_ptr->bypass_spec_v1);
> +		bpf_map_ptr_store(aux, meta->map_ptr, meta->map_ptr->unpriv);
>  	else if (BPF_MAP_PTR(aux->map_ptr_state) != meta->map_ptr)
> -		bpf_map_ptr_store(aux, BPF_MAP_PTR_POISON,
> -				  !meta->map_ptr->bypass_spec_v1);
> +		bpf_map_ptr_store(aux, BPF_MAP_PTR_POISON, meta->map_ptr->unpriv);
>  	return 0;
>  }
>  
> -- 
> 2.34.1
>
Andrii Nakryiko May 4, 2023, 10:51 p.m. UTC | #2
On Thu, May 4, 2023 at 1:05 PM Alexei Starovoitov
<alexei.starovoitov@gmail.com> wrote:
>
> On Tue, May 02, 2023 at 04:06:13PM -0700, Andrii Nakryiko wrote:
> >  }
> >
> > -static struct bpf_map *array_map_alloc(union bpf_attr *attr)
> > +static u32 array_index_mask(u32 max_entries)
> >  {
> > -     bool percpu = attr->map_type == BPF_MAP_TYPE_PERCPU_ARRAY;
> > -     int numa_node = bpf_map_attr_numa_node(attr);
> > -     u32 elem_size, index_mask, max_entries;
> > -     bool bypass_spec_v1 = bpf_bypass_spec_v1();
>
> static inline bool bpf_bypass_spec_v1(void)
> {
>         return perfmon_capable();
> }
>
> > +             /* unprivileged is OK, but we still record if we had CAP_BPF */
> > +             unpriv = !bpf_capable();
>
> map->unpriv flag makes sense as !CAP_BPF,
> but it's not equivalent to bpf_bypass_spec_v1.
>

argh, right, it's perfmon_capable() :(

what do you propose? do bpf_capable and perfmon_capable fields for
each map separately? or keep unpriv and add perfmon_capable
separately? or any better ideas?..


> >               break;
> >       default:
> >               WARN(1, "unsupported map type %d", map_type);
> >               return -EPERM;
> >       }
> >
> > +     /* ARRAY-like maps have special sizing provisions for mitigating Spectre v1 */
> > +     if (unpriv) {
> > +             switch (map_type) {
> > +             case BPF_MAP_TYPE_ARRAY:
> > +             case BPF_MAP_TYPE_PERCPU_ARRAY:
> > +             case BPF_MAP_TYPE_PROG_ARRAY:
> > +             case BPF_MAP_TYPE_PERF_EVENT_ARRAY:
> > +             case BPF_MAP_TYPE_CGROUP_ARRAY:
> > +             case BPF_MAP_TYPE_ARRAY_OF_MAPS:
> > +                     err = bpf_array_adjust_for_spec_v1(attr);
> > +                     if (err)
> > +                             return err;
> > +                     break;
> > +             }
> > +     }
> > +
> >       map = ops->map_alloc(attr);
> >       if (IS_ERR(map))
> >               return PTR_ERR(map);
> >       map->ops = ops;
> >       map->map_type = map_type;
> > +     map->unpriv = unpriv;
> >
> >       err = bpf_obj_name_cpy(map->name, attr->map_name,
> >                              sizeof(attr->map_name));
> > diff --git a/kernel/bpf/verifier.c b/kernel/bpf/verifier.c
> > index ff4a8ab99f08..481aaf189183 100644
> > --- a/kernel/bpf/verifier.c
> > +++ b/kernel/bpf/verifier.c
> > @@ -8731,11 +8731,9 @@ record_func_map(struct bpf_verifier_env *env, struct bpf_call_arg_meta *meta,
> >       }
> >
> >       if (!BPF_MAP_PTR(aux->map_ptr_state))
> > -             bpf_map_ptr_store(aux, meta->map_ptr,
> > -                               !meta->map_ptr->bypass_spec_v1);
> > +             bpf_map_ptr_store(aux, meta->map_ptr, meta->map_ptr->unpriv);
> >       else if (BPF_MAP_PTR(aux->map_ptr_state) != meta->map_ptr)
> > -             bpf_map_ptr_store(aux, BPF_MAP_PTR_POISON,
> > -                               !meta->map_ptr->bypass_spec_v1);
> > +             bpf_map_ptr_store(aux, BPF_MAP_PTR_POISON, meta->map_ptr->unpriv);
> >       return 0;
> >  }
> >
> > --
> > 2.34.1
> >
Alexei Starovoitov May 4, 2023, 10:54 p.m. UTC | #3
On Thu, May 04, 2023 at 03:51:16PM -0700, Andrii Nakryiko wrote:
> On Thu, May 4, 2023 at 1:05 PM Alexei Starovoitov
> <alexei.starovoitov@gmail.com> wrote:
> >
> > On Tue, May 02, 2023 at 04:06:13PM -0700, Andrii Nakryiko wrote:
> > >  }
> > >
> > > -static struct bpf_map *array_map_alloc(union bpf_attr *attr)
> > > +static u32 array_index_mask(u32 max_entries)
> > >  {
> > > -     bool percpu = attr->map_type == BPF_MAP_TYPE_PERCPU_ARRAY;
> > > -     int numa_node = bpf_map_attr_numa_node(attr);
> > > -     u32 elem_size, index_mask, max_entries;
> > > -     bool bypass_spec_v1 = bpf_bypass_spec_v1();
> >
> > static inline bool bpf_bypass_spec_v1(void)
> > {
> >         return perfmon_capable();
> > }
> >
> > > +             /* unprivileged is OK, but we still record if we had CAP_BPF */
> > > +             unpriv = !bpf_capable();
> >
> > map->unpriv flag makes sense as !CAP_BPF,
> > but it's not equivalent to bpf_bypass_spec_v1.
> >
> 
> argh, right, it's perfmon_capable() :(
> 
> what do you propose? do bpf_capable and perfmon_capable fields for
> each map separately? or keep unpriv and add perfmon_capable
> separately? or any better ideas?..

Instead of map->unpriv I'd add map->bpf_capable and map->perfmon_capable
just like we'll be doing to progs.
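
A rough sketch of that shape (hypothetical; not part of the posted series):

struct bpf_map {
        /* ... existing fields ... */
        bool bpf_capable;     /* creator had CAP_BPF or CAP_SYS_ADMIN */
        bool perfmon_capable; /* creator had CAP_PERFMON or CAP_SYS_ADMIN */
};

Spectre v1 masking would then key off !map->perfmon_capable, while the
BTF-field restriction in map_check_btf() would key off !map->bpf_capable.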
Andrii Nakryiko May 4, 2023, 11:06 p.m. UTC | #4
On Thu, May 4, 2023 at 3:55 PM Alexei Starovoitov
<alexei.starovoitov@gmail.com> wrote:
>
> On Thu, May 04, 2023 at 03:51:16PM -0700, Andrii Nakryiko wrote:
> > On Thu, May 4, 2023 at 1:05 PM Alexei Starovoitov
> > <alexei.starovoitov@gmail.com> wrote:
> > >
> > > On Tue, May 02, 2023 at 04:06:13PM -0700, Andrii Nakryiko wrote:
> > > >  }
> > > >
> > > > -static struct bpf_map *array_map_alloc(union bpf_attr *attr)
> > > > +static u32 array_index_mask(u32 max_entries)
> > > >  {
> > > > -     bool percpu = attr->map_type == BPF_MAP_TYPE_PERCPU_ARRAY;
> > > > -     int numa_node = bpf_map_attr_numa_node(attr);
> > > > -     u32 elem_size, index_mask, max_entries;
> > > > -     bool bypass_spec_v1 = bpf_bypass_spec_v1();
> > >
> > > static inline bool bpf_bypass_spec_v1(void)
> > > {
> > >         return perfmon_capable();
> > > }
> > >
> > > > +             /* unprivileged is OK, but we still record if we had CAP_BPF */
> > > > +             unpriv = !bpf_capable();
> > >
> > > map->unpriv flag makes sense as !CAP_BPF,
> > > but it's not equivalent to bpf_bypass_spec_v1.
> > >
> >
> > argh, right, it's perfmon_capable() :(
> >
> > what do you propose? do bpf_capable and perfmon_capable fields for
> > each map separately? or keep unpriv and add perfmon_capable
> > separately? or any better ideas?..
>
> Instead of map->unpriv I'd add map->bpf_capable and map->perfmon_capable
> just like we'll be doing to progs.

ok, sounds good!

Patch

diff --git a/include/linux/bpf.h b/include/linux/bpf.h
index 456f33b9d205..479657bb113e 100644
--- a/include/linux/bpf.h
+++ b/include/linux/bpf.h
@@ -273,7 +273,7 @@  struct bpf_map {
 		bool jited;
 		bool xdp_has_frags;
 	} owner;
-	bool bypass_spec_v1;
+	bool unpriv;
 	bool frozen; /* write-once; write-protected by freeze_mutex */
 };
 
@@ -2058,6 +2058,8 @@  static inline bool bpf_bypass_spec_v1(void)
 	return perfmon_capable();
 }
 
+int bpf_array_adjust_for_spec_v1(union bpf_attr *attr);
+
 static inline bool bpf_bypass_spec_v4(void)
 {
 	return perfmon_capable();
diff --git a/kernel/bpf/arraymap.c b/kernel/bpf/arraymap.c
index 2058e89b5ddd..a51d22a3afd1 100644
--- a/kernel/bpf/arraymap.c
+++ b/kernel/bpf/arraymap.c
@@ -77,18 +77,9 @@  int array_map_alloc_check(union bpf_attr *attr)
 	return 0;
 }
 
-static struct bpf_map *array_map_alloc(union bpf_attr *attr)
+static u32 array_index_mask(u32 max_entries)
 {
-	bool percpu = attr->map_type == BPF_MAP_TYPE_PERCPU_ARRAY;
-	int numa_node = bpf_map_attr_numa_node(attr);
-	u32 elem_size, index_mask, max_entries;
-	bool bypass_spec_v1 = bpf_bypass_spec_v1();
-	u64 array_size, mask64;
-	struct bpf_array *array;
-
-	elem_size = round_up(attr->value_size, 8);
-
-	max_entries = attr->max_entries;
+	u64 mask64;
 
 	/* On 32 bit archs roundup_pow_of_two() with max_entries that has
 	 * upper most bit set in u32 space is undefined behavior due to
@@ -98,17 +89,38 @@  static struct bpf_map *array_map_alloc(union bpf_attr *attr)
 	mask64 = 1ULL << mask64;
 	mask64 -= 1;
 
-	index_mask = mask64;
-	if (!bypass_spec_v1) {
-		/* round up array size to nearest power of 2,
-		 * since cpu will speculate within index_mask limits
-		 */
-		max_entries = index_mask + 1;
-		/* Check for overflows. */
-		if (max_entries < attr->max_entries)
-			return ERR_PTR(-E2BIG);
-	}
+	return (u32)mask64;
+}
+
+int bpf_array_adjust_for_spec_v1(union bpf_attr *attr)
+{
+	u32 max_entries, index_mask;
+
+	/* round up array size to nearest power of 2,
+	 * since cpu will speculate within index_mask limits
+	 */
+	index_mask = array_index_mask(attr->max_entries);
+	max_entries = index_mask + 1;
+	/* Check for overflows. */
+	if (max_entries < attr->max_entries)
+		return -E2BIG;
+
+	attr->max_entries = max_entries;
+	return 0;
+}
 
+static struct bpf_map *array_map_alloc(union bpf_attr *attr)
+{
+	bool percpu = attr->map_type == BPF_MAP_TYPE_PERCPU_ARRAY;
+	int numa_node = bpf_map_attr_numa_node(attr);
+	u32 elem_size, index_mask, max_entries;
+	u64 array_size;
+	struct bpf_array *array;
+
+	elem_size = round_up(attr->value_size, 8);
+
+	max_entries = attr->max_entries;
+	index_mask = array_index_mask(max_entries);
 	array_size = sizeof(*array);
 	if (percpu) {
 		array_size += (u64) max_entries * sizeof(void *);
@@ -140,7 +152,6 @@  static struct bpf_map *array_map_alloc(union bpf_attr *attr)
 	if (!array)
 		return ERR_PTR(-ENOMEM);
 	array->index_mask = index_mask;
-	array->map.bypass_spec_v1 = bypass_spec_v1;
 
 	/* copy mandatory map attributes */
 	bpf_map_init_from_attr(&array->map, attr);
@@ -216,7 +227,7 @@  static int array_map_gen_lookup(struct bpf_map *map, struct bpf_insn *insn_buf)
 
 	*insn++ = BPF_ALU64_IMM(BPF_ADD, map_ptr, offsetof(struct bpf_array, value));
 	*insn++ = BPF_LDX_MEM(BPF_W, ret, index, 0);
-	if (!map->bypass_spec_v1) {
+	if (map->unpriv) {
 		*insn++ = BPF_JMP_IMM(BPF_JGE, ret, map->max_entries, 4);
 		*insn++ = BPF_ALU32_IMM(BPF_AND, ret, array->index_mask);
 	} else {
@@ -1373,7 +1384,7 @@  static int array_of_map_gen_lookup(struct bpf_map *map,
 
 	*insn++ = BPF_ALU64_IMM(BPF_ADD, map_ptr, offsetof(struct bpf_array, value));
 	*insn++ = BPF_LDX_MEM(BPF_W, ret, index, 0);
-	if (!map->bypass_spec_v1) {
+	if (map->unpriv) {
 		*insn++ = BPF_JMP_IMM(BPF_JGE, ret, map->max_entries, 6);
 		*insn++ = BPF_ALU32_IMM(BPF_AND, ret, array->index_mask);
 	} else {
diff --git a/kernel/bpf/map_in_map.c b/kernel/bpf/map_in_map.c
index 2c5c64c2a53b..21cb4be92097 100644
--- a/kernel/bpf/map_in_map.c
+++ b/kernel/bpf/map_in_map.c
@@ -41,6 +41,7 @@  struct bpf_map *bpf_map_meta_alloc(int inner_map_ufd)
 		goto put;
 	}
 
+	inner_map_meta->unpriv = inner_map->unpriv;
 	inner_map_meta->map_type = inner_map->map_type;
 	inner_map_meta->key_size = inner_map->key_size;
 	inner_map_meta->value_size = inner_map->value_size;
@@ -69,7 +70,6 @@  struct bpf_map *bpf_map_meta_alloc(int inner_map_ufd)
 	/* Misc members not needed in bpf_map_meta_equal() check. */
 	inner_map_meta->ops = inner_map->ops;
 	if (inner_map->ops == &array_map_ops) {
-		inner_map_meta->bypass_spec_v1 = inner_map->bypass_spec_v1;
 		container_of(inner_map_meta, struct bpf_array, map)->index_mask =
 		     container_of(inner_map, struct bpf_array, map)->index_mask;
 	}
@@ -98,6 +98,7 @@  bool bpf_map_meta_equal(const struct bpf_map *meta0,
 		meta0->key_size == meta1->key_size &&
 		meta0->value_size == meta1->value_size &&
 		meta0->map_flags == meta1->map_flags &&
+		meta0->unpriv == meta1->unpriv &&
 		btf_record_equal(meta0->record, meta1->record);
 }
 
diff --git a/kernel/bpf/syscall.c b/kernel/bpf/syscall.c
index 92127eaee467..ffc61a764fe5 100644
--- a/kernel/bpf/syscall.c
+++ b/kernel/bpf/syscall.c
@@ -1010,7 +1010,7 @@  static int map_check_btf(struct bpf_map *map, const struct btf *btf,
 	if (!IS_ERR_OR_NULL(map->record)) {
 		int i;
 
-		if (!bpf_capable()) {
+		if (map->unpriv) {
 			ret = -EPERM;
 			goto free_map_tab;
 		}
@@ -1100,6 +1100,7 @@  static int map_create(union bpf_attr *attr)
 	int numa_node = bpf_map_attr_numa_node(attr);
 	u32 map_type = attr->map_type;
 	struct bpf_map *map;
+	bool unpriv;
 	int f_flags;
 	int err;
 
@@ -1176,6 +1177,7 @@  static int map_create(union bpf_attr *attr)
 	case BPF_MAP_TYPE_CPUMAP:
 		if (!bpf_capable())
 			return -EPERM;
+		unpriv = false;
 		break;
 	case BPF_MAP_TYPE_SOCKMAP:
 	case BPF_MAP_TYPE_SOCKHASH:
@@ -1184,6 +1186,7 @@  static int map_create(union bpf_attr *attr)
 	case BPF_MAP_TYPE_XSKMAP:
 		if (!capable(CAP_NET_ADMIN))
 			return -EPERM;
+		unpriv = false;
 		break;
 	case BPF_MAP_TYPE_ARRAY:
 	case BPF_MAP_TYPE_PERCPU_ARRAY:
@@ -1198,18 +1201,36 @@  static int map_create(union bpf_attr *attr)
 	case BPF_MAP_TYPE_USER_RINGBUF:
 	case BPF_MAP_TYPE_CGROUP_STORAGE:
 	case BPF_MAP_TYPE_PERCPU_CGROUP_STORAGE:
-		/* unprivileged */
+		/* unprivileged is OK, but we still record if we had CAP_BPF */
+		unpriv = !bpf_capable();
 		break;
 	default:
 		WARN(1, "unsupported map type %d", map_type);
 		return -EPERM;
 	}
 
+	/* ARRAY-like maps have special sizing provisions for mitigating Spectre v1 */
+	if (unpriv) {
+		switch (map_type) {
+		case BPF_MAP_TYPE_ARRAY:
+		case BPF_MAP_TYPE_PERCPU_ARRAY:
+		case BPF_MAP_TYPE_PROG_ARRAY:
+		case BPF_MAP_TYPE_PERF_EVENT_ARRAY:
+		case BPF_MAP_TYPE_CGROUP_ARRAY:
+		case BPF_MAP_TYPE_ARRAY_OF_MAPS:
+			err = bpf_array_adjust_for_spec_v1(attr);
+			if (err)
+				return err;
+			break;
+		}
+	}
+
 	map = ops->map_alloc(attr);
 	if (IS_ERR(map))
 		return PTR_ERR(map);
 	map->ops = ops;
 	map->map_type = map_type;
+	map->unpriv = unpriv;
 
 	err = bpf_obj_name_cpy(map->name, attr->map_name,
 			       sizeof(attr->map_name));
diff --git a/kernel/bpf/verifier.c b/kernel/bpf/verifier.c
index ff4a8ab99f08..481aaf189183 100644
--- a/kernel/bpf/verifier.c
+++ b/kernel/bpf/verifier.c
@@ -8731,11 +8731,9 @@  record_func_map(struct bpf_verifier_env *env, struct bpf_call_arg_meta *meta,
 	}
 
 	if (!BPF_MAP_PTR(aux->map_ptr_state))
-		bpf_map_ptr_store(aux, meta->map_ptr,
-				  !meta->map_ptr->bypass_spec_v1);
+		bpf_map_ptr_store(aux, meta->map_ptr, meta->map_ptr->unpriv);
 	else if (BPF_MAP_PTR(aux->map_ptr_state) != meta->map_ptr)
-		bpf_map_ptr_store(aux, BPF_MAP_PTR_POISON,
-				  !meta->map_ptr->bypass_spec_v1);
+		bpf_map_ptr_store(aux, BPF_MAP_PTR_POISON, meta->map_ptr->unpriv);
 	return 0;
 }