[bpf-next,v2] bpf: Do not try bpf_msg_push_data with len 0

Message ID	df69012695c7094ccb1943ca02b4920db3537466.1644421921.git.fmaurer@redhat.com (mailing list archive)
State	Accepted
Commit	4a11678f683814df82fca9018d964771e02d7e6d
Delegated to:	BPF
Headers	show Return-Path: <bpf-owner@kernel.org> From: Felix Maurer <fmaurer@redhat.com> To: bpf@vger.kernel.org Cc: ast@kernel.org, daniel@iogearbox.net, andrii@kernel.org, kafai@fb.com, songliubraving@fb.com, yhs@fb.com, john.fastabend@gmail.com, kpsingh@kernel.org Subject: [PATCH bpf-next v2] bpf: Do not try bpf_msg_push_data with len 0 Date: Wed, 9 Feb 2022 16:55:26 +0100 Message-Id: <df69012695c7094ccb1943ca02b4920db3537466.1644421921.git.fmaurer@redhat.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Precedence: bulk
Series	[bpf-next,v2] bpf: Do not try bpf_msg_push_data with len 0 \| expand [bpf-next,v2] bpf: Do not try bpf_msg_push_data with len 0

Context	Check	Description
netdev/tree_selection	success	Clearly marked for bpf-next
netdev/fixes_present	success	Fixes tag not required for -next series
netdev/subject_prefix	success	Link
netdev/cover_letter	success	Single patches do not need cover letters
netdev/patch_count	success	Link
netdev/header_inline	success	No static functions without inline keyword in header files
netdev/build_32bit	success	Errors and warnings before: 28 this patch: 28
netdev/cc_maintainers	warning	3 maintainers not CCed: netdev@vger.kernel.org davem@davemloft.net kuba@kernel.org
netdev/build_clang	success	Errors and warnings before: 18 this patch: 18
netdev/module_param	success	Was 0 now: 0
netdev/verify_signedoff	success	Signed-off-by tag matches author and committer
netdev/verify_fixes	success	Fixes tag looks correct
netdev/build_allmodconfig_warn	success	Errors and warnings before: 33 this patch: 33
netdev/checkpatch	success	total: 0 errors, 0 warnings, 0 checks, 9 lines checked
netdev/kdoc	success	Errors and warnings before: 0 this patch: 0
netdev/source_inline	success	Was 0 now: 0
bpf/vmtest-bpf-next-PR	success	PR summary
bpf/vmtest-bpf-next	success	VM_Test

Felix Maurer Feb. 9, 2022, 3:55 p.m. UTC

If bpf_msg_push_data is called with len 0 (as it happens during
selftests/bpf/test_sockmap), we do not need to do anything and can
return early.

Calling bpf_msg_push_data with len 0 previously lead to a wrong ENOMEM
error: we later called get_order(copy + len); if len was 0, copy + len
was also often 0 and get_order returned some undefined value (at the
moment 52). alloc_pages caught that and failed, but then
bpf_msg_push_data returned ENOMEM. This was wrong because we are most
probably not out of memory and actually do not need any additional
memory.

v2: Add bug description and Fixes tag

Fixes: 6fff607e2f14b ("bpf: sk_msg program helper bpf_msg_push_data")
Signed-off-by: Felix Maurer <fmaurer@redhat.com>
---
 net/core/filter.c | 3 +++
 1 file changed, 3 insertions(+)

Yonghong Song Feb. 9, 2022, 5:06 p.m. UTC | #1

On 2/9/22 7:55 AM, Felix Maurer wrote:
> If bpf_msg_push_data is called with len 0 (as it happens during
> selftests/bpf/test_sockmap), we do not need to do anything and can
> return early.
> 
> Calling bpf_msg_push_data with len 0 previously lead to a wrong ENOMEM
> error: we later called get_order(copy + len); if len was 0, copy + len
> was also often 0 and get_order returned some undefined value (at the
> moment 52). alloc_pages caught that and failed, but then
> bpf_msg_push_data returned ENOMEM. This was wrong because we are most
> probably not out of memory and actually do not need any additional
> memory.
> 
> v2: Add bug description and Fixes tag
> 
> Fixes: 6fff607e2f14b ("bpf: sk_msg program helper bpf_msg_push_data")
> Signed-off-by: Felix Maurer <fmaurer@redhat.com>

LGTM. I am wondering why bpf CI didn't catch this problem. Did you
modified the test with length 0 in order to trigger that? If this
is the case, it would be great you can add such a test to the
test_sockmap.

Acked-by: Yonghong Song <yhs@fb.com>

Felix Maurer Feb. 10, 2022, 3:45 p.m. UTC | #2

On 09.02.22 18:06, Yonghong Song wrote:
> On 2/9/22 7:55 AM, Felix Maurer wrote:
>> If bpf_msg_push_data is called with len 0 (as it happens during
>> selftests/bpf/test_sockmap), we do not need to do anything and can
>> return early.
>>
>> Calling bpf_msg_push_data with len 0 previously lead to a wrong ENOMEM
>> error: we later called get_order(copy + len); if len was 0, copy + len
>> was also often 0 and get_order returned some undefined value (at the
>> moment 52). alloc_pages caught that and failed, but then
>> bpf_msg_push_data returned ENOMEM. This was wrong because we are most
>> probably not out of memory and actually do not need any additional
>> memory.
>>
>> v2: Add bug description and Fixes tag
>>
>> Fixes: 6fff607e2f14b ("bpf: sk_msg program helper bpf_msg_push_data")
>> Signed-off-by: Felix Maurer <fmaurer@redhat.com>
> 
> LGTM. I am wondering why bpf CI didn't catch this problem. Did you
> modified the test with length 0 in order to trigger that? If this
> is the case, it would be great you can add such a test to the
> test_sockmap.

I did not modify the tests to trigger that. The state of the selftests
around that is unfortunately not very good. There is no explicit test
with length 0 but bpf_msg_push_data is still called with length 0,
because of what I consider to be bugs in the test. On the other hand,
explicit tests with other lengths are sometimes not called as well. I'll
elaborate on that in a bit.

Something easy to fix is that the tests do not check the return value of
bpf_msg_push_data which they probably should. That may have helped find
the problem earlier.

Now to the issue mentioned in the beginning: Only some of the BPF
programs used in test_sockmap actually call bpf_msg_push_data. However,
they are not always attached, just for particular scenarios:
txmsg_pass==1, txmsg_redir==1, or txmsg_drop==1. If none of those apply,
bpf_msg_push_data is never called. This happens for example in
test_txmsg_push. Out of the four defined tests only one actually calls
the helper.

But after a test, the parameters in the map are reset to 0 (instead of
being removed). Therefore, when the maps are reused in a subsequent test
which is one of the scenarios above, the values are present and
bpf_msg_push_data is called, albeit with the parameters set to 0. This
is also what triggered the wrong behavior fixed in the patch.

Unfortunately, I do not have the time to fix these issues in the test at
the moment.

> Acked-by: Yonghong Song <yhs@fb.com>

Thanks!

Yonghong Song Feb. 10, 2022, 6:04 p.m. UTC | #3

On 2/10/22 7:45 AM, Felix Maurer wrote:
> On 09.02.22 18:06, Yonghong Song wrote:
>> On 2/9/22 7:55 AM, Felix Maurer wrote:
>>> If bpf_msg_push_data is called with len 0 (as it happens during
>>> selftests/bpf/test_sockmap), we do not need to do anything and can
>>> return early.
>>>
>>> Calling bpf_msg_push_data with len 0 previously lead to a wrong ENOMEM
>>> error: we later called get_order(copy + len); if len was 0, copy + len
>>> was also often 0 and get_order returned some undefined value (at the
>>> moment 52). alloc_pages caught that and failed, but then
>>> bpf_msg_push_data returned ENOMEM. This was wrong because we are most
>>> probably not out of memory and actually do not need any additional
>>> memory.
>>>
>>> v2: Add bug description and Fixes tag
>>>
>>> Fixes: 6fff607e2f14b ("bpf: sk_msg program helper bpf_msg_push_data")
>>> Signed-off-by: Felix Maurer <fmaurer@redhat.com>
>>
>> LGTM. I am wondering why bpf CI didn't catch this problem. Did you
>> modified the test with length 0 in order to trigger that? If this
>> is the case, it would be great you can add such a test to the
>> test_sockmap.
> 
> I did not modify the tests to trigger that. The state of the selftests
> around that is unfortunately not very good. There is no explicit test
> with length 0 but bpf_msg_push_data is still called with length 0,
> because of what I consider to be bugs in the test. On the other hand,
> explicit tests with other lengths are sometimes not called as well. I'll
> elaborate on that in a bit.
> 
> Something easy to fix is that the tests do not check the return value of
> bpf_msg_push_data which they probably should. That may have helped find
> the problem earlier.
> 
> Now to the issue mentioned in the beginning: Only some of the BPF
> programs used in test_sockmap actually call bpf_msg_push_data. However,
> they are not always attached, just for particular scenarios:
> txmsg_pass==1, txmsg_redir==1, or txmsg_drop==1. If none of those apply,
> bpf_msg_push_data is never called. This happens for example in
> test_txmsg_push. Out of the four defined tests only one actually calls
> the helper.
> 
> But after a test, the parameters in the map are reset to 0 (instead of
> being removed). Therefore, when the maps are reused in a subsequent test
> which is one of the scenarios above, the values are present and
> bpf_msg_push_data is called, albeit with the parameters set to 0. This
> is also what triggered the wrong behavior fixed in the patch.
> 
> Unfortunately, I do not have the time to fix these issues in the test at
> the moment.

Thanks for detailed explanation. Maybe for the immediate case, can you
just fix this in the selftest,

   > Something easy to fix is that the tests do not check the return 
value of
   > bpf_msg_push_data which they probably should. That may have helped find
   > the problem earlier.

This will be enough to verify your kernel change as without it the
test will fail.

The rest of test improvements can come later.

> 
>> Acked-by: Yonghong Song <yhs@fb.com>
> 
> Thanks!
>

Alexei Starovoitov Feb. 11, 2022, 3:08 a.m. UTC | #4

On Thu, Feb 10, 2022 at 10:05 AM Yonghong Song <yhs@fb.com> wrote:
>
>
>
> On 2/10/22 7:45 AM, Felix Maurer wrote:
> > On 09.02.22 18:06, Yonghong Song wrote:
> >> On 2/9/22 7:55 AM, Felix Maurer wrote:
> >>> If bpf_msg_push_data is called with len 0 (as it happens during
> >>> selftests/bpf/test_sockmap), we do not need to do anything and can
> >>> return early.
> >>>
> >>> Calling bpf_msg_push_data with len 0 previously lead to a wrong ENOMEM
> >>> error: we later called get_order(copy + len); if len was 0, copy + len
> >>> was also often 0 and get_order returned some undefined value (at the
> >>> moment 52). alloc_pages caught that and failed, but then
> >>> bpf_msg_push_data returned ENOMEM. This was wrong because we are most
> >>> probably not out of memory and actually do not need any additional
> >>> memory.
> >>>
> >>> v2: Add bug description and Fixes tag
> >>>
> >>> Fixes: 6fff607e2f14b ("bpf: sk_msg program helper bpf_msg_push_data")
> >>> Signed-off-by: Felix Maurer <fmaurer@redhat.com>
> >>
> >> LGTM. I am wondering why bpf CI didn't catch this problem. Did you
> >> modified the test with length 0 in order to trigger that? If this
> >> is the case, it would be great you can add such a test to the
> >> test_sockmap.
> >
> > I did not modify the tests to trigger that. The state of the selftests
> > around that is unfortunately not very good. There is no explicit test
> > with length 0 but bpf_msg_push_data is still called with length 0,
> > because of what I consider to be bugs in the test. On the other hand,
> > explicit tests with other lengths are sometimes not called as well. I'll
> > elaborate on that in a bit.
> >
> > Something easy to fix is that the tests do not check the return value of
> > bpf_msg_push_data which they probably should. That may have helped find
> > the problem earlier.
> >
> > Now to the issue mentioned in the beginning: Only some of the BPF
> > programs used in test_sockmap actually call bpf_msg_push_data. However,
> > they are not always attached, just for particular scenarios:
> > txmsg_pass==1, txmsg_redir==1, or txmsg_drop==1. If none of those apply,
> > bpf_msg_push_data is never called. This happens for example in
> > test_txmsg_push. Out of the four defined tests only one actually calls
> > the helper.
> >
> > But after a test, the parameters in the map are reset to 0 (instead of
> > being removed). Therefore, when the maps are reused in a subsequent test
> > which is one of the scenarios above, the values are present and
> > bpf_msg_push_data is called, albeit with the parameters set to 0. This
> > is also what triggered the wrong behavior fixed in the patch.
> >
> > Unfortunately, I do not have the time to fix these issues in the test at
> > the moment.
>
> Thanks for detailed explanation. Maybe for the immediate case, can you
> just fix this in the selftest,
>
>    > Something easy to fix is that the tests do not check the return
> value of
>    > bpf_msg_push_data which they probably should. That may have helped find
>    > the problem earlier.
>
> This will be enough to verify your kernel change as without it the
> test will fail.
>
> The rest of test improvements can come later.

John,
what is your take on this fix?
bpf tree material?

John Fastabend Feb. 11, 2022, 6:33 a.m. UTC | #5

Alexei Starovoitov wrote:
> On Thu, Feb 10, 2022 at 10:05 AM Yonghong Song <yhs@fb.com> wrote:
> >
> >
> >
> > On 2/10/22 7:45 AM, Felix Maurer wrote:
> > > On 09.02.22 18:06, Yonghong Song wrote:
> > >> On 2/9/22 7:55 AM, Felix Maurer wrote:
> > >>> If bpf_msg_push_data is called with len 0 (as it happens during
> > >>> selftests/bpf/test_sockmap), we do not need to do anything and can
> > >>> return early.
> > >>>
> > >>> Calling bpf_msg_push_data with len 0 previously lead to a wrong ENOMEM
> > >>> error: we later called get_order(copy + len); if len was 0, copy + len
> > >>> was also often 0 and get_order returned some undefined value (at the
> > >>> moment 52). alloc_pages caught that and failed, but then
> > >>> bpf_msg_push_data returned ENOMEM. This was wrong because we are most
> > >>> probably not out of memory and actually do not need any additional
> > >>> memory.
> > >>>
> > >>> v2: Add bug description and Fixes tag
> > >>>
> > >>> Fixes: 6fff607e2f14b ("bpf: sk_msg program helper bpf_msg_push_data")
> > >>> Signed-off-by: Felix Maurer <fmaurer@redhat.com>
> > >>
> > >> LGTM. I am wondering why bpf CI didn't catch this problem. Did you
> > >> modified the test with length 0 in order to trigger that? If this
> > >> is the case, it would be great you can add such a test to the
> > >> test_sockmap.
> > >
> > > I did not modify the tests to trigger that. The state of the selftests
> > > around that is unfortunately not very good. There is no explicit test
> > > with length 0 but bpf_msg_push_data is still called with length 0,
> > > because of what I consider to be bugs in the test. On the other hand,
> > > explicit tests with other lengths are sometimes not called as well. I'll
> > > elaborate on that in a bit.
> > >
> > > Something easy to fix is that the tests do not check the return value of
> > > bpf_msg_push_data which they probably should. That may have helped find
> > > the problem earlier.
> > >
> > > Now to the issue mentioned in the beginning: Only some of the BPF
> > > programs used in test_sockmap actually call bpf_msg_push_data. However,
> > > they are not always attached, just for particular scenarios:
> > > txmsg_pass==1, txmsg_redir==1, or txmsg_drop==1. If none of those apply,
> > > bpf_msg_push_data is never called. This happens for example in
> > > test_txmsg_push. Out of the four defined tests only one actually calls
> > > the helper.
> > >
> > > But after a test, the parameters in the map are reset to 0 (instead of
> > > being removed). Therefore, when the maps are reused in a subsequent test
> > > which is one of the scenarios above, the values are present and
> > > bpf_msg_push_data is called, albeit with the parameters set to 0. This
> > > is also what triggered the wrong behavior fixed in the patch.
> > >
> > > Unfortunately, I do not have the time to fix these issues in the test at
> > > the moment.
> >
> > Thanks for detailed explanation. Maybe for the immediate case, can you
> > just fix this in the selftest,
> >
> >    > Something easy to fix is that the tests do not check the return
> > value of
> >    > bpf_msg_push_data which they probably should. That may have helped find
> >    > the problem earlier.
> >
> > This will be enough to verify your kernel change as without it the
> > test will fail.
> >
> > The rest of test improvements can come later.
> 
> John,
> what is your take on this fix?

Fix looks good its nice to return 0 here instead of ENOMEM as a result
of paticulars of passing 0 to get_order(). Ack for me.

> bpf tree material?

I checked our code here and we would never pass '0' to pull data. Its hard
to imagine what type of code would do that, but on the other hand its a
bug and its only rc3... I've no strong opinion if I wrote the patch I would
have pointed it at bpf tree so slight preference to push it as a fix.

Acked-by: John Fastabend <john.fastabend@gmail.com>

patchwork-bot+netdevbpf@kernel.org Feb. 11, 2022, 1 p.m. UTC | #6

Hello:

This patch was applied to bpf/bpf.git (master)
by Daniel Borkmann <daniel@iogearbox.net>:

On Wed,  9 Feb 2022 16:55:26 +0100 you wrote:
> If bpf_msg_push_data is called with len 0 (as it happens during
> selftests/bpf/test_sockmap), we do not need to do anything and can
> return early.
> 
> Calling bpf_msg_push_data with len 0 previously lead to a wrong ENOMEM
> error: we later called get_order(copy + len); if len was 0, copy + len
> was also often 0 and get_order returned some undefined value (at the
> moment 52). alloc_pages caught that and failed, but then
> bpf_msg_push_data returned ENOMEM. This was wrong because we are most
> probably not out of memory and actually do not need any additional
> memory.
> 
> [...]

Here is the summary with links:
  - [bpf-next,v2] bpf: Do not try bpf_msg_push_data with len 0
    https://git.kernel.org/bpf/bpf/c/4a11678f6838

You are awesome, thank you!

Felix Maurer Feb. 11, 2022, 5:52 p.m. UTC | #7

On 10.02.22 19:04, Yonghong Song wrote:
> On 2/10/22 7:45 AM, Felix Maurer wrote:
>> On 09.02.22 18:06, Yonghong Song wrote:
>>> On 2/9/22 7:55 AM, Felix Maurer wrote:
>>>> If bpf_msg_push_data is called with len 0 (as it happens during
>>>> selftests/bpf/test_sockmap), we do not need to do anything and can
>>>> return early.
>>>>
>>>> Calling bpf_msg_push_data with len 0 previously lead to a wrong ENOMEM
>>>> error: we later called get_order(copy + len); if len was 0, copy + len
>>>> was also often 0 and get_order returned some undefined value (at the
>>>> moment 52). alloc_pages caught that and failed, but then
>>>> bpf_msg_push_data returned ENOMEM. This was wrong because we are most
>>>> probably not out of memory and actually do not need any additional
>>>> memory.
>>>>
>>>> v2: Add bug description and Fixes tag
>>>>
>>>> Fixes: 6fff607e2f14b ("bpf: sk_msg program helper bpf_msg_push_data")
>>>> Signed-off-by: Felix Maurer <fmaurer@redhat.com>
>>>
>>> LGTM. I am wondering why bpf CI didn't catch this problem. Did you
>>> modified the test with length 0 in order to trigger that? If this
>>> is the case, it would be great you can add such a test to the
>>> test_sockmap.
>>
>> I did not modify the tests to trigger that. The state of the selftests
>> around that is unfortunately not very good. There is no explicit test
>> with length 0 but bpf_msg_push_data is still called with length 0,
>> because of what I consider to be bugs in the test. On the other hand,
>> explicit tests with other lengths are sometimes not called as well. I'll
>> elaborate on that in a bit.
>>
>> Something easy to fix is that the tests do not check the return value of
>> bpf_msg_push_data which they probably should. That may have helped find
>> the problem earlier.
>>
>> Now to the issue mentioned in the beginning: Only some of the BPF
>> programs used in test_sockmap actually call bpf_msg_push_data. However,
>> they are not always attached, just for particular scenarios:
>> txmsg_pass==1, txmsg_redir==1, or txmsg_drop==1. If none of those apply,
>> bpf_msg_push_data is never called. This happens for example in
>> test_txmsg_push. Out of the four defined tests only one actually calls
>> the helper.
>>
>> But after a test, the parameters in the map are reset to 0 (instead of
>> being removed). Therefore, when the maps are reused in a subsequent test
>> which is one of the scenarios above, the values are present and
>> bpf_msg_push_data is called, albeit with the parameters set to 0. This
>> is also what triggered the wrong behavior fixed in the patch.
>>
>> Unfortunately, I do not have the time to fix these issues in the test at
>> the moment.
> 
> Thanks for detailed explanation. Maybe for the immediate case, can you
> just fix this in the selftest,
> 
>   > Something easy to fix is that the tests do not check the return
> value of
>   > bpf_msg_push_data which they probably should. That may have helped find
>   > the problem earlier.
> 
> This will be enough to verify your kernel change as without it the
> test will fail.

I just send a patch checking the return values of the bpf_msg_push_data
usages in the test [1]. Passing the errors to userspace by dropping
packets is not very nice, but a straightforward way in the current test
program.

I did try the same checks of the return values of bpf_msg_pull_data, but
then the tests fail. So there might be something hidden here as well.

[1]:https://lore.kernel.org/bpf/89f767bb44005d6b4dd1f42038c438f76b3ebfad.1644601294.git.fmaurer@redhat.com/

> The rest of test improvements can come later.
> 
>>
>>> Acked-by: Yonghong Song <yhs@fb.com>
>>
>> Thanks!
>>
>

Yonghong Song Feb. 11, 2022, 10:27 p.m. UTC | #8

On 2/11/22 9:52 AM, Felix Maurer wrote:
> On 10.02.22 19:04, Yonghong Song wrote:
>> On 2/10/22 7:45 AM, Felix Maurer wrote:
>>> On 09.02.22 18:06, Yonghong Song wrote:
>>>> On 2/9/22 7:55 AM, Felix Maurer wrote:
>>>>> If bpf_msg_push_data is called with len 0 (as it happens during
>>>>> selftests/bpf/test_sockmap), we do not need to do anything and can
>>>>> return early.
>>>>>
>>>>> Calling bpf_msg_push_data with len 0 previously lead to a wrong ENOMEM
>>>>> error: we later called get_order(copy + len); if len was 0, copy + len
>>>>> was also often 0 and get_order returned some undefined value (at the
>>>>> moment 52). alloc_pages caught that and failed, but then
>>>>> bpf_msg_push_data returned ENOMEM. This was wrong because we are most
>>>>> probably not out of memory and actually do not need any additional
>>>>> memory.
>>>>>
>>>>> v2: Add bug description and Fixes tag
>>>>>
>>>>> Fixes: 6fff607e2f14b ("bpf: sk_msg program helper bpf_msg_push_data")
>>>>> Signed-off-by: Felix Maurer <fmaurer@redhat.com>
>>>>
>>>> LGTM. I am wondering why bpf CI didn't catch this problem. Did you
>>>> modified the test with length 0 in order to trigger that? If this
>>>> is the case, it would be great you can add such a test to the
>>>> test_sockmap.
>>>
>>> I did not modify the tests to trigger that. The state of the selftests
>>> around that is unfortunately not very good. There is no explicit test
>>> with length 0 but bpf_msg_push_data is still called with length 0,
>>> because of what I consider to be bugs in the test. On the other hand,
>>> explicit tests with other lengths are sometimes not called as well. I'll
>>> elaborate on that in a bit.
>>>
>>> Something easy to fix is that the tests do not check the return value of
>>> bpf_msg_push_data which they probably should. That may have helped find
>>> the problem earlier.
>>>
>>> Now to the issue mentioned in the beginning: Only some of the BPF
>>> programs used in test_sockmap actually call bpf_msg_push_data. However,
>>> they are not always attached, just for particular scenarios:
>>> txmsg_pass==1, txmsg_redir==1, or txmsg_drop==1. If none of those apply,
>>> bpf_msg_push_data is never called. This happens for example in
>>> test_txmsg_push. Out of the four defined tests only one actually calls
>>> the helper.
>>>
>>> But after a test, the parameters in the map are reset to 0 (instead of
>>> being removed). Therefore, when the maps are reused in a subsequent test
>>> which is one of the scenarios above, the values are present and
>>> bpf_msg_push_data is called, albeit with the parameters set to 0. This
>>> is also what triggered the wrong behavior fixed in the patch.
>>>
>>> Unfortunately, I do not have the time to fix these issues in the test at
>>> the moment.
>>
>> Thanks for detailed explanation. Maybe for the immediate case, can you
>> just fix this in the selftest,
>>
>>    > Something easy to fix is that the tests do not check the return
>> value of
>>    > bpf_msg_push_data which they probably should. That may have helped find
>>    > the problem earlier.
>>
>> This will be enough to verify your kernel change as without it the
>> test will fail.
> 
> I just send a patch checking the return values of the bpf_msg_push_data
> usages in the test [1]. Passing the errors to userspace by dropping
> packets is not very nice, but a straightforward way in the current test
> program.
> 
> I did try the same checks of the return values of bpf_msg_pull_data, but
> then the tests fail. So there might be something hidden here as well.

Thanks for the patch! Maybe this can be the first step to fix
test_sockmap.

John, could you help take a look at the patch?

> 
> [1]:https://lore.kernel.org/bpf/89f767bb44005d6b4dd1f42038c438f76b3ebfad.1644601294.git.fmaurer@redhat.com/
> 
>> The rest of test improvements can come later.
>>
>>>
>>>> Acked-by: Yonghong Song <yhs@fb.com>
>>>
>>> Thanks!
>>>
>>
>

[bpf-next,v2] bpf: Do not try bpf_msg_push_data with len 0

Checks

Commit Message

Comments

Patch