Message ID | 20240129015708.GA1762343@coredump.intra.peff.net (mailing list archive) |
---|---|
State | Accepted |
Commit | 85a9a63c9268b18b24f25f6a14d6ae9966c3566d |
Headers | show |
Series | diff: handle NULL meta-info when spawning external diff | expand |
Jeff King <peff@peff.net> writes: >> $ git diff --no-index foo bar >> zsh: segmentation fault (core dumped) git diff --no-index foo bar > > Thanks for providing a simple reproduction recipe. There's a pretty > straight-forward fix below, though it leaves open some question of > whether there's another bug lurking with --no-index (but either way, I > think we'd want this simple fix as a first step). Yup, I agree with you that the "--no-index" mode violates the basic design that "the other path" and "xfrm_msg" go hand-in-hand. In its two tree comparison mode "git diff --no-index A/ B/", it should be able to behave sensibly, but in its two files comparison mode to compare plain regular files 'foo' and 'bar', there is nothing it can do reasonably, I am afraid. You could say that the change is renaming 'foo' to create 'bar', and feed consistent data that is aligned with that rename to external diff, which might be slightly more logical than showing a change to 'foo' that has no rename involved (i.e. omitting "other name"), but neither is satisfying. > But I'm not sure what fallout we might have from changing that behavior > now. So this patch takes the less-risky option, and simply teaches > run_external_diff() to avoid passing xfrm_msg when it's NULL. That makes > it agnostic to whether "other" and "xfrm_msg" always come as a pair. It > fixes the segfault now, and if we want to change the --no-index "other" > behavior on top, it will handle that, too. Sounds sensible. Thanks. Will queue.
On Mon, Jan 29, 2024 at 10:37:29AM -0800, Junio C Hamano wrote: > Jeff King <peff@peff.net> writes: > > >> $ git diff --no-index foo bar > >> zsh: segmentation fault (core dumped) git diff --no-index foo bar > > > > Thanks for providing a simple reproduction recipe. There's a pretty > > straight-forward fix below, though it leaves open some question of > > whether there's another bug lurking with --no-index (but either way, I > > think we'd want this simple fix as a first step). > > Yup, I agree with you that the "--no-index" mode violates the basic > design that "the other path" and "xfrm_msg" go hand-in-hand. In its > two tree comparison mode "git diff --no-index A/ B/", it should be > able to behave sensibly, but in its two files comparison mode to > compare plain regular files 'foo' and 'bar', there is nothing it can > do reasonably, I am afraid. You could say that the change is > renaming 'foo' to create 'bar', and feed consistent data that is > aligned with that rename to external diff, which might be slightly > more logical than showing a change to 'foo' that has no rename > involved (i.e. omitting "other name"), but neither is satisfying. Yeah, I think the two-tree mode does behave correctly, and this is really just about the two-blob mode. I agree that one could think of it as a rename or not, depending on how much you want to read into the importance of the names (after all, you could compare a/foo and b/foo, which is sort of a moral equivalent of the usual two-tree case). The current behavior is somewhere in between, though. You get an "other" name passed to the external diff, but the metainfo argument makes no mention of a rename (it's either blank for an exact rename, or may contain an "index" line if there was a content change). I'm not sure anybody really cares that much either way, though. It's external diff, which I suspect hardly anybody uses, and those extra fields aren't even documented in the first place. -Peff
Jeff King <peff@peff.net> writes: > The current behavior is somewhere in between, though. You get an "other" > name passed to the external diff, but the metainfo argument makes no > mention of a rename (it's either blank for an exact rename, or may > contain an "index" line if there was a content change). > > I'm not sure anybody really cares that much either way, though. It's > external diff, which I suspect hardly anybody uses, and those extra > fields aren't even documented in the first place. Oh, we probably should fix the documentation eventually, then. But I agree that in this case, whatever stops the segfault would be good enough. I am surprised to learn that this 8th hidden parameter dates back to 427dcb4b ([PATCH] Diff overhaul, adding half of copy detection., 2005-05-21), and it is more surprising that even before it happened, the external diff interface with 7 parameters was already documented, which happened with 03ea2802 ([PATCH 2/2] core-git documentation update, 2005-05-08). Before the addition of the copy detection, the presence of the "other" was how you learned if we saw a rename (because there was no copy, the only reason "other" is there was due to a rename). With copy detection added, extra bits of information needed to be passed and we started passing the xfrm_msg as well through the interface. At least, by dumping it to the end-user, an external diff driver could help the end-user tell if that "other" came from a rename or from a copy, even if it did not understand it itself. And of course, after merely 6 weeks since the inception, Git did not have the "--no-index" mode (we did not even have a unified "git diff" frontend), so this was never a problem back then.
diff --git a/diff.c b/diff.c index a89a6a6128..ccfa1fca0d 100644 --- a/diff.c +++ b/diff.c @@ -4384,7 +4384,8 @@ static void run_external_diff(const char *pgm, add_external_diff_name(o->repo, &cmd.args, two); if (other) { strvec_push(&cmd.args, other); - strvec_push(&cmd.args, xfrm_msg); + if (xfrm_msg) + strvec_push(&cmd.args, xfrm_msg); } } diff --git a/t/t4053-diff-no-index.sh b/t/t4053-diff-no-index.sh index 5ce345d309..651ec77660 100755 --- a/t/t4053-diff-no-index.sh +++ b/t/t4053-diff-no-index.sh @@ -205,6 +205,18 @@ test_expect_success POSIXPERM,SYMLINKS 'diff --no-index normalizes: mode not lik test_cmp expected actual ' +test_expect_success POSIXPERM 'external diff with mode-only change' ' + echo content >not-executable && + echo content >executable && + chmod +x executable && + echo executable executable $(test_oid zero) 100755 \ + not-executable $(test_oid zero) 100644 not-executable \ + >expect && + test_expect_code 1 git -c diff.external=echo diff \ + --no-index executable not-executable >actual && + test_cmp expect actual +' + test_expect_success "diff --no-index treats '-' as stdin" ' cat >expect <<-EOF && diff --git a/- b/a/1