Message ID | 991aaad37b41f71faa19fdef4373ccc115edcc40.1635802069.git.gitgitgadget@gmail.com (mailing list archive) |
---|---|
State | Superseded |
Headers | show |
Series | Sparse Index: diff and blame builtins | expand |
"Lessley Dennington via GitGitGadget" <gitgitgadget@gmail.com> writes: > 2000.34: git diff --staged (full-v3) 0.08 0.08 +0.0% > 2000.35: git diff --staged (full-v4) 0.08 0.08 +0.0% > 2000.36: git diff --staged (sparse-v3) 0.17 0.04 -76.5% > 2000.37: git diff --staged (sparse-v4) 0.16 0.04 -75.0% Please do not add more use of the synonym to the test suite, other than the one that makes sure the synonym works the same way as the real option, which is "--cached". > diff --git a/builtin/diff.c b/builtin/diff.c > index dd8ce688ba7..cbf7b51c7c0 100644 > --- a/builtin/diff.c > +++ b/builtin/diff.c > @@ -437,6 +437,9 @@ int cmd_diff(int argc, const char **argv, const char *prefix) > > prefix = setup_git_directory_gently(&nongit); > > + prepare_repo_settings(the_repository); > + the_repository->settings.command_requires_full_index = 0; > + Doesn't the code need to be protected with if (!nongit) { prepare_repo_settings(the_repository); the_repository->settings.command_requires_full_index = 0; } at the very least? It may be that the code is getting lucky because the_repository may be initialized with a random value (after all, when we are not in a repository, there is nowhere to read the on-disk settings from) and we may even be able to set a bit in the settings structure without crashing, but conceptually, doing the above when we _know_ we are not in any repository is simply wrong. I wonder if prepare_repo_settings() needs be more strict. For example, shouldn't it check if we have a repository to begin with and BUG() if it was called when there is not a repository? After all, it tries to read from the repository configuration file, so any necessary set-up to discover where the gitdir is must have been done already before it can be called. With such a safety feature to catch a programmer errors, perhaps the above could have been caught before the patch hit the list. Thoughts? Am I missing some chicken-and-egg situation where prepare_repo_settings() must be callable before we know where the repository is, or something, which justifies why the function is so loose in its sanity checks in the current form?
On 11/3/21 10:05 AM, Junio C Hamano wrote: > "Lessley Dennington via GitGitGadget" <gitgitgadget@gmail.com> > writes: > >> 2000.34: git diff --staged (full-v3) 0.08 0.08 +0.0% >> 2000.35: git diff --staged (full-v4) 0.08 0.08 +0.0% >> 2000.36: git diff --staged (sparse-v3) 0.17 0.04 -76.5% >> 2000.37: git diff --staged (sparse-v4) 0.16 0.04 -75.0% > > Please do not add more use of the synonym to the test suite, other > than the one that makes sure the synonym works the same way as the > real option, which is "--cached". > Thank you, changed for v4. >> diff --git a/builtin/diff.c b/builtin/diff.c >> index dd8ce688ba7..cbf7b51c7c0 100644 >> --- a/builtin/diff.c >> +++ b/builtin/diff.c >> @@ -437,6 +437,9 @@ int cmd_diff(int argc, const char **argv, const char *prefix) >> >> prefix = setup_git_directory_gently(&nongit); >> >> + prepare_repo_settings(the_repository); >> + the_repository->settings.command_requires_full_index = 0; >> + > > Doesn't the code need to be protected with > > if (!nongit) { > prepare_repo_settings(the_repository); > the_repository->settings.command_requires_full_index = 0; > } > > at the very least? It may be that the code is getting lucky because > the_repository may be initialized with a random value (after all, > when we are not in a repository, there is nowhere to read the > on-disk settings from) and we may even be able to set a bit in the > settings structure without crashing, but conceptually, doing the > above when we _know_ we are not in any repository is simply wrong. > > I wonder if prepare_repo_settings() needs be more strict. For > example, shouldn't it check if we have a repository to begin with > and BUG() if it was called when there is not a repository? After > all, it tries to read from the repository configuration file, so any > necessary set-up to discover where the gitdir is must have been done > already before it can be called. > > With such a safety feature to catch a programmer errors, perhaps the > above could have been caught before the patch hit the list. > > Thoughts? Am I missing some chicken-and-egg situation where > prepare_repo_settings() must be callable before we know where the > repository is, or something, which justifies why the function is so > loose in its sanity checks in the current form? > > This seems like a good idea. I've added both the nongit check and the prepare_repo_settings() updates you've suggested for v4, pending review by my team. Best, Lessley
diff --git a/builtin/diff.c b/builtin/diff.c index dd8ce688ba7..cbf7b51c7c0 100644 --- a/builtin/diff.c +++ b/builtin/diff.c @@ -437,6 +437,9 @@ int cmd_diff(int argc, const char **argv, const char *prefix) prefix = setup_git_directory_gently(&nongit); + prepare_repo_settings(the_repository); + the_repository->settings.command_requires_full_index = 0; + if (!no_index) { /* * Treat git diff with at least one path outside of the diff --git a/t/perf/p2000-sparse-operations.sh b/t/perf/p2000-sparse-operations.sh index bfd332120c8..bff93f16e93 100755 --- a/t/perf/p2000-sparse-operations.sh +++ b/t/perf/p2000-sparse-operations.sh @@ -113,5 +113,7 @@ test_perf_on_all git checkout -f - test_perf_on_all git reset test_perf_on_all git reset --hard test_perf_on_all git reset -- does-not-exist +test_perf_on_all git diff +test_perf_on_all git diff --staged test_done diff --git a/t/t1092-sparse-checkout-compatibility.sh b/t/t1092-sparse-checkout-compatibility.sh index 44d5e11c762..53524660759 100755 --- a/t/t1092-sparse-checkout-compatibility.sh +++ b/t/t1092-sparse-checkout-compatibility.sh @@ -832,6 +832,52 @@ test_expect_success 'sparse-index is not expanded: merge conflict in cone' ' ) ' +test_expect_success 'sparse index is not expanded: diff' ' + init_repos && + + write_script edit-contents <<-\EOF && + echo text >>$1 + EOF + + # Add file within cone + test_sparse_match git sparse-checkout set deep && + run_on_all ../edit-contents deep/testfile && + test_all_match git add deep/testfile && + run_on_all ../edit-contents deep/testfile && + + test_all_match git diff && + test_all_match git diff --staged && + ensure_not_expanded diff && + ensure_not_expanded diff --staged && + + # Add file outside cone + test_all_match git reset --hard && + run_on_all mkdir newdirectory && + run_on_all ../edit-contents newdirectory/testfile && + test_sparse_match git sparse-checkout set newdirectory && + test_all_match git add newdirectory/testfile && + run_on_all ../edit-contents newdirectory/testfile && + test_sparse_match git sparse-checkout set && + + test_all_match git diff && + test_all_match git diff --staged && + ensure_not_expanded diff && + ensure_not_expanded diff --staged && + + # Merge conflict outside cone + # The sparse checkout will report a warning that is not in the + # full checkout, so we use `run_on_all` instead of + # `test_all_match` + run_on_all git reset --hard && + test_all_match git checkout merge-left && + test_all_match test_must_fail git merge merge-right && + + test_all_match git diff && + test_all_match git diff --staged && + ensure_not_expanded diff && + ensure_not_expanded diff --staged +' + # NEEDSWORK: a sparse-checkout behaves differently from a full checkout # in this scenario, but it shouldn't. test_expect_success 'reset mixed and checkout orphan' '