Message ID | 020b2f50c5703e8291577b008fdfa567093c6eab.1537551564.git.gitgitgadget@gmail.com (mailing list archive) |
---|---|
State | New, archived |
Headers | show |
Series | Use generation numbers for --topo-order | expand |
On 9/21/2018 1:39 PM, Derrick Stolee via GitGitGadget wrote: > From: Derrick Stolee <dstolee@microsoft.com> > > When running a command like 'git rev-list --topo-order HEAD', > Git performed the following steps: > > 1. Run limit_list(), which parses all reachable commits, > adds them to a linked list, and distributes UNINTERESTING > flags. If all unprocessed commits are UNINTERESTING, then > it may terminate without walking all reachable commits. > This does not occur if we do not specify UNINTERESTING > commits. > > 2. Run sort_in_topological_order(), which is an implementation > of Kahn's algorithm. It first iterates through the entire > set of important commits and computes the in-degree of each > (plus one, as we use 'zero' as a special value here). Then, > we walk the commits in priority order, adding them to the > priority queue if and only if their in-degree is one. As > we remove commits from this priority queue, we decrement the > in-degree of their parents. > > 3. While we are peeling commits for output, get_revision_1() > uses pop_commit on the full list of commits computed by > sort_in_topological_order(). > > In the new algorithm, these three steps correspond to three > different commit walks. We run these walks simultaneously, > and advance each only as far as necessary to satisfy the > requirements of the 'higher order' walk. We know when we can > pause each walk by using generation numbers from the commit- > graph feature. Hello, Git contributors. I understand that this commit message and patch are pretty daunting. There is a lot to read and digest. I would like to see if anyone is willing to put the work in to review this patch, as I quite like what it does, and the performance numbers below. > In my local testing, I used the following Git commands on the > Linux repository in three modes: HEAD~1 with no commit-graph, > HEAD~1 with a commit-graph, and HEAD with a commit-graph. This > allows comparing the benefits we get from parsing commits from > the commit-graph and then again the benefits we get by > restricting the set of commits we walk. > > Test: git rev-list --topo-order -100 HEAD > HEAD~1, no commit-graph: 6.80 s > HEAD~1, w/ commit-graph: 0.77 s > HEAD, w/ commit-graph: 0.02 s > > Test: git rev-list --topo-order -100 HEAD -- tools > HEAD~1, no commit-graph: 9.63 s > HEAD~1, w/ commit-graph: 6.06 s > HEAD, w/ commit-graph: 0.06 s If there is something I can do to make this easier to review, then please let me know. Thanks, -Stolee
Derrick Stolee <stolee@gmail.com> writes: > On 9/21/2018 1:39 PM, Derrick Stolee via GitGitGadget wrote: > Hello, Git contributors. > > I understand that this commit message and patch are pretty > daunting. There is a lot to read and digest. I would like to see if > anyone is willing to put the work in to review this patch, as I quite > like what it does, and the performance numbers below. I'll try to find time to review v3 of this patch series this week. >> In my local testing, I used the following Git commands on the >> Linux repository in three modes: HEAD~1 with no commit-graph, >> HEAD~1 with a commit-graph, and HEAD with a commit-graph. This >> allows comparing the benefits we get from parsing commits from >> the commit-graph and then again the benefits we get by >> restricting the set of commits we walk. >> >> Test: git rev-list --topo-order -100 HEAD >> HEAD~1, no commit-graph: 6.80 s >> HEAD~1, w/ commit-graph: 0.77 s >> HEAD, w/ commit-graph: 0.02 s >> >> Test: git rev-list --topo-order -100 HEAD -- tools >> HEAD~1, no commit-graph: 9.63 s >> HEAD~1, w/ commit-graph: 6.06 s >> HEAD, w/ commit-graph: 0.06 s > > If there is something I can do to make this easier to review, then > please let me know. > > Thanks, > -Stolee
On Fri, Sep 21, 2018 at 10:39:36AM -0700, Derrick Stolee via GitGitGadget wrote: > From: Derrick Stolee <dstolee@microsoft.com> > > When running a command like 'git rev-list --topo-order HEAD', > Git performed the following steps: > [...] > In the new algorithm, these three steps correspond to three > different commit walks. We run these walks simultaneously, A minor nit, but this commit message doesn't mention the most basic thing up front: that its main purpose is to introduce a new algorithm for topo-order. ;) It's obvious in the context of reviewing the series, but somebody reading "git log" later may want a bit more. Perhaps: revision.c: implement generation-based topo-order algorithm as a subject, and/or an introductory paragraph like: The current --topo-order algorithm requires walking all commits we are going to output up front, topo-sorting them, all before outputting the first value. This patch introduces a new algorithm which uses stored generation numbers to incrementally walk in topo-order, outputting commits as we go. Other than that, I find this to be a wonderfully explanatory commit message. :) > The walks are as follows: > > 1. EXPLORE: using the explore_queue priority queue (ordered by > maximizing the generation number), parse each reachable > commit until all commits in the queue have generation > number strictly lower than needed. During this walk, update > the UNINTERESTING flags as necessary. OK, this makes sense. If we know that everybody else in our queue is at generation X, then it is safe to output a commit at generation greater than X. I think this by itself would allow us to implement "show no parents before all of its children are shown", right? But --topo-order promises a bit more: "avoid showing commits no multiple lines of history intermixed". I guess also INFINITY generation numbers need more. For a real generation number, we know that "gen(A) == gen(B)" implies that there is no ancestry relationship between the two. But not so for INFINITY. > 2. INDEGREE: using the indegree_queue priority queue (ordered > by maximizing the generation number), add one to the in- > degree of each parent for each commit that is walked. Since > we walk in order of decreasing generation number, we know > that discovering an in-degree value of 0 means the value for > that commit was not initialized, so should be initialized to > two. (Recall that in-degree value "1" is what we use to say a > commit is ready for output.) As we iterate the parents of a > commit during this walk, ensure the EXPLORE walk has walked > beyond their generation numbers. I wondered how this would work for INFINITY. We can't know the order of a bunch of INFINITY nodes at all, so we never know when their in-degree values are "done". But if I understand the EXPLORE walk, we'd basically walk all of INFINITY down to something with a real generation number. Is that right? But after that, I'm not totally clear on why we need this INDEGREE walk. > 3. TOPO: using the topo_queue priority queue (ordered based on > the sort_order given, which could be commit-date, author- > date, or typical topo-order which treats the queue as a LIFO > stack), remove a commit from the queue and decrement the > in-degree of each parent. If a parent has an in-degree of > one, then we add it to the topo_queue. Before we decrement > the in-degree, however, ensure the INDEGREE walk has walked > beyond that generation number. OK, this makes sense to make --author-date-order, etc, work. Potentially those numbers might have no relationship at all with the graph structure, but we promise "no parent before its children are shown", so this is really just a tie-breaker after the topo-sort anyway. As long as steps 1 and 2 are correct and produce a complete set of commits for one "layer", this should be OK. I guess I'm not 100% convinced that we don't have a case where we haven't yet parsed or considered some commit that we know cannot have an ancestry relationship with commits we are outputting. But it may have an author-date-order relationship. (I'm not at all convinced that this _is_ a problem, and I suspect it isn't; I'm only suggesting I haven't fully grokked the proof). > --- > object.h | 4 +- > revision.c | 196 +++++++++++++++++++++++++++++++++++++++++++++++++++-- > revision.h | 2 + > 3 files changed, 194 insertions(+), 8 deletions(-) I'll pause here on evaluating the actual code. It looks sane from a cursory read, but there's no point in digging further until I'm sure I fully understand the algorithm. I think that needs a little more brain power from me, and hopefully discussion around my comments above will help trigger that. -Peff
On 10/11/2018 11:35 AM, Jeff King wrote: > On Fri, Sep 21, 2018 at 10:39:36AM -0700, Derrick Stolee via GitGitGadget wrote: > >> From: Derrick Stolee <dstolee@microsoft.com> >> >> When running a command like 'git rev-list --topo-order HEAD', >> Git performed the following steps: >> [...] >> In the new algorithm, these three steps correspond to three >> different commit walks. We run these walks simultaneously, > A minor nit, but this commit message doesn't mention the most basic > thing up front: that its main purpose is to introduce a new algorithm > for topo-order. ;) > > It's obvious in the context of reviewing the series, but somebody > reading "git log" later may want a bit more. Perhaps: > > revision.c: implement generation-based topo-order algorithm > > as a subject, and/or an introductory paragraph like: > > The current --topo-order algorithm requires walking all commits we > are going to output up front, topo-sorting them, all before > outputting the first value. This patch introduces a new algorithm > which uses stored generation numbers to incrementally walk in > topo-order, outputting commits as we go. > > Other than that, I find this to be a wonderfully explanatory commit > message. :) Good idea. I'll make that change. > >> The walks are as follows: >> >> 1. EXPLORE: using the explore_queue priority queue (ordered by >> maximizing the generation number), parse each reachable >> commit until all commits in the queue have generation >> number strictly lower than needed. During this walk, update >> the UNINTERESTING flags as necessary. > OK, this makes sense. If we know that everybody else in our queue is at > generation X, then it is safe to output a commit at generation greater > than X. > > I think this by itself would allow us to implement "show no parents > before all of its children are shown", right? But --topo-order promises > a bit more: "avoid showing commits no multiple lines of history > intermixed". > > I guess also INFINITY generation numbers need more. For a real > generation number, we know that "gen(A) == gen(B)" implies that there is > no ancestry relationship between the two. But not so for INFINITY. Yeah, to deal with INFINITY (and ZERO, but that won't happen if generation_numbers_enabled() returns true), we treat gen(A) == gen(B) as a "no information" state. So, to output a commit at generation X, we need to have our maximum generation number in the unexplored area to be at most X - 1. You'll see strict inequality when checking generations. >> 2. INDEGREE: using the indegree_queue priority queue (ordered >> by maximizing the generation number), add one to the in- >> degree of each parent for each commit that is walked. Since >> we walk in order of decreasing generation number, we know >> that discovering an in-degree value of 0 means the value for >> that commit was not initialized, so should be initialized to >> two. (Recall that in-degree value "1" is what we use to say a >> commit is ready for output.) As we iterate the parents of a >> commit during this walk, ensure the EXPLORE walk has walked >> beyond their generation numbers. > I wondered how this would work for INFINITY. We can't know the order of > a bunch of INFINITY nodes at all, so we never know when their in-degree > values are "done". But if I understand the EXPLORE walk, we'd basically > walk all of INFINITY down to something with a real generation number. Is > that right? > > But after that, I'm not totally clear on why we need this INDEGREE walk. The INDEGREE walk is an important element for Kahn's algorithm. The final output order is dictated by peeling commits of "indegree zero" to ensure all children are output before their parents. (Note: since we use literal 0 to mean "uninitialized", we peel commits when the indegree slab has value 1.) This walk replaces the indegree logic from sort_in_topological_order(). That method performs one walk that fills the indegree slab, then another walk that peels the commits with indegree 0 and inserts them into a list. >> 3. TOPO: using the topo_queue priority queue (ordered based on >> the sort_order given, which could be commit-date, author- >> date, or typical topo-order which treats the queue as a LIFO >> stack), remove a commit from the queue and decrement the >> in-degree of each parent. If a parent has an in-degree of >> one, then we add it to the topo_queue. Before we decrement >> the in-degree, however, ensure the INDEGREE walk has walked >> beyond that generation number. > OK, this makes sense to make --author-date-order, etc, work. Potentially > those numbers might have no relationship at all with the graph > structure, but we promise "no parent before its children are shown", so > this is really just a tie-breaker after the topo-sort anyway. As long as > steps 1 and 2 are correct and produce a complete set of commits for one > "layer", this should be OK. > > I guess I'm not 100% convinced that we don't have a case where we > haven't yet parsed or considered some commit that we know cannot have an > ancestry relationship with commits we are outputting. But it may have an > author-date-order relationship. > > (I'm not at all convinced that this _is_ a problem, and I suspect it > isn't; I'm only suggesting I haven't fully grokked the proof). The INDEGREE walk should not stop until it has explored at least to the point that all indegree 0 commits are exposed (relative to the current state of the walk). At initialization, we walk from all starting positions until the maximum generation number in our queue is less than the minimum generation of a starting commit. The starting positions that have indegree 0 are then added to the topo_queue, and the sort order dictates which is the best. From this point on, we can only create a new "indegree 0" commit by removing a commit from the topo_queue and decrementing the indegree of its parents. Those parents with indegree 0 are inserted into topo_queue and compared to all other indegree 0 commits. Thus, we will always explore enough to make the right choice relative our sort order. > >> --- >> object.h | 4 +- >> revision.c | 196 +++++++++++++++++++++++++++++++++++++++++++++++++++-- >> revision.h | 2 + >> 3 files changed, 194 insertions(+), 8 deletions(-) > I'll pause here on evaluating the actual code. It looks sane from a > cursory read, but there's no point in digging further until I'm sure I > fully understand the algorithm. I think that needs a little more brain > power from me, and hopefully discussion around my comments above will > help trigger that. Thanks for reading! I understand that reading the code is useless without understanding the high-level concepts. I'm happy to iterate on this. If I can find a better way to explain the algorithm in the commit message to avoid the "huh?" moments above, then I will. Thanks, -Stolee
On Fri, Sep 21, 2018 at 10:39 AM Derrick Stolee via GitGitGadget <gitgitgadget@gmail.com> wrote: > > From: Derrick Stolee <dstolee@microsoft.com> [...] > For the test above, I specifically selected a path that is changed > frequently, including by merge commits. A less-frequently-changed > path (such as 'README') has similar end-to-end time since we need > to walk the same number of commits (before determining we do not > have 100 hits). However, get get the benefit that the output is "get get"
On Thu, Oct 11, 2018 at 12:21:44PM -0400, Derrick Stolee wrote: > > > 2. INDEGREE: using the indegree_queue priority queue (ordered > > > by maximizing the generation number), add one to the in- > > > degree of each parent for each commit that is walked. Since > > > we walk in order of decreasing generation number, we know > > > that discovering an in-degree value of 0 means the value for > > > that commit was not initialized, so should be initialized to > > > two. (Recall that in-degree value "1" is what we use to say a > > > commit is ready for output.) As we iterate the parents of a > > > commit during this walk, ensure the EXPLORE walk has walked > > > beyond their generation numbers. > > I wondered how this would work for INFINITY. We can't know the order of > > a bunch of INFINITY nodes at all, so we never know when their in-degree > > values are "done". But if I understand the EXPLORE walk, we'd basically > > walk all of INFINITY down to something with a real generation number. Is > > that right? > > > > But after that, I'm not totally clear on why we need this INDEGREE walk. > > The INDEGREE walk is an important element for Kahn's algorithm. The final > output order is dictated by peeling commits of "indegree zero" to ensure all > children are output before their parents. (Note: since we use literal 0 to > mean "uninitialized", we peel commits when the indegree slab has value 1.) > > This walk replaces the indegree logic from sort_in_topological_order(). That > method performs one walk that fills the indegree slab, then another walk > that peels the commits with indegree 0 and inserts them into a list. I guess my big question here was: if we have generation numbers, do we need Kahn's algorithm? That is, in a fully populated set of generation numbers (i.e., no INFINITY), we could always just pick a commit with the highest generation number to show. So if we EXPLORE down to a real generation number in phase 1, why do we need to care about INDEGREE anymore? Or am I wrong that we always have a real generation number (i.e., not INFINITY) after EXPLORE? (And if so, why is exploring to a real generation number a bad idea; presumably it's due to a worst-case that goes deeper than we'd otherwise need to here). > [...] Everything else you said here made perfect sense. -Peff
On 10/25/2018 5:43 AM, Jeff King wrote: > On Thu, Oct 11, 2018 at 12:21:44PM -0400, Derrick Stolee wrote: > >>>> 2. INDEGREE: using the indegree_queue priority queue (ordered >>>> by maximizing the generation number), add one to the in- >>>> degree of each parent for each commit that is walked. Since >>>> we walk in order of decreasing generation number, we know >>>> that discovering an in-degree value of 0 means the value for >>>> that commit was not initialized, so should be initialized to >>>> two. (Recall that in-degree value "1" is what we use to say a >>>> commit is ready for output.) As we iterate the parents of a >>>> commit during this walk, ensure the EXPLORE walk has walked >>>> beyond their generation numbers. >>> I wondered how this would work for INFINITY. We can't know the order of >>> a bunch of INFINITY nodes at all, so we never know when their in-degree >>> values are "done". But if I understand the EXPLORE walk, we'd basically >>> walk all of INFINITY down to something with a real generation number. Is >>> that right? >>> >>> But after that, I'm not totally clear on why we need this INDEGREE walk. >> The INDEGREE walk is an important element for Kahn's algorithm. The final >> output order is dictated by peeling commits of "indegree zero" to ensure all >> children are output before their parents. (Note: since we use literal 0 to >> mean "uninitialized", we peel commits when the indegree slab has value 1.) >> >> This walk replaces the indegree logic from sort_in_topological_order(). That >> method performs one walk that fills the indegree slab, then another walk >> that peels the commits with indegree 0 and inserts them into a list. > I guess my big question here was: if we have generation numbers, do we > need Kahn's algorithm? That is, in a fully populated set of generation > numbers (i.e., no INFINITY), we could always just pick a commit with the > highest generation number to show. > > So if we EXPLORE down to a real generation number in phase 1, why do we > need to care about INDEGREE anymore? Or am I wrong that we always have a > real generation number (i.e., not INFINITY) after EXPLORE? (And if so, > why is exploring to a real generation number a bad idea; presumably > it's due to a worst-case that goes deeper than we'd otherwise need to > here). The issue is that we our final order (used by level 3) is unrelated to generation number. Yes, if we prioritized by generation number then we could output the commits in _some_ order that doesn't violate topological constraints. However, we are asking for a different priority, which is different than the generation number priority. In the case of "--topo-order", we want to output the commits reachable from the second parent of a merge before the commits reachable from the first parent. However, in most cases the generation number of the first parent is higher than the second parent (there are more things in the merge chain than in a small topic that got merged). The INDEGREE is what allows us to know when we can peel these commits while still respecting the priority we want at the end. Thanks, -Stolee
diff --git a/object.h b/object.h index 0feb90ae61..796792cb32 100644 --- a/object.h +++ b/object.h @@ -59,7 +59,7 @@ struct object_array { /* * object flag allocation: - * revision.h: 0---------10 2526 + * revision.h: 0---------10 25----28 * fetch-pack.c: 01 * negotiator/default.c: 2--5 * walker.c: 0-2 @@ -78,7 +78,7 @@ struct object_array { * builtin/show-branch.c: 0-------------------------------------------26 * builtin/unpack-objects.c: 2021 */ -#define FLAG_BITS 27 +#define FLAG_BITS 29 /* * The object type is stored in 3 bits. diff --git a/revision.c b/revision.c index 92012d5f45..c5d0cb6599 100644 --- a/revision.c +++ b/revision.c @@ -26,6 +26,7 @@ #include "argv-array.h" #include "commit-reach.h" #include "commit-graph.h" +#include "prio-queue.h" volatile show_early_output_fn_t show_early_output; @@ -2895,30 +2896,213 @@ static int mark_uninteresting(const struct object_id *oid, return 0; } -struct topo_walk_info {}; +define_commit_slab(indegree_slab, int); + +struct topo_walk_info { + uint32_t min_generation; + struct prio_queue explore_queue; + struct prio_queue indegree_queue; + struct prio_queue topo_queue; + struct indegree_slab indegree; + struct author_date_slab author_date; +}; + +static inline void test_flag_and_insert(struct prio_queue *q, struct commit *c, int flag) +{ + if (c->object.flags & flag) + return; + + c->object.flags |= flag; + prio_queue_put(q, c); +} + +static void explore_walk_step(struct rev_info *revs) +{ + struct topo_walk_info *info = revs->topo_walk_info; + struct commit_list *p; + struct commit *c = prio_queue_get(&info->explore_queue); + + if (!c) + return; + + if (parse_commit_gently(c, 1) < 0) + return; + + if (revs->max_age != -1 && (c->date < revs->max_age)) + c->object.flags |= UNINTERESTING; + + if (add_parents_to_list(revs, c, NULL, NULL) < 0) + return; + + if (c->object.flags & UNINTERESTING) + mark_parents_uninteresting(c); + + for (p = c->parents; p; p = p->next) + test_flag_and_insert(&info->explore_queue, p->item, TOPO_WALK_EXPLORED); +} + +static void explore_to_depth(struct rev_info *revs, + uint32_t gen) +{ + struct topo_walk_info *info = revs->topo_walk_info; + struct commit *c; + while ((c = prio_queue_peek(&info->explore_queue)) && + c->generation >= gen) + explore_walk_step(revs); +} + +static void indegree_walk_step(struct rev_info *revs) +{ + struct commit_list *p; + struct topo_walk_info *info = revs->topo_walk_info; + struct commit *c = prio_queue_get(&info->indegree_queue); + + if (!c) + return; + + if (parse_commit_gently(c, 1) < 0) + return; + + explore_to_depth(revs, c->generation); + + if (parse_commit_gently(c, 1) < 0) + return; + + for (p = c->parents; p; p = p->next) { + struct commit *parent = p->item; + int *pi = indegree_slab_at(&info->indegree, parent); + + if (*pi) + (*pi)++; + else + *pi = 2; + + test_flag_and_insert(&info->indegree_queue, parent, TOPO_WALK_INDEGREE); + + if (revs->first_parent_only) + return; + } +} + +static void compute_indegrees_to_depth(struct rev_info *revs) +{ + struct topo_walk_info *info = revs->topo_walk_info; + struct commit *c; + while ((c = prio_queue_peek(&info->indegree_queue)) && + c->generation >= info->min_generation) + indegree_walk_step(revs); +} static void init_topo_walk(struct rev_info *revs) { struct topo_walk_info *info; + struct commit_list *list; revs->topo_walk_info = xmalloc(sizeof(struct topo_walk_info)); info = revs->topo_walk_info; memset(info, 0, sizeof(struct topo_walk_info)); - limit_list(revs); - sort_in_topological_order(&revs->commits, revs->sort_order); + init_indegree_slab(&info->indegree); + memset(&info->explore_queue, '\0', sizeof(info->explore_queue)); + memset(&info->indegree_queue, '\0', sizeof(info->indegree_queue)); + memset(&info->topo_queue, '\0', sizeof(info->topo_queue)); + + switch (revs->sort_order) { + default: /* REV_SORT_IN_GRAPH_ORDER */ + info->topo_queue.compare = NULL; + break; + case REV_SORT_BY_COMMIT_DATE: + info->topo_queue.compare = compare_commits_by_commit_date; + break; + case REV_SORT_BY_AUTHOR_DATE: + init_author_date_slab(&info->author_date); + info->topo_queue.compare = compare_commits_by_author_date; + info->topo_queue.cb_data = &info->author_date; + break; + } + + info->explore_queue.compare = compare_commits_by_gen_then_commit_date; + info->indegree_queue.compare = compare_commits_by_gen_then_commit_date; + + info->min_generation = GENERATION_NUMBER_INFINITY; + for (list = revs->commits; list; list = list->next) { + struct commit *c = list->item; + test_flag_and_insert(&info->explore_queue, c, TOPO_WALK_EXPLORED); + test_flag_and_insert(&info->indegree_queue, c, TOPO_WALK_INDEGREE); + + if (parse_commit_gently(c, 1)) + continue; + if (c->generation < info->min_generation) + info->min_generation = c->generation; + } + + for (list = revs->commits; list; list = list->next) { + struct commit *c = list->item; + *(indegree_slab_at(&info->indegree, c)) = 1; + + if (revs->sort_order == REV_SORT_BY_AUTHOR_DATE) + record_author_date(&info->author_date, c); + } + compute_indegrees_to_depth(revs); + + for (list = revs->commits; list; list = list->next) { + struct commit *c = list->item; + + if (*(indegree_slab_at(&info->indegree, c)) == 1) + prio_queue_put(&info->topo_queue, c); + } + + /* + * This is unfortunate; the initial tips need to be shown + * in the order given from the revision traversal machinery. + */ + if (revs->sort_order == REV_SORT_IN_GRAPH_ORDER) + prio_queue_reverse(&info->topo_queue); } static struct commit *next_topo_commit(struct rev_info *revs) { - return pop_commit(&revs->commits); + struct commit *c; + struct topo_walk_info *info = revs->topo_walk_info; + + /* pop next off of topo_queue */ + c = prio_queue_get(&info->topo_queue); + + if (c) + *(indegree_slab_at(&info->indegree, c)) = 0; + + return c; } static void expand_topo_walk(struct rev_info *revs, struct commit *commit) { - if (add_parents_to_list(revs, commit, &revs->commits, NULL) < 0) { + struct commit_list *p; + struct topo_walk_info *info = revs->topo_walk_info; + if (add_parents_to_list(revs, commit, NULL, NULL) < 0) { if (!revs->ignore_missing_links) die("Failed to traverse parents of commit %s", - oid_to_hex(&commit->object.oid)); + oid_to_hex(&commit->object.oid)); + } + + for (p = commit->parents; p; p = p->next) { + struct commit *parent = p->item; + int *pi; + + if (parse_commit_gently(parent, 1) < 0) + continue; + + if (parent->generation < info->min_generation) { + info->min_generation = parent->generation; + compute_indegrees_to_depth(revs); + } + + pi = indegree_slab_at(&info->indegree, parent); + + (*pi)--; + if (*pi == 1) + prio_queue_put(&info->topo_queue, parent); + + if (revs->first_parent_only) + return; } } diff --git a/revision.h b/revision.h index e7bd059d80..7cc3bf5fc0 100644 --- a/revision.h +++ b/revision.h @@ -24,6 +24,8 @@ #define USER_GIVEN (1u<<25) /* given directly by the user */ #define TRACK_LINEAR (1u<<26) #define ALL_REV_FLAGS (((1u<<11)-1) | USER_GIVEN | TRACK_LINEAR) +#define TOPO_WALK_EXPLORED (1u<<27) +#define TOPO_WALK_INDEGREE (1u<<28) #define DECORATE_SHORT_REFS 1 #define DECORATE_FULL_REFS 2