From: Junio C Hamano <gitster@pobox•com>
To: "Bernhard R. Link" <brl+git@mail•brlink.eu>
Cc: git@vger•kernel.org
Subject: Re: [RFC] blame: new option to better handle merged cherry-picks
Date: Thu, 02 Jan 2014 12:29:43 -0800 [thread overview]
Message-ID: <xmqqlhyyp1oo.fsf@gitster.dls.corp.google.com> (raw)
In-Reply-To: <20140102175529.GA4669@client.brlink.eu> (Bernhard R. Link's message of "Thu, 2 Jan 2014 18:55:37 +0100")
"Bernhard R. Link" <brl+git@mail•brlink.eu> writes:
> Allows to disable the git blame optimization of assuming that if there is a
> parent of a merge commit that has the exactly same file content, then
> only this parent is to be looked at.
I think this is what we usually call --full-history in "git log"
family, but more importantly, I do not think this is solving a valid
problem.
> This optimization, while being faster in the usual case, means that in
> the case of cherry-picks the blamed commit depends on which other commits
> touched a file.
>
> If for example one commit A modified both files b and c. And there are
> commits B and C, B only modifies file b and C only modifies file c
> (so that no conflicts happen), and assume A is cherry-picked as A'
> and the two branches then merged:
>
> --o-----B---A
> \ \
> ---C---A'--M---
So the contents of b at M is as the same as in A, so following 'b'
will see A and B changed that path, which is correct.
The contents of c at M is? It is different from A because at A c
lacks the change made to it at C. The merged result at M would
match C in A', no? So following 'c' will see A' and C changed that
path, no?
So what is wrong about it? If the original history were like this
instead, and A' were a cherry-pick of A, then what should happen?
> --o-----B---A'
> \ \
> ---C---A---M---
Don't we want to see c blamed the same way?
Also, when handling a merge, we have to handle parents sequencially,
checking the difference between M with its first parent first, and
then passing blame for the remaining common lines to the remaining
parents. If you flip the order of parents of M when you merge A and
A' in your original history, and with your patch, what would you
see when you blame c? Wouldn't it notice that M:c is identical to c
in its first parent (now A') and pass the whole blame to A' anyway
with or without your change?
> Then without this new option git blame blames the A|A' changes of
> file b to A while blaming the changes of c to A'.
> With the new option --no-parent-shortcut it blames both changes to the
> same commit.
>
> Signed-off-by: Bernhard R. Link <brlink@debian•org>
> ---
> Documentation/blame-options.txt | 6 ++++++
> builtin/blame.c | 5 ++++-
> 2 files changed, 10 insertions(+), 1 deletion(-)
>
> diff --git a/Documentation/blame-options.txt b/Documentation/blame-options.txt
> index 0cebc4f..55dd12b 100644
> --- a/Documentation/blame-options.txt
> +++ b/Documentation/blame-options.txt
> @@ -48,6 +48,12 @@ include::line-range-format.txt[]
> Show the result incrementally in a format designed for
> machine consumption.
>
> +--no-parent-shortcut::
> + Always look at all parents of a merge and do not shortcut
> + to the first parent with no changes to the file looked at.
> + This takes more time but produces more reliable results
> + if branches with cherry-picked commits were merged.
> +
> --encoding=<encoding>::
> Specifies the encoding used to output author names
> and commit summaries. Setting it to `none` makes blame
> diff --git a/builtin/blame.c b/builtin/blame.c
> index 4916eb2..dab2c36 100644
> --- a/builtin/blame.c
> +++ b/builtin/blame.c
> @@ -45,6 +45,7 @@ static int incremental;
> static int xdl_opts;
> static int abbrev = -1;
> static int no_whole_file_rename;
> +static int no_parent_shortcut;
>
> static enum date_mode blame_date_mode = DATE_ISO8601;
> static size_t blame_date_width;
> @@ -1248,7 +1249,8 @@ static void pass_blame(struct scoreboard *sb, struct origin *origin, int opt)
> porigin = find(sb, p, origin);
> if (!porigin)
> continue;
> - if (!hashcmp(porigin->blob_sha1, origin->blob_sha1)) {
> + if (!no_parent_shortcut &&
> + !hashcmp(porigin->blob_sha1, origin->blob_sha1)) {
> pass_whole_blame(sb, origin, porigin);
> origin_decref(porigin);
> goto finish;
> @@ -2247,6 +2249,7 @@ int cmd_blame(int argc, const char **argv, const char *prefix)
> static const char *contents_from = NULL;
> static const struct option options[] = {
> OPT_BOOL(0, "incremental", &incremental, N_("Show blame entries as we find them, incrementally")),
> + OPT_BOOL(0, "no-parent-shortcut", &no_parent_shortcut, N_("Don't take shortcuts in some merges but handle cherry-picks better")),
> OPT_BOOL('b', NULL, &blank_boundary, N_("Show blank SHA-1 for boundary commits (Default: off)")),
> OPT_BOOL(0, "root", &show_root, N_("Do not treat root commits as boundaries (Default: off)")),
> OPT_BOOL(0, "show-stats", &show_stats, N_("Show work cost statistics")),
next prev parent reply other threads:[~2014-01-02 20:29 UTC|newest]
Thread overview: 4+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-01-02 17:55 [RFC] blame: new option to better handle merged cherry-picks Bernhard R. Link
2014-01-02 20:29 ` Junio C Hamano [this message]
2014-01-02 21:15 ` Bernhard R. Link
2014-01-02 21:48 ` Junio C Hamano
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=xmqqlhyyp1oo.fsf@gitster.dls.corp.google.com \
--to=gitster@pobox$(echo .)com \
--cc=brl+git@mail$(echo .)brlink.eu \
--cc=git@vger$(echo .)kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox