Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Option for excluding rename commits from last update #148

Merged
merged 3 commits into from
Oct 17, 2024

Conversation

skywarth
Copy link
Contributor

@skywarth skywarth commented Sep 27, 2024

Discussed and proposed in #147

Did a couple of tests using one of my repositories where the aforementioned problem is present. Looking good so far.

This is just a draft PR, do not merge yet. Populate the TODO below if I'm missing something

TODO:

  • Manual test on an actual repository, via mkdocs serve
  • Automation tests defined in fixture
    • Maybe define a new fixture for this case
  • Update the documentation to include description for this new option

Sorry, something went wrong.

@timvink
Copy link
Owner

timvink commented Sep 27, 2024

I don't think we need an option for this. If there any use case where you would want this?

@skywarth
Copy link
Contributor Author

skywarth commented Sep 27, 2024

I don't think we need an option for this. If there any use case where you would want this?

You mean this should be the default behavior, without an option?

I think the feature itself would be useful, because in knowledge-base and documentation projects there are often quite a lot of folder hierarchy changes, and if we consider these changes as 'last update' it could be misleading. Say for an API documentation; it was written in 2020 and never updated, which is deprecated today. And lately it was decided to be moved to another folder, and now it shows as it is updated recently, and people may think it is up-to-date, whereas it is not.

The reason to have it as option and not default behavior could be; it brings an additional overhead since it runs with --follow or follow=True and some people might not want such overhead which might cripple the performance a bit where it is imperative.

@timvink
Copy link
Owner

timvink commented Sep 27, 2024

OK I had to dig back into the issue.

So if we use --follow, there is this really nasty bug that the date from git log --follow will be incorrect if there are any empty files committed in the git history (see https://blog.plover.com/prog/git-log-follow.html). Hence I was planning to make follow a plugin option, so users basically have to choose between incorrect creation/revision dates if there are file names, vs incorrect creation/revision dates if there are any empty files committed.

I do want to have sensible defaults. Here's my proposal:

  • We add a new plugin option follow, that defaults to True. This is a breaking change, because revision dates might changes (follow was already enabled for creation dates).
  • We add the diff filter r for revision dates. No plugin option, I think ignoring renames makes sense for both revision and creation dates. Does that diff filter = 'r'' need --follow? or can we just always add the diff filter? Maybe we can remove the conditional on follow like you have it now.
  • While we're making breaking changes, let's also add the --ignore-all-space and --ignore-blank-lines to the revision git log command. I think it's sensible, as those are not real edits. I can document it in the release notes.

Would you be willing to make the changes? I can take care of the documentation

commit_timestamp = git.log(
realpath, date="unix", format="%at", n=1, no_show_signature=True
realpath, date="unix", format="%at", diff_filter=diff_filter_param, n=1, no_show_signature=True, follow=follow_param
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Note that with follow enabled, you will get multiple commits, and you need to take the last one. See the logic used for the creation date.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Doesn't n=1 also achieve the same desired effect as selecting the last one using commit_timestamp = commit_timestamp.split()[-1] ?

I saw that and though maybe it is just awaiting refactoring.

If it doesn't achieve the same result, I can do it like the creation date.

@skywarth
Copy link
Contributor Author

Sure @timvink, though I didn't fully wrap my head around it I'll try to implement these. If it gets way above my head I'll let you know 😄

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
…follow introduced, added additional parameter for revision changes, diff-filter "r" is added for both creation and revision commit histories
@skywarth
Copy link
Contributor Author

Hi @timvink, just pushed the changes:

  • We add a new plugin option follow, that defaults to True. This is a breaking change, because revision dates might changes (follow was already enabled for creation dates).
  • We add the diff filter r for revision dates. No plugin option, I think ignoring renames makes sense for both revision and creation dates.
  • While we're making breaking changes, let's also add the --ignore-all-space and --ignore-blank-lines to the revision git log command. I think it's sensible, as those are not real edits. I can document it in the release notes.

About the "r" and --follow

"Does that diff filter = 'r'' need --follow? or can we just always add the diff filter? Maybe we can remove the conditional on follow like you have it now."

We can always add the diff-filter="r", when it is used along with --follow it essentially takes moving operations into account too. When it is used without the --follow it only tracks actual renames; renames that are limited to current directory, moving operations are not included since the file is not tracked/followed.

I got one question though; why did we add diff-filter "r" for creation dates as well? I was trying to come up with a case where it would make a difference but couldn't think of any, could you describe a case for it please?

And for the manual tests, here's the difference between master branch and this branch's outputs, based on one of my repositories:

  • problems.md
    • master branch, (deployed): Revision date September 24, 2024. Creation date July 16, 2022.
    • PR branch, version_history_follow_enabled: true: Revision date July 16, 2022. Creation date July 16, 2022.
    • PR branch, version_history_follow_enabled: false: Revision date September 24, 2024. Creation date September 24, 2024.
  • great-advertisements.md
    • master branch, (deployed): Revision date September 28, 2024. Creation date August 10, 2024.
    • PR branch, version_history_follow_enabled: true: September 28, 2024. Creation date August 10, 2024.
    • PR branch, version_history_follow_enabled: false: September 28, 2024. Creation date September 24, 2024.

Please note September 24, 2024 is the date I moved all files to another directory.

In conclusion: based on the test outputs, I think it achieves what we are looking for.

Let me know if you need additional changes or tests. I'm here to help.

Thank you

Sorry, something went wrong.

@skywarth skywarth marked this pull request as ready for review September 29, 2024 21:54
@skywarth skywarth changed the title Draft: Option for excluding rename commits from last update Option for excluding rename commits from last update Sep 29, 2024
@timvink
Copy link
Owner

timvink commented Oct 2, 2024

Nice work!

I got one question though; why did we add diff-filter "r" for creation dates as well? I was trying to come up with a case where it would make a difference but couldn't think of any, could you describe a case for it please?

I assume speed. Not sure anymore tbh.

Let me know if you need additional changes or tests. I'm here to help.

Cool!

  • can you rename version_history_follow_enabled to enable_git_follow
  • if you want, you can already update docs/options.md

Almost there!

@skywarth
Copy link
Contributor Author

skywarth commented Oct 3, 2024

Certainly @timvink, I'm slightly busy nowadays but no worries I'll let you know once I update the PR, stay tuned. Thank you.

@skywarth
Copy link
Contributor Author

Hi @timvink. Sorry for the delay, I was busy with relocating. Can you take a quick look and see if it's up to par?

@skywarth skywarth requested a review from timvink October 17, 2024 00:26
@timvink
Copy link
Owner

timvink commented Oct 17, 2024

Great stuff, thanks for all your effort to push this!

@timvink timvink merged commit 9e064cd into timvink:master Oct 17, 2024
@timvink
Copy link
Owner

timvink commented Oct 17, 2024

Will make a release this weekend

@timvink
Copy link
Owner

timvink commented Oct 22, 2024

@skywarth
Copy link
Contributor Author

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants