Normalize inconsistent markdown headings in docstrings #16364
Merged
+43
−43
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Summary
I am working on a project that uses ruff linters' docs to generate a fine-tuning dataset for LLMs.
To achieve this, I first ran the command
ruff rule --all --output-format json
to retrieve all the rules. Then, I parsed the explanation field to get these 3 consistent sections:Why is this bad?
What it does
Example
However, during the initial processing, I noticed that the markdown headings are not that consistent. For instance:
Use instead
appears as a normal paragraph within theExample
section, but in the filecrates/ruff_linter/src/rules/flake8_bandit/rules/django_extra.rs
it is a level-2 headingExample
headings and 65Examples
. But all of them only have one example caseThis PR normalized these across all rules.
Test Plan
CI are passed.