Use empty range when there's "gap" in token source #11032

dhruvmanila · 2024-04-19T10:09:37Z

Summary

This fixes a bug where the parser would panic when there is a "gap" in
the token source.

What's a gap?

The end of the last token is compared with the start of the current token using <= instead
of just == because there could be whitespace between the two tokens. For example:

#     last token end
#     | current token (newline) start
#     v v
def foo \n
#      ^
#      assume there's trailing whitespace here

Or, there could be tokens that are considered "trivia" and thus aren't emitted by the token
source. These are comments and non-logical newlines. For example:

#     last token end
#     v
def foo # comment\n
#                ^ current token (newline) start

In either of the above cases, there's a "gap" between the end of the last token and start
of the current token.
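
To make the gap handling concrete, here is a minimal sketch of the range computation. It is not the code from this PR: `TextRange` below is a simplified stand-in using plain `u32` offsets, and `node_range` is written as a free function whose signature is an assumption made for illustration. Only the `<=` comparison and the fallback to an empty range are taken from the description above.

```rust
/// Simplified stand-in for a half-open source range (`start..end`) in byte offsets.
/// This type is an illustration, not the parser's real range type.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct TextRange {
    start: u32,
    end: u32,
}

impl TextRange {
    /// Panics when `start > end`, which is the failure mode the PR guards against.
    fn new(start: u32, end: u32) -> Self {
        assert!(start <= end, "range start must not exceed range end");
        Self { start, end }
    }

    /// A zero-length range located at `offset`.
    fn empty(offset: u32) -> Self {
        Self { start: offset, end: offset }
    }
}

/// Computes the range of a node that began at `start`, given the end offset of
/// the last consumed token. (Hypothetical free-function form.)
fn node_range(start: u32, last_token_end: u32) -> TextRange {
    if last_token_end <= start {
        // No token was consumed for this node, so `start` is the start of the
        // current token. Whitespace or trivia may sit in between, which is why
        // the comparison is `<=` and not `==`.
        TextRange::empty(last_token_end)
    } else {
        // At least one token was consumed, so the node spans up to its end.
        TextRange::new(start, last_token_end)
    }
}
```

With the offsets from the first example (`def foo \n`), the last token `foo` ends at offset 7 and the newline token starts at offset 8, so `node_range(8, 7)` yields an empty range at 7 where `TextRange::new(8, 7)` would panic. A node that did consume `foo` still gets the ordinary range: `node_range(4, 7)` spans 4..7.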

Test Plan

Add test cases and update the snapshots.

github-actions · 2024-04-19T10:26:53Z

`ruff-ecosystem` results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

ℹ️ ecosystem check detected linter changes. (+0 -1 violations, +0 -0 fixes in 1 projects; 43 projects unchanged)

python/typeshed (+0 -1 violations, +0 -0 fixes)

ruff check --no-cache --exit-zero --ignore RUF9 --output-format concise --preview --select E,F,FA,I,PYI,RUF,UP,W

- stdlib/pathlib.pyi:111:89: E999 SyntaxError: Expected ':', found newline

Changes by rule (1 rules affected)

code	total	+ violation	- violation	+ fix	- fix
E999	1	0	1	0	0

Formatter (stable)

✅ ecosystem check detected no format changes.

Formatter (preview)

✅ ecosystem check detected no format changes.

MichaReiser · 2024-04-19T11:02:30Z

crates/ruff_python_parser/src/parser/mod.rs

+        // It's possible during error recovery that the parsing didn't consume any tokens. In that
+        // case, `last_token_end` still points to the end of the previous token but `start` is the
+        // start of the current token. Calling `TextRange::new(start, self.last_token_end)` would
+        // panic in that case because `start > end`. This path "detects" this case and creates an
+        // empty range instead.


Nit: I think this isn't specific to error recovery and instead is true for all "empty" nodes that don't consist of any tokens?

If that's the case, then I think we can remove the missing_node_range method that was only added to handle empty node ranges.

I think missing_node_range is still useful in places where you don't have the start value or rather the start value itself is the current token start.

dhruvmanila added the bug (Something isn't working) and parser (Related to the parser) labels Apr 19, 2024

dhruvmanila requested a review from MichaReiser as a code owner April 19, 2024 10:09

dhruvmanila mentioned this pull request Apr 19, 2024

[Panic] Unknow nvim error while working with python based project. #11020

Open

MichaReiser approved these changes Apr 19, 2024

Use empty range when there's "gap" in token source

337f16e

dhruvmanila force-pushed the dhruv/node-range branch from 6979dad to 337f16e Compare April 19, 2024 11:29

dhruvmanila enabled auto-merge (squash) April 19, 2024 11:29

dhruvmanila merged commit d3cd61f into main Apr 19, 2024
17 checks passed

dhruvmanila deleted the dhruv/node-range branch April 19, 2024 11:36

BrewTestBot mentioned this pull request Apr 19, 2024

ruff 0.4.1 Homebrew/homebrew-core#169523

Merged

MichaReiser mentioned this pull request Apr 19, 2024

[Panic] Ruff VSCode extension crashes every time the word def is written in a class #11037

Closed

AlexWaygood mentioned this pull request Apr 23, 2024

[Panic] Accidental Space after function name (instead of opening parenthesis) crashes Ruff #11105

Closed

dhruvmanila mentioned this pull request May 16, 2024

assertion failed: start.raw <= end.raw appears again #11429

Closed
