C901, PLR0912 and PLR0915 treat match/case as one statement #11421

jaap3 · 2024-05-14T09:21:47Z

Using ruff 0.4.4, the following code triggers C901, PLR0912 and PLR0915:

def grades_to_average(grades):
    numbers = []
    for grade in grades:
        if grade in {"F-", "F", "F+", "E-", "E"}:
            numbers.append(0)
        elif grade == "E+":
            numbers.append(.3)
        elif grade == "D-":
            numbers.append(.7)
        elif grade == "D":
            numbers.append(1.0)
        elif grade == "D+":
            numbers.append(1.3)
        elif grade == "C-":
            numbers.append(1.7)
        elif grade == "C":
            numbers.append(2.0)
        elif grade == "C+":
            numbers.append(2.3)
        elif grade == "B-":
            numbers.append(2.7)
        elif grade == "B":
            numbers.append(3.0)
        elif grade == "B+":
            numbers.append(3.3)
        elif grade == "A-":
            numbers.append(3.7)
        elif grade in {"A", "A+"}:
            numbers.append(4.0)
        else:
            raise ValueError(f"Unknown grade: {grade}")

    try:
        avg = sum(numbers) / len(numbers)
    except ZeroDivisionError:
        avg = 0

    if avg < 0.3:
        avg_grade = "F"
    elif avg < .7:
        avg_grade = "E+"
    elif avg < 1.0:
        avg_grade = "D-"
    elif avg < 1.3:
        avg_grade = "D"
    elif avg < 1.7:
        avg_grade = "D+"
    elif avg < 2.0:
        avg_grade = "C-"
    elif avg < 2.3:
        avg_grade = "C"
    elif avg < 2.7:
        avg_grade = "C+"
    elif avg < 3.0:
        avg_grade = "B-"
    elif avg < 3.3:
        avg_grade = "B"
    elif avg < 3.7:
        avg_grade = "B+"
    elif avg < 4.0:
        avg_grade = "A-"
    elif avg >= 4.0:
        avg_grade = "A"
    else:
        raise ValueError(f"Unexpected average: {avg}")
    return avg_grade

The equivalent code that uses match/case, however, does not:

def grades_to_average(grades):
    numbers = []
    for grade in grades:
        match grade:
            case "F-" | "F" | "F+" | "E-" | "E":
                numbers.append(0)
            case "E+":
                numbers.append(.3)
            case "D-":
                numbers.append(.7)
            case "D":
                numbers.append(1.0)
            case "D+":
                numbers.append(1.3)
            case "C-":
                numbers.append(1.7)
            case "C":
                numbers.append(2.0)
            case "C+":
                numbers.append(2.3)
            case "B-":
                numbers.append(2.7)
            case "B":
                numbers.append(3.0)
            case "B+":
                numbers.append(3.3)
            case "A-":
                numbers.append(3.7)
            case "A" | "A+":
                numbers.append(4.0)
            case _:
                raise ValueError(f"Unknown grade: {grade}")

    try:
        avg = sum(numbers) / len(numbers)
    except ZeroDivisionError:
        avg = 0

    match avg:
        case avg if avg < .3:
            avg_grade = "F"
        case avg if avg < .7:
            avg_grade = "E+"
        case avg if avg < 1.0:
            avg_grade = "D-"
        case avg if avg < 1.3:
            avg_grade = "D"
        case avg if avg < 1.7:
            avg_grade = "D+"
        case avg if avg < 2.0:
            avg_grade = "C-"
        case avg if avg < 2.3:
            avg_grade = "C"
        case avg if avg < 2.7:
            avg_grade = "C+"
        case avg if avg < 3.0:
            avg_grade = "B-"
        case avg if avg < 3.3:
            avg_grade = "B"
        case avg if avg < 3.7:
            avg_grade = "B+"
        case avg if avg < 4.0:
            avg_grade = "A-"
        case avg if avg >= 4.0:
            avg_grade = "A"
        case _:
            raise ValueError(f"Unexpected average: {avg}")
    return avg_grade

Is this intentional? Should each case count as a conditional?

dhruvmanila · 2024-05-15T09:18:17Z

I think it is intentional as Pylint doesn't detect it either.

Should each case count as a conditional?

Personally, I wouldn't count it, as I find the match statement to be more readable than an if statement. Additionally, the semantics of pattern matching are very different from the test expression of an if statement. I'd love to hear others' opinions, cc @AlexWaygood @zanieb

jaap3 · 2024-05-16T08:41:49Z

I feel like the same complexity and maintainability arguments apply to match and case. They are definitely a way to achieve branching code, and each case (especially the conditional ones containing if) could be considered a distinct statement, right?

This issue should probably be converted to a discussion.

dhruvmanila · 2024-05-20T05:52:13Z

This issue should probably be converted to a discussion.

It's fine, we can discuss here.

I don't have any arguments against this, and it does make sense (I think?) to include the match statement similarly to an if statement. I would wait for others to share their opinions before we decide. It would also be useful to hear the thoughts of the Pylint maintainers. Do you want to open an issue on the Pylint repository similar to this? I can also do that :)

charliermarsh · 2024-05-22T03:17:21Z

cc @Pierre-Sassoulas

Pierre-Sassoulas · 2024-05-22T05:07:17Z

Thank you for the ping @charliermarsh :) I see no reason for match to behave differently than if. I think it's an oversight: when we added support for the match statement in Python 3.10, we did not think to test the consequences for the already implemented "too-complex" check (we assumed the "too-complex" code was generic enough).

Pierre-Sassoulas · 2024-05-22T06:47:39Z

So, I did some light research and there might be a case for "too-complex" behaving the way it does right now. It is based on McCabe, and:

McCabe originally recommended exempting modules consisting of single multiway decision (“switch” or “case”) statements from the complexity limit. The multiway decision issue has been interpreted in many ways over the years, sometimes with disastrous results.

https://web.archive.org/web/20210908120324/https://www.mccabe.com/pdf/mccabe-nist235r.pdf (page 15, notice also that McCabe himself is one of the authors)

Also on page 26 we can see:

A less frequently occurring issue that has greater impact on complexity is the distinction between “case-labeled statements” and “case labels.” When several case labels apply to the same program statement, this is modeled as a single decision outcome edge in the control flow graph, adding one to complexity. It is certainly possible to make a consistent flow graph model in which each individual case label contributes a different decision outcome edge and hence also adds one to complexity, but that is not the typical usage.

IMO, match/case in Python adds complexity because a case is not a simple "case label"; it behaves more like an if, or even a regex.

AlexWaygood · 2024-05-23T13:38:52Z

I agree with @Pierre-Sassoulas. For the McCabe plugin, I believe one of the primary motivations for the rule is to ensure that each function is testable in isolation. If a function has too many branches, it becomes hard to write a unit test for it. I think this is just as much a concern for match/case as it is for if/elif, as each new case statement in the match/case tree can be seen as a wholly distinct branch that you would need to account for when writing tests.

dhruvmanila · 2024-05-23T13:57:35Z

Thank you @Pierre-Sassoulas for chiming in! I agree with the conclusion. I think we should update all three rules to consider the match statement, and then we can see how many changes show up in the ecosystem checks.

blueraft · 2024-05-23T16:09:25Z

I can take a stab at this if that's ok.

charliermarsh · 2024-05-23T16:37:53Z

Go for it -- thanks!

Resolves #11421

## Summary

Instead of counting match/case as one statement, consider each `case` as a conditional.

## Test Plan

`cargo test`

jaap3 · 2024-05-24T14:33:52Z

Thanks everyone!

qartik · 2024-05-29T12:44:44Z

I wish counting match-case statements for the PLR rules were configurable, so that it could be turned off with a lint.pylint setting. Long match-case statements are very common while walking abstract syntax trees and are arguably way less complex than any other solution. See, for example, the usage at https://github.com/CQCL/pytket-phir/blob/main/pytket/phir/phirgen.py#L102-L148, which now raises both PLR0912 and PLR0915.

@jaap3, @blueraft, @dhruvmanila what do you think? If there is agreement, I can file a separate issue.

AlexWaygood · 2024-05-29T12:52:56Z

Long match-case statements are very common while walking abstract syntax trees and are arguably way less complex than any other solution.

I'd agree that match/case statements are more readable for this purpose, and arguably more idiomatic, but I don't think there are any fewer branches than if you walked the tree using if/elif/else, and the number of branches is what these rules are concerned with. I don't see a strong reason to make match/case statements configurable here specifically. I think I'd find it just as difficult to write comprehensive unit tests for a function with many case branches as I would for a function with many elif branches.

qartik · 2024-05-29T15:31:17Z

The only thing I'd add is that there is no simpler way to process an enumeration than pattern matching (such as the enumerations that show up in AST walks, as mentioned), and for those cases I'd argue that the complexity considerations covered by these rules do not apply.

With the current change, any parser or compiler processing a grammar could end up full of ruff ignores, one for each such match construct (or one per file, which would then miss several other cases). Alternatively, we could let users choose whether they want these code complexity heuristics to apply to match-case.

AlexWaygood · 2024-05-29T15:38:38Z

To me that just seems to be arbitrarily favouring some language constructs at the expense of others in the name of readability and style, which isn't what this rule is ultimately meant to be concerned with. If the argument is that there are many cases where writing a more complex function is in fact more maintainable and readable than splitting the function into several smaller functions, then I'd agree that that's a valid critique. But to me, that seems like a flaw of the rule in general rather than a flaw of the rule as specifically applied to match/case statements.
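
As an aside on the grade example at the top of this issue: the branch count there can be lowered without choosing between if/elif and match/case at all, by moving the mapping into data. This is only one possible refactoring, sketched under the assumption that the thresholds and labels are exactly those in the original function; `GRADE_POINTS`, `THRESHOLDS` and `LABELS` are names invented here.

```python
from bisect import bisect_right

# Grade -> points, copied from the branches of the original function.
GRADE_POINTS = {
    "F-": 0, "F": 0, "F+": 0, "E-": 0, "E": 0,
    "E+": 0.3, "D-": 0.7, "D": 1.0, "D+": 1.3,
    "C-": 1.7, "C": 2.0, "C+": 2.3,
    "B-": 2.7, "B": 3.0, "B+": 3.3,
    "A-": 3.7, "A": 4.0, "A+": 4.0,
}

# LABELS[i] applies to averages below THRESHOLDS[i]; the last label
# applies to averages of 4.0 and above.
THRESHOLDS = [0.3, 0.7, 1.0, 1.3, 1.7, 2.0, 2.3, 2.7, 3.0, 3.3, 3.7, 4.0]
LABELS = ["F", "E+", "D-", "D", "D+", "C-", "C", "C+", "B-", "B", "B+", "A-", "A"]


def grades_to_average(grades):
    try:
        numbers = [GRADE_POINTS[grade] for grade in grades]
    except KeyError as exc:
        raise ValueError(f"Unknown grade: {exc.args[0]}") from None
    # An empty input averages to 0, as in the original.
    avg = sum(numbers) / len(numbers) if numbers else 0
    # bisect_right counts the thresholds that are <= avg.
    return LABELS[bisect_right(THRESHOLDS, avg)]


print(grades_to_average(["A", "B"]))  # B+
```

Whether this reads better than the match/case version is a matter of taste, which is in line with the point that the limits are heuristics rather than a verdict on any one construct.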

dhruvmanila added the question (Asking for support or clarification) label May 20, 2024

dhruvmanila mentioned this issue May 22, 2024

PLR0912: Pylint counts try: ... except: statements as single branch #11205

Closed

Pierre-Sassoulas mentioned this issue May 22, 2024

Add a test case for too-complex in match case, for discussion pylint-dev/pylint#9667

Open

dhruvmanila added the rule (Implementing or modifying a lint rule) and help wanted (Contributions especially welcome) labels and removed the question (Asking for support or clarification) label May 23, 2024

blueraft mentioned this issue May 23, 2024

Consider match-case stmts for C901, PLR0912, and PLR0915 #11521

Merged

dhruvmanila closed this as completed in #11521 May 24, 2024

dhruvmanila pushed a commit that referenced this issue May 24, 2024

Consider match-case stmts for C901, PLR0912, and PLR0915 (#11521)

33fd500
