Unnecessary line breaks in method call on multiline string #256

lubomir · 2018-05-25T20:46:15Z

This is possibly not a bug, but the results are unexpected and don't really seem to improve readability. However feel free to close if this is actually correct.

Operating system: Fedora 28
Python version: 3.6.5
Black version: 18.5b0
Does also happen on master: yes

Formatting this file uses some extra unexpected new lines:

MULTILINE = """
foo
""".replace("\n", "")

@@ -1,4 +1,6 @@
 MULTILINE = """
 foo
-""".replace("\n", "")
+""".replace(
+    "\n", ""
+)

ambv · 2018-05-29T06:28:43Z

Black considers multiline strings as always not fitting a single line (per definition), so it tries to break the line if possible. I agree that in this case the result is sub-optimal. I'll see what I can do to improve this.

zsol · 2018-07-31T13:41:17Z

@AvdN points out another effect of how black treats multiline strings that are styled like this in #426 (in the case when the string is a parameter to a function call)

ambv · 2018-09-26T16:51:00Z

A related case is:

textwrap.dedent("""\
    Hello, I am
    a multiline string used with
    a common idiom
""")

We need to fix that, too.

OJFord · 2019-09-20T19:03:04Z

That related case is how I got here. For clarity here, the above:

textwrap.dedent("""\
    Hello, I am
    a multiline string used with
    a common idiom
""")

becomes:

textwrap.dedent(
    """\
    Hello, I am
    a multiline string used with
    a common idiom
"""
)

Amusingly you can of course instead change it to:

s = """\
    Hello, I am
    a multiline string used with
    a common idiom
"""
s = textwrap.dedent(s)

in one fewer lines and which black will accept (and won't churn when this is resolved) - or do the dedent call at the point of use.

brupelo · 2020-06-10T16:26:09Z

What's the status of this one? I've seen this one has been opened around 2018 and evolved quite a bit over the years... if I'm not mistaken it seems black is still screwing up this common & readable pattern:

if __name__ == "__main__":
    transform(textwrap.dedent("""
        # from random import random

        def random():
            return 10

        a = random()*10
    """))

into something like this:

if __name__ == "__main__":
    transform(
        textwrap.dedent(
            """
        # from random import random

        def random():
            return 10

        a = random()*10
    """
        )
    )

One of the main reasons to use textwrap.dedent pattern with multiline strings is because "indentation awareness"... But in this case, that sense of indentation will be lost thanks to black... said otherwise, in this particular case black has destroyed the code into something much worse.

Black version: black, version 19.10b0
Python: 3.6.8 (tags/v3.6.8:3c6b436a57, Dec 23 2018, 23:31:17) [MSC v.1916 32 bit (Intel)] on win32

brupelo · 2020-07-22T13:45:48Z

I love black... I really do... but this issue is really nasty... not just because black will breaks code folding completely but also because the output produced by black doesn't make any sense in comparison the original clean version. I find myself adding clauses # fmt: off & # fmt: on more than I should when that shouldn't be case.

I understand none of the maintainers is interested on this issue? I've see both this one and other related ones have been opened for years... question for people who've submitted patches to black before, do you what's the place in the codebase where this is handled?

Thanks.

Ps. Why this important issue hasn't been fixed already? Has it been considered a non-go? Too difficult to handle? I'd like to understand... :)

FlipperPA · 2020-07-22T15:06:00Z

This is a big pet peeve for me too. There's information about contributing here, if you're interested, @brupelo:

https://github.com/psf/black/blob/master/CONTRIBUTING.md

FlipperPA · 2020-08-30T14:00:21Z

Some quick questions:

would the preferred behavior be to leave the original code intact?
would the preferred behavior be to format all docstrings within parentheses the shorter, more beautiful way?

I'm trying to get a sense if this issue clears the hurdle mentioned in the contribution README: On the other hand, if your answer is "because I don't like a particular formatting" then you're not ready to embrace Black yet. Such changes are unlikely to get accepted. You can still try but prepare to be disappointed.

I'd be happy to take a stab if someone would be willing to do some hand holding along the way, as I'd be a brand new contributor.

Thanks for your efforts, black is awesome and amazing. 💯

merwok · 2020-08-30T15:11:14Z

The point about «liking» is dismissive of more serious concerns. Single vs double quotes is an annoying change but mostly irrelevant (it was nice to match the style that the Python REPL outputs, but quote style doesn’t matter and there is a point that it’s common to have apostrophe in a string, so 🤷). But people care about diffs and history, and the developer is the person best situated to decide if this is an extensible list/dict/set that is multiline from the start, or something that can be rewritten by a style program. (One of the worst cases is parameters to the pytest.parametrize decorator that are rewritten to waste screen space without benefit.)

aneeshusa · 2020-12-07T08:24:41Z

I kept running into this so I decided to spend some time figuring how to fix it. Happy to say I have a branch that should do just that! It's got a new 316 line test file with a bunch of cases (including the ones reported in this and linked issues). However, the code is still messy as I haven't yet taken the time to clean it up: right now it's ~100 new lines dumped into the is_line_short_enough function. I'll post a draft PR in the next day or two after some basic cleanup, just wanted to comment to let others know if they had been thinking of spending time on this.

FWIW, the branch is more of a POC right now and requires more work before it could be mergeable. I have some hacked-together code for bracket tracking, trailing comma handling, etc., which will require some refactoring so I can reuse the existing functions for that (will need help from the black folks on the best approach). There's also more work to do like finishing up draft changes to the black style guide docs & CHANGES.rst looking through the black-primer output to verify the changes and work with the relevant upstreams.

aneeshusa · 2020-12-22T19:42:04Z

Opened #1879 with my prototype fix! Would love feedback from folks on the new tests I added and whether the formatting I chose makes sense (AFAIK I included all the examples from this and linked issues).

arogozhnikov · 2022-01-18T20:55:21Z

Just want to bring this up if it is forgotten.

I think it's the only annoying (and undoubtedly annoying) part of black formatting to me,
specifically examples provided by @ambv and @OJFord : #256 (comment)

Maybe a heuristic like this would work: combinations like (""" and """) shouldn't be split by formatting (if they stick, just don't split them).

aneeshusa · 2022-01-24T17:00:00Z

Some context from me:

I'm planning to roll out black across all repos at my company (I'm on the languages team at Lyft). Similar to you, this is the only piece of black formatting that I've found annoying enough to consider a blocker since we have a number of repos that use multi-line strings heavily.
I haven't forgotten about the PR, but unfortunately it has been put on the backburner due to other internal priorities.

I would like to get back to the PR later this year but can't give any guarantees as to when that'll be. In the meantime, I welcome anyone to take over the work and make their own PR!

ichard26 · 2022-01-24T18:15:23Z

Worst case scenario is that I take over the PR and try to push it to completion but unfortunately I've been extremely busy lately too so it'll have to wait at least more week before I have time (and even then the core team is busy figuring out the stable release).

FlipperPA · 2022-02-05T16:39:45Z

@ichard26 @aneeshusa Any idea how much more code work the PR needs? Or is it more of someone to shepherd it across the finish line?

aneeshusa · 2022-02-07T17:44:25Z

#1879 (comment) is still pretty accurate. Here's my take on action items, though I haven't looked at this code or black internals since I opened that PR.

Looks like there is general consensus on the new formatting style, so checkboxes 1 & 2 are done.
The code needs to be moved around a bit now that black's source code has been split into multiple files and isn't one giant file - I suspect this would be pretty straightforward to do.
The current PR adds a bunch of extra work into is_line_short_enough, though it's a no-op when not looking at a MLS-containing line. I suspect it'll need to be reworked to use BracketTracker for speed (or at the very least pick up some of the bugfixes it has https://github.com/psf/black/pull/1879/files#diff-d23a6917d7262ca9815093bffb7148299e516575251959ed9b405b648e76aa29R6480 for correctness), but I hadn't gotten far enough to understand what exactly this could/would look like. It's also possible that refactor is is an optimization we could make later - some profiling data would confirm how this affects the performance. This is the biggest open AI as I see it. Advice from maintainers would be very helpful here as well (this is checkboxes 3 and 4).
Checkbox 5 - adding docs - might require a few iterations to make it clear but should be straightforward.

If, based on profiling and maintainer feedback, the current approach in the PR is mostly acceptable, then it's more of a "take it across the finish". If the approach needs some rework then it'll require more investment depending on what changes are needed.

JelleZijlstra · 2022-02-07T21:26:18Z

Also, with the new stability policy, this change would need to go only into the --preview style.

aneeshusa · 2022-10-03T20:54:38Z

Quick update: @olivia-hong who works with me on the Backend Language Tooling team at Lyft is going to be picking this work up again!

FlipperPA · 2022-10-31T16:44:17Z

Thank you @aneeshusa and @olivia-hong! Lyft was the go-to ride of choice for us in San Diego at DjangoCon US a few weeks ago. 😀

olivia-hong · 2022-11-08T19:21:06Z

Posting an update here for visibility as well: I've updated the PR Aneesh originally put up and it's now fully ready for review #1879 (comment)! Comments/feedback are much appreciated.

Since the issue is a few years old at this point, tagging some of the original reviewers @JelleZijlstra @ichard26 as an FYI. Apologies for any noise.

ambv added the T: bug Something isn't working label May 29, 2018

ambv mentioned this issue May 29, 2018

Opinion: Unwanted indentation of multiline string #233

Closed

zsol mentioned this issue Jul 31, 2018

module level multiline strings getting mangled into unreadability, forcing code changes #426

Closed

This was referenced Sep 7, 2018

Consider not breaking line on outermost parenthesis #500

Closed

black adds unnecessary lines around triple-quoted blocks #462

Closed

zsol mentioned this issue Jan 31, 2019

Unreadable output on multiline + .format() #590

Closed

zsol mentioned this issue Mar 1, 2019

Unnecessary reformat? #729

Closed

sitsofe mentioned this issue Mar 12, 2019

Triple-quote multiline strings could be more compact. #697

Closed

basnijholt mentioned this issue May 3, 2019

Incorrectly formats textwrap.dedent #570

Closed

zsol mentioned this issue Jul 30, 2019

Exploding short arguments after multi-line string #945

Closed

zsol mentioned this issue Dec 28, 2019

Style issue with triple quotes #1216

Closed

ambv added the stable label Mar 3, 2020

ambv added this to To Do in Stable release via automation Mar 3, 2020

Mattwmaster58 mentioned this issue Apr 3, 2020

Triple quote strings as function args are formatted badly #1333

Open

ichard26 mentioned this issue Sep 27, 2020

Single argument to nested function call placed on separate line #828

Closed

ichard26 mentioned this issue Dec 22, 2020

Undesired split after multiline triple quoted string. #1876

Closed

aneeshusa mentioned this issue Dec 22, 2020

Improve multiline string handling #1879

Merged

3 tasks

ichard26 mentioned this issue Apr 2, 2021

Indentation of method call after multiline string #2028

Closed

ichard26 mentioned this issue Apr 16, 2021

Improve formatting for multiline strings wrapped by parentheses with a method call #2113

Open

JelleZijlstra added F: strings Related to our handling of strings T: style What do we want Blackened code to look like? F: linebreak How should we split up lines? labels May 30, 2021

Akuli mentioned this issue Jun 5, 2021

prefer one-line string method at end of multiline string #2308

Closed

felix-hilden removed the stable label Jan 30, 2022

JelleZijlstra mentioned this issue Dec 19, 2022

Unnecessary split in left hand side of assignment due to multiline string #749

Closed

Sheile mentioned this issue Feb 14, 2023

replace create.py to install ckan command create table c-3lab/ckanext-feedback#6

Merged

JelleZijlstra closed this as completed in #1879 Mar 7, 2023

charliermarsh mentioned this issue Aug 15, 2023

Formatter: Expands function call after multi-line string astral-sh/ruff#6500

Closed

yuhrichard mentioned this issue Jan 23, 2024

multiline_string_handling sometimes leads to worse formatting #4159

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unnecessary line breaks in method call on multiline string #256

Unnecessary line breaks in method call on multiline string #256

lubomir commented May 25, 2018

ambv commented May 29, 2018

zsol commented Jul 31, 2018

ambv commented Sep 26, 2018

OJFord commented Sep 20, 2019 •

edited

brupelo commented Jun 10, 2020

brupelo commented Jul 22, 2020

FlipperPA commented Jul 22, 2020

FlipperPA commented Aug 30, 2020 •

edited

merwok commented Aug 30, 2020

aneeshusa commented Dec 7, 2020

aneeshusa commented Dec 22, 2020

arogozhnikov commented Jan 18, 2022

aneeshusa commented Jan 24, 2022 •

edited

ichard26 commented Jan 24, 2022

FlipperPA commented Feb 5, 2022

aneeshusa commented Feb 7, 2022

JelleZijlstra commented Feb 7, 2022

aneeshusa commented Oct 3, 2022

FlipperPA commented Oct 31, 2022

olivia-hong commented Nov 8, 2022

Unnecessary line breaks in method call on multiline string #256

Unnecessary line breaks in method call on multiline string #256

Comments

lubomir commented May 25, 2018

ambv commented May 29, 2018

zsol commented Jul 31, 2018

ambv commented Sep 26, 2018

OJFord commented Sep 20, 2019 • edited

brupelo commented Jun 10, 2020

brupelo commented Jul 22, 2020

FlipperPA commented Jul 22, 2020

FlipperPA commented Aug 30, 2020 • edited

merwok commented Aug 30, 2020

aneeshusa commented Dec 7, 2020

aneeshusa commented Dec 22, 2020

arogozhnikov commented Jan 18, 2022

aneeshusa commented Jan 24, 2022 • edited

ichard26 commented Jan 24, 2022

FlipperPA commented Feb 5, 2022

aneeshusa commented Feb 7, 2022

JelleZijlstra commented Feb 7, 2022

aneeshusa commented Oct 3, 2022

FlipperPA commented Oct 31, 2022

olivia-hong commented Nov 8, 2022

OJFord commented Sep 20, 2019 •

edited

FlipperPA commented Aug 30, 2020 •

edited

aneeshusa commented Jan 24, 2022 •

edited