[LoRA] fix cross_attention_kwargs problems and tighten tests #7388
Conversation
Cc: @younesbelkada for viz.
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Will also wait for @BenjaminBossan to approve it. And then I will proceed.
Nice catch! Thanks! One could also use `get` to avoid copying the kwargs at each forward!
The problem with
ok makes sense! thanks for explaining!
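(For readers following along, here is a minimal, plain-Python sketch of the two options discussed above. It only illustrates the dictionary mechanics and is not the actual diffusers code.)

```python
# Plain-Python illustration of the two approaches discussed above;
# not the actual diffusers code.
cross_attention_kwargs = {"scale": 0.5}

# `get` reads the value without mutating the dict, so the caller's
# "scale" entry survives later calls.
scale_via_get = cross_attention_kwargs.get("scale", 1.0)
assert "scale" in cross_attention_kwargs

# The approach taken in this PR: shallow-copy first, then pop from the
# copy, which also leaves the caller's dict untouched.
kwargs_copy = cross_attention_kwargs.copy()
scale_via_pop = kwargs_copy.pop("scale", 1.0)
assert "scale" in cross_attention_kwargs
assert scale_via_get == scale_via_pop == 0.5
```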
Thanks for fixing this bug, I think the copy solution is solid.
* debugging
* let's see the numbers
* let's see the numbers
* let's see the numbers
* restrict tolerance.
* increase inference steps.
* shallow copy of cross_attention_kwargs
* remove print
What does this PR do?
First of all, I would like to apologize for not being rigorous enough with #7338. This was actually breaking:
This is because `pop()` removes the requested key from the underlying dictionary on the first call and then uses the default value on every subsequent call. Since the `unet` within a `DiffusionPipeline` is called iteratively, this creates a lot of unexpected consequences. As a result, the above-mentioned test fails. Here are the `lora_scale` values:

Notice how it defaults to 1.0 after the first denoising step.
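To make the failure mode concrete, here is a stripped-down sketch; the `forward_*` functions below are hypothetical stand-ins for the UNet forward pass, not the real diffusers code:

```python
# Hypothetical stand-ins for the UNet forward pass; not the real diffusers code.
def forward_buggy(cross_attention_kwargs):
    # pop() removes "scale" from the caller's dict on the first call,
    # so every later call silently falls back to the default of 1.0.
    return cross_attention_kwargs.pop("scale", 1.0)

def forward_fixed(cross_attention_kwargs):
    # The fix in this PR: shallow-copy before popping so the caller's dict survives.
    kwargs = cross_attention_kwargs.copy()
    return kwargs.pop("scale", 1.0)

kwargs = {"scale": 0.5}
print([forward_buggy(kwargs) for _ in range(3)])  # [0.5, 1.0, 1.0] -> scale is lost
kwargs = {"scale": 0.5}
print([forward_fixed(kwargs) for _ in range(3)])  # [0.5, 0.5, 0.5]
```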
A simple solution is to create a shallow copy of `cross_attention_kwargs` so that the original one is left untouched. This is what this PR does.

Additionally, you may wonder why the following tests PASS:
```bash
pytest tests/lora/test_lora_layers_peft.py -k "test_simple_inference_with_text_unet_lora_and_scale"
```
My best guess is that we use too few `num_inference_steps` to validate things. To see if my hunch was right, I increased `num_inference_steps` to 5 here and ran these tests WITHOUT the changes introduced in this PR (i.e., the shallow copy). All of those tests failed. With the changes, they pass.

Once this PR is merged, I will take care of making another patch release.
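As an aside, here is a toy simulation (plain Python, nothing diffusers-specific, with made-up step math) of why a handful of steps can hide the bug: the error introduced by the wrong scale compounds, so the gap between the correct and the buggy trajectory is small after two steps and noticeably larger after five.

```python
# Toy stand-in for a denoising loop: each step nudges the latent by an amount
# proportional to the LoRA scale. The step math is invented for illustration only.
def run(scales):
    latent = 1.0
    for scale in scales:
        latent -= 0.1 * scale * latent
    return latent

for num_steps in (2, 5):
    correct = run([0.5] * num_steps)              # scale honoured at every step
    buggy = run([0.5] + [1.0] * (num_steps - 1))  # scale silently resets to 1.0
    print(num_steps, abs(correct - buggy))        # the gap roughly triples from 2 to 5 steps
```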
Once again, I am genuinely sorry for the oversight on my end.