
fix sd2 switching #16079

Merged: 1 commit into AUTOMATIC1111:dev, Jul 6, 2024
Conversation

@light-and-ray (Contributor) commented Jun 23, 2024

Closes: #13763
I start webui with a non-SD 2.1 model, then try to load an SD 2.1 checkpoint, and I get "NotImplementedError".
If I start webui with an SD2 model in the --ckpt flag, it works, even if I switch to another model and back to SD2.

This bug appeared in October. I think it's connected with device = devices.cpu in is_using_v_parameterization_for_sd2, introduced in commit d04e3e9#diff-b710a9b8e9fbcc5bc5a014f938c9c74564c1dcfc86929f0dc9ff643ba3fe7873R30

NotImplementedError: No operator found for memory_efficient_attention_forward with inputs:
     query     : shape=(1, 64, 5, 64) (torch.float32)
     key       : shape=(1, 64, 5, 64) (torch.float32)
     value     : shape=(1, 64, 5, 64) (torch.float32)
     attn_bias : <class 'NoneType'>
     p         : 0.0
decoderF is not supported because:
    device=cpu (supported: {'cuda'})
    attn_bias type is <class 'NoneType'>
flshattF@v2.3.6 is not supported because:
    device=cpu (supported: {'cuda'})
    dtype=torch.float32 (supported: {torch.float16, torch.bfloat16})
tritonflashattF is not supported because:
    device=cpu (supported: {'cuda'})
    dtype=torch.float32 (supported: {torch.float16, torch.bfloat16})
    operator wasn't built - see python -m xformers.info for more info
    triton is not available
    Only work on pre-MLIR triton for now
cutlassF is not supported because:
    device=cpu (supported: {'cuda'})
smallkF is not supported because:
    max(query.shape[-1] != value.shape[-1]) > 32
    device=cpu (supported: {'cuda'})
    unsupported embed per head: 64

After this patch, the bug is gone for me.
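For context, a minimal standalone repro of this failure mode (not webui code, just an illustration; it assumes xformers is installed and that the attention layers have been hijacked to call it). It shows why every xformers backend rejects the CPU float32 tensors used by the v-parameterization probe, while plain PyTorch attention handles them fine:

```python
import torch
import torch.nn.functional as F
import xformers.ops

# Same shapes as in the error above: (batch, seq_len, heads, head_dim), on CPU.
q = torch.randn(1, 64, 5, 64, dtype=torch.float32)
k = torch.randn(1, 64, 5, 64, dtype=torch.float32)
v = torch.randn(1, 64, 5, 64, dtype=torch.float32)

try:
    # Every xformers backend requires CUDA (and most require fp16/bf16),
    # so on CPU float32 this raises the NotImplementedError shown above.
    xformers.ops.memory_efficient_attention(q, k, v, attn_bias=None, p=0.0)
except NotImplementedError as e:
    print("xformers on CPU failed:", e.__class__.__name__)

# Plain scaled-dot-product attention works on CPU; it expects
# (batch, heads, seq_len, head_dim), hence the transposes.
out = F.scaled_dot_product_attention(
    q.transpose(1, 2), k.transpose(1, 2), v.transpose(1, 2)
)
print("torch SDPA on CPU:", tuple(out.shape))
```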


@light-and-ray mentioned this pull request Jun 23, 2024
@AUTOMATIC1111 (Owner) commented:
The reason I didn't want to run the calculation on GPU is that it takes substantially longer to transfer the model to GPU than to just do one inference on CPU. I merged the other one.
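(For readers following along, a sketch of the trade-off being discussed: the probe only needs one tiny forward pass, so the question is whether to pay for moving the whole UNet to the GPU, where xformers works, or to keep it on CPU, where all xformers backends bail out if attention has been hijacked to use them. The helper below is purely illustrative, with hypothetical names and shapes; it is not the webui function and not necessarily what the merged commit does.)

```python
import torch

def probe_is_v_prediction(unet: torch.nn.Module,
                          device: torch.device = torch.device("cpu")) -> bool:
    """Illustrative one-step probe (hypothetical helper, not modules/sd_models.py).

    Runs a single dummy denoising step on `device` and guesses v-prediction from
    the output statistics. On CPU this forward pass is cheap, but it fails with
    NotImplementedError if the attention layers call xformers; on CUDA it works,
    at the cost of moving the whole UNet across devices first.
    Assumes an LDM-style forward signature unet(x, timesteps, context=...).
    """
    original_device = next(unet.parameters()).device
    unet.to(device)  # no-op if the model is already there
    try:
        with torch.no_grad():
            x = torch.ones(1, 4, 8, 8, device=device) * 0.5      # dummy latent
            t = torch.asarray([999], device=device)               # dummy timestep
            cond = torch.ones(1, 2, 1024, device=device) * 0.5    # dummy conditioning
            out = (unet(x, t, context=cond) - x).mean().item()
        return out < -1
    finally:
        unet.to(original_device)  # restore so normal checkpoint loading continues
```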

@light-and-ray (Contributor, Author) commented:
> I merged the other one.

Which one? hcl's PR solves the other issue.

@light-and-ray (Contributor, Author) commented:
> The reason I didn't want to run the calculation on GPU

But it doesn't work, at least not for everyone. Read the description.

@AUTOMATIC1111 reopened this Jul 6, 2024
@AUTOMATIC1111 merged commit 477869c into AUTOMATIC1111:dev Jul 6, 2024
6 checks passed