
[Bug] Vulkan Flash Attention not working #1431

@daniandtheweb

Description


Git commit

a564fdf

Operating System & Version

Artix Linux - Linux 7.0.0-1-cachyos

GGML backends

Vulkan

Command-line arguments used

./sd-cli -M img_gen -p "a cat" --sampling-method euler --steps 20 -W 1024 -H 1024 -b 1 --cfg-scale 5 -s -1 --clip-skip -1 --embd-dir /home/daniandtheweb/Workspace/sd.cpp-webui/models/embeddings/ --lora-model-dir /home/daniandtheweb/Workspace/sd.cpp-webui/models/loras/ -t 0 --rng cuda --sampler-rng cuda --lora-apply-mode auto -o /home/daniandtheweb/Workspace/sd.cpp-webui/outputs/txt2img/1765151582.png --diffusion-model /home/daniandtheweb/Workspace/sd.cpp-webui/models/unet/anima-preview3-base.safetensors --vae /home/daniandtheweb/Workspace/sd.cpp-webui/models/vae/qwen_image_vae.safetensors --llm /home/daniandtheweb/Workspace/sd.cpp-webui/models/text_encoders/qwen_3_06b_base.safetensors --scheduler kl_optimal --vae-tile-overlap 0.5 --vae-tile-size 32x32 --cache-mode spectrum --preview proj --preview-path /home/daniandtheweb/Workspace/sd.cpp-webui/outputs/txt2img/1765151582_preview.png --preview-interval 1 --offload-to-cpu --vae-tiling --fa --vae-conv-direct --mmap --color

Steps to reproduce

Running the Ernie or Anima models on the Vulkan backend with Flash Attention enabled causes generation to fail.
With Flash Attention disabled, the same generation completes correctly.

What you expected to happen

(image attachment: expected output)

What actually happened

(image attachment: actual output)

Logs / error messages / stack trace

No response

Additional context / environment details

This issue first appeared with commit 8f2967c.

Metadata


Assignees

No one assigned

Labels

bug: Something isn't working

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests
