whisper.cpp/ggml/src/ggml-vulkan
Jeff Bolz 751e42b21e vulkan: use aligned loads for flash attention mask (llama/12853)
Rewrite the stride logic for the mask tensor in the FA shader to force the
stride to be aligned, to allow using more efficient loads.
2025-04-24 20:39:16 +03:00
..
cmake cmake: fix ggml-shaders-gen compiler paths containing spaces (llama/12747) 2025-04-24 20:39:16 +03:00
vulkan-shaders vulkan: use aligned loads for flash attention mask (llama/12853) 2025-04-24 20:39:16 +03:00
CMakeLists.txt vulkan: Fix missing cmake logic for dot product extension (llama/12721) 2025-04-24 20:39:16 +03:00
ggml-vulkan.cpp vulkan: In coopmat2 mmq, load q4_k/q5_k scales through shared memory (llama/12833) 2025-04-24 20:39:16 +03:00