whisper.cpp/ggml
Jeff Bolz 668306ff2b vulkan: fix coopmat2 flash attention for non-contiguous inputs (llama/11281)
Add code similar to mul_mm_cm2 to force alignment of strides, to avoid
a performance regression.

Add noncontiguous FA tests in test-backend-ops.

Fixes #11268.
2025-02-03 22:00:57 +02:00
include         rpc : early register backend devices (llama/11262)                            2025-02-03 22:00:57 +02:00
src             vulkan: fix coopmat2 flash attention for non-contiguous inputs (llama/11281)  2025-02-03 22:00:57 +02:00
.gitignore      whisper : reorganize source code + improve CMake (#2256)                      2024-06-26 19:34:09 +03:00
CMakeLists.txt  fix: ggml: fix vulkan-shaders-gen build (llama/10448)                         2025-02-03 22:00:57 +02:00