whisper.cpp/ggml
Ruben Ortlam 686bc802d1 vulkan: add `v_dot2_f32_f16` support in matrix-matrix multiplication and Flash Attention (llama/24123)
* vulkan: add support for valve fp16 dot2 extension

* use macro for dot2 path choice

* properly check for the feature

* add dot_product abstraction to reduce preprocessor branching
2026-06-15 10:33:53 +03:00
cmake ggml : Parallelize quant LUT init (llama/23595) 2026-05-25 12:26:07 +03:00
include ggml : add GGML_OP_COL2IM_1D (llama/24206) 2026-06-15 10:33:53 +03:00
src vulkan: add `v_dot2_f32_f16` support in matrix-matrix multiplication and Flash Attention (llama/24123) 2026-06-15 10:33:53 +03:00
.gitignore
CMakeLists.txt ggml : bump version to 0.14.0 (ggml/1533) 2026-06-08 14:36:36 +03:00