whisper.cpp/ggml
Johannes Gäßler f63848eada CUDA: faster FA for GQA > 1 but not power of 2 (llama/19092) 2026-01-30 15:56:40 +02:00
..
cmake
include ggml : add ggml_build_forward_select (llama/18550) 2026-01-30 15:56:40 +02:00
src CUDA: faster FA for GQA > 1 but not power of 2 (llama/19092) 2026-01-30 15:56:40 +02:00
.gitignore
CMakeLists.txt ggml : bump version to 0.9.5 (ggml/1410) 2025-12-31 18:27:20 +02:00