whisper.cpp/ggml
Sachin Sharma 8b288f5d96 ggml-zendnn : adaptive fallback to CPU backend for small batch sizes (llama/22681)
* ggml-zendnn : add runtime env var GGML_ZENDNN_ADAPTIVE_FALLBACK to control adaptive fallback (default: enabled)

* ggml-zendnn : restore original fallback logic when adaptive fallback is disabled
2026-05-14 21:26:48 +03:00
..
cmake cmake : add FindNCCL.cmake (ggml/0) 2026-05-02 15:02:42 +03:00
include CUDA: lower-case PCI bus id, standardize for ggml (llama/22820) 2026-05-14 21:26:48 +03:00
src ggml-zendnn : adaptive fallback to CPU backend for small batch sizes (llama/22681) 2026-05-14 21:26:48 +03:00
.gitignore
CMakeLists.txt ggml: install ggml.pc in <libdir>/pkgconfig (ggml/1480) 2026-05-14 21:26:48 +03:00