whisper.cpp/ggml
Neo Zhang ec0c661950 Support Q4_1, Q5_0, Q5_1 in Flash-attention (llama/23812)
* support Q4_1, Q5_0, Q5_1

* update ut case
2026-06-08 14:36:36 +03:00
cmake ggml : Parallelize quant LUT init (llama/23595) 2026-05-25 12:26:07 +03:00
include ggml: `gguf_init_from_callback` and `gguf_init_from_buffer` (llama/22341) 2026-05-25 12:44:04 +03:00
src Support Q4_1, Q5_0, Q5_1 in Flash-attention (llama/23812) 2026-06-08 14:36:36 +03:00
.gitignore
CMakeLists.txt ggml : bump version to 0.13.1 (ggml/1523) 2026-05-29 09:47:30 +03:00