whisper.cpp

Ruben Ortlam bc944bddc8 Vulkan MMQ Integer Dot Refactor and K-Quant support (llama/16536)	2025-11-09 23:38:03 +02:00
* vulkan: add mmq q2_k integer dot support
* Refactor mmq caching
* Reduce mmq register use
* Load 4 quant blocks into shared memory in one step
* Pack q2_k blocks into caches of 32
* Use 32-bit accumulators for integer dot matmul
* Add q4_k mmq
* Add q3_k mmq
* Add q5_k mmq
* Add q6_k mmq
* Add mxfp4 mmq, enable MMQ MUL_MAT_ID
* Fix mmv dm loads
cmake	ggml: Skip backend library linking code when GGML_BACKEND_DL=ON (llama/15094)	2025-08-18 20:30:45 +03:00
include	Add experimental ggml-hexagon backend for the Hexagon NPU (llama/16547)	2025-11-09 23:38:03 +02:00
src	Vulkan MMQ Integer Dot Refactor and K-Quant support (llama/16536)	2025-11-09 23:38:03 +02:00
.gitignore	…
CMakeLists.txt	Add experimental ggml-hexagon backend for the Hexagon NPU (llama/16547)	2025-11-09 23:38:03 +02:00