whisper.cpp

lhez a87e950a06 opencl: improve get_rows, cpy, concat and q6_k flat gemv (llama/24160)	2026-06-08 14:36:36 +03:00
* opencl: allow multiple workgroups for large rows
* opencl: improve small cpy
* opencl: packed concat for small input
* opencl: tweak flat q6_K gemv, increase N_DST and remap threads
cmake	ggml : Parallelize quant LUT init (llama/23595)	2026-05-25 12:26:07 +03:00
include	TP: quantized KV cache support (llama/23792)	2026-06-08 14:36:36 +03:00
src	opencl: improve get_rows, cpy, concat and q6_k flat gemv (llama/24160)	2026-06-08 14:36:36 +03:00
.gitignore	whisper : reorganize source code + improve CMake (#2256)	2024-06-26 19:34:09 +03:00
CMakeLists.txt	ggml : bump version to 0.13.1 (ggml/1523)	2026-05-29 09:47:30 +03:00