whisper.cpp

Max Krasnyansky ffe1c832bd cpu: introduce chunking for repack matmuls and enable matmul-id chunking on ARM64 (llama/16833) Very similar implementation to the flash-attention chunking, with similar benefits.		2025-11-09 23:38:03 +02:00
cmake	ggml: Skip backend library linking code when GGML_BACKEND_DL=ON (llama/15094)	2025-08-18 20:30:45 +03:00
include	model: add support for qwen3vl series (llama/16780)	2025-11-09 23:38:03 +02:00
src	cpu: introduce chunking for repack matmuls and enable matmul-id chunking on ARM64 (llama/16833)	2025-11-09 23:38:03 +02:00
.gitignore	whisper : reorganize source code + improve CMake (#2256)	2024-06-26 19:34:09 +03:00
CMakeLists.txt	Add experimental ggml-hexagon backend for the Hexagon NPU (llama/16547)	2025-11-09 23:38:03 +02:00