whisper.cpp

Georgi Gerganov 36019c35a3 graph : make FA compatible with MLA + add initial Metal kernels (llama/12953)	2025-04-24 20:39:16 +03:00
* graph : make mla compatible with FA
* metal : add exp FA kernels for DeepSeek models ggml-ci
* llama : minor naming updates ggml-ci
* ggml : disable FA for DS head sizes
* tests : add FA tests for MLA shapes ggml-ci
cmake	ggml : sync/merge cmake,riscv,powerpc, add common.cmake (ggml/0)	2025-03-27 11:06:03 +02:00
include	ggml : Depthwise 2D convolution (ggml/1152)	2025-04-24 20:39:16 +03:00
src	graph : make FA compatible with MLA + add initial Metal kernels (llama/12953)	2025-04-24 20:39:16 +03:00
.gitignore	whisper : reorganize source code + improve CMake (#2256)	2024-06-26 19:34:09 +03:00
CMakeLists.txt	CUDA/HIP: Share the same unified memory allocation logic. (llama/12934)	2025-04-24 20:39:16 +03:00