whisper.cpp/ggml
Diego Devesa 622dec5bf6
sched : copy only the used experts when offloading prompt processing (llama/15346)
2025-09-20 13:42:38 +03:00
cmake | ggml: Skip backend library linking code when GGML_BACKEND_DL=ON (llama/15094) | 2025-08-18 20:30:45 +03:00
include | ggml: initial IBM zDNN backend (llama/14975) | 2025-08-18 20:30:45 +03:00
src | sched : copy only the used experts when offloading prompt processing (llama/15346) | 2025-09-20 13:42:38 +03:00
.gitignore | whisper : reorganize source code + improve CMake (#2256) | 2024-06-26 19:34:09 +03:00
CMakeLists.txt | CUDA: replace GGML_CUDA_F16 with CUDA arch checks (llama/15433) | 2025-09-20 13:42:38 +03:00