whisper.cpp

History

Jeff Bolz 91ab93b756 vulkan: handle mat_mul with A matrix > 4GB (llama/16176) * vulkan: handle mat_mul with A matrix > 4GB This change splits mat_mul operations with huge A matrix into chunks in the M dimension. This works well for stable-diffusion use cases where the im2col matrix has very large M. Fix the order of setting the stride in mul_mm_cm2 - setting the dimension clobbers the stride, so stride should be set after. * build fixes		2025-09-29 15:18:12 +03:00
..
cmake	ggml: Skip backend library linking code when GGML_BACKEND_DL=ON (llama/15094)	2025-08-18 20:30:45 +03:00
include	llama: print memory breakdown on exit (llama/15860)	2025-09-29 15:18:10 +03:00
src	vulkan: handle mat_mul with A matrix > 4GB (llama/16176)	2025-09-29 15:18:12 +03:00
.gitignore	whisper : reorganize source code + improve CMake (#2256 )	2024-06-26 19:34:09 +03:00
CMakeLists.txt	common : use cpp-httplib as a cURL alternative for downloads (llama/16185)	2025-09-29 15:18:11 +03:00