whisper.cpp/ggml
Jeff Bolz 91ab93b756
vulkan: handle mat_mul with A matrix > 4GB (llama/16176)
* vulkan: handle mat_mul with A matrix > 4GB

This change splits mat_mul operations with a huge A matrix into chunks along
the M dimension. This works well for stable-diffusion use cases where the
im2col matrix has a very large M.

Fix the order of setting the stride in mul_mm_cm2 - setting the dimension
clobbers the stride, so stride should be set after.

* build fixes
2025-09-29 15:18:12 +03:00
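
The M-dimension chunking described in the commit message above can be illustrated with a small CPU-side sketch. This is not the Vulkan backend code: `RowChunk`, `split_m`, `matmul_chunked`, and the `max_bytes` parameter are hypothetical names introduced here, and the row-major layout is an assumption. The idea shown is only that each chunk of A covers whole rows and stays under a size limit, and that each chunk produces the matching rows of the output independently.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <stdexcept>
#include <vector>

// Hypothetical helper type: a contiguous range of rows of A (and of the output C).
struct RowChunk {
    uint64_t first_row;
    uint64_t num_rows;
};

// Split the M rows of a row-major M x K matrix so that every chunk occupies
// at most max_bytes. max_bytes stands in for whatever size limit applies
// (the commit title mentions 4GB); it is an assumption of this sketch.
std::vector<RowChunk> split_m(uint64_t M, uint64_t K, uint64_t elem_size, uint64_t max_bytes) {
    const uint64_t row_bytes = K * elem_size;
    if (row_bytes == 0 || row_bytes > max_bytes) {
        throw std::invalid_argument("a single row must be non-empty and fit in max_bytes");
    }
    const uint64_t rows_per_chunk = max_bytes / row_bytes;
    std::vector<RowChunk> chunks;
    for (uint64_t row = 0; row < M; row += rows_per_chunk) {
        chunks.push_back({row, std::min(rows_per_chunk, M - row)});
    }
    return chunks;
}

// Reference C = A * B (A is M x K, B is K x N, all row-major), computed one
// M-chunk at a time. Each chunk only reads its own rows of A and writes its
// own rows of C, so chunks do not depend on each other.
void matmul_chunked(const std::vector<float>& A, const std::vector<float>& B,
                    std::vector<float>& C, uint64_t M, uint64_t K, uint64_t N,
                    uint64_t max_bytes) {
    assert(A.size() == M * K && B.size() == K * N);
    C.assign(M * N, 0.0f);
    for (const RowChunk& chunk : split_m(M, K, sizeof(float), max_bytes)) {
        const uint64_t end_row = chunk.first_row + chunk.num_rows;
        for (uint64_t m = chunk.first_row; m < end_row; ++m) {
            for (uint64_t k = 0; k < K; ++k) {
                for (uint64_t n = 0; n < N; ++n) {
                    C[m * N + n] += A[m * K + k] * B[k * N + n];
                }
            }
        }
    }
}
```

Splitting along M works because every output row depends on exactly one row of A, so no partial sums have to be combined across chunks. Calling `split_m(10, 4, 4, 48)`, for instance, yields three chunks of 3 rows followed by one chunk of 1 row.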
cmake           ggml: Skip backend library linking code when GGML_BACKEND_DL=ON (llama/15094)    2025-08-18 20:30:45 +03:00
include         llama: print memory breakdown on exit (llama/15860)                              2025-09-29 15:18:10 +03:00
src             vulkan: handle mat_mul with A matrix > 4GB (llama/16176)                         2025-09-29 15:18:12 +03:00
.gitignore      whisper : reorganize source code + improve CMake (#2256)                         2024-06-26 19:34:09 +03:00
CMakeLists.txt  common : use cpp-httplib as a cURL alternative for downloads (llama/16185)       2025-09-29 15:18:11 +03:00