whisper.cpp/ggml
Jeff Bolz cea5f1c52f vulkan: optimize mul_mat for small values of N (llama/10991)
Make the mul_mat_vec shaders support N>1 (as a spec constant, NUM_COLS) where
the batch_strides are overloaded to hold the row strides. Put the loads from the
B matrix in the innermost loop because it should cache better.

Share some code for reducing the result values to memory in mul_mat_vec_base.
2025-01-04 10:45:01 +02:00
..
include tts : add OuteTTS support (llama/10784) 2025-01-04 10:45:01 +02:00
src vulkan: optimize mul_mat for small values of N (llama/10991) 2025-01-04 10:45:01 +02:00
.gitignore whisper : reorganize source code + improve CMake (#2256) 2024-06-26 19:34:09 +03:00
CMakeLists.txt ggml : fix arm build (llama/10890) 2025-01-04 10:45:01 +02:00