whisper.cpp/ggml
Rémy O 3bab804981 vulkan: add specific MMV kernels for IQ2 and IQ3 quants + optimizations (llama/11595)
* vulkan: implement specialized MMV kernels for IQ2 quantizations

* vulkan: add MMV kernels for IQ3 quants

* vulkan: Increase MMV batch size and unroll IQ LUT setup

* vulkan: fix init_iq_shmem for WG sizes larger than tables

* vulkan: common batch size for all I-quants
2025-03-08 15:13:01 +02:00
..
cmake cmake : sync cmake scripts 2025-02-03 22:00:57 +02:00
include ggml-cpu: Support s390x SIMD Instruction Set (llama/12019) 2025-02-27 08:55:36 +02:00
src vulkan: add specific MMV kernels for IQ2 and IQ3 quants + optimizations (llama/11595) 2025-03-08 15:13:01 +02:00
.gitignore whisper : reorganize source code + improve CMake (#2256) 2024-06-26 19:34:09 +03:00
CMakeLists.txt cmake: Fix ggml backend dependencies and installation (llama/11818) 2025-03-08 15:13:01 +02:00