whisper.cpp/ggml
fj-y-saito db6383094c ggml: aarch64: implement SVE kernels for q4_K_q8_K vector dot (llama/11227)
* Add SVE support for q4_K_q8_K

* Update ggml/src/ggml-cpu/ggml-cpu-quants.c

change to use K_SCALE_SIZE

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2025-02-03 22:00:57 +02:00
..
include RoPE: fix back, CUDA support for back + noncont. (llama/11240) 2025-02-03 22:00:57 +02:00
src ggml: aarch64: implement SVE kernels for q4_K_q8_K vector dot (llama/11227) 2025-02-03 22:00:57 +02:00
.gitignore whisper : reorganize source code + improve CMake (#2256) 2024-06-26 19:34:09 +03:00
CMakeLists.txt fix: ggml: fix vulkan-shaders-gen build (llama/10448) 2025-02-03 22:00:57 +02:00