whisper.cpp/ggml
Srihari-mcw 1c6cb7df47 ggml : Q2k interleaving implementation - x86/x64 SIMD (llama/14373)
* Initial Q2_K Block Interleaving Implementation

* Addressed review comments and clean up of the code

* Post rebase fixes

* Initial CI/CD fixes

* Update declarations in arch-fallback.h

* Changes for GEMV Q2_K in arch-fallback.h

* Enable repacking only on AVX-512 machines

* Update comments in repack.cpp

* Address q2k comments

---------

Co-authored-by: Manogna-Sree <elisetti.manognasree@multicorewareinc.com>
2025-08-18 20:30:45 +03:00
cmake cmake : Fix BLAS link interface (ggml/1316) 2025-08-18 20:30:45 +03:00
include ggml : remove old kompute, cann (skip) (#3349) 2025-07-30 16:08:57 +03:00
src ggml : Q2k interleaving implementation - x86/x64 SIMD (llama/14373) 2025-08-18 20:30:45 +03:00
.gitignore whisper : reorganize source code + improve CMake (#2256) 2024-06-26 19:34:09 +03:00
CMakeLists.txt HIP: add GGML_HIP_MMQ_MFMA option to allow disabling the MFMA path. (llama/14930) 2025-08-18 20:30:45 +03:00