whisper.cpp/ggml
Johannes Gäßler d6e7315717 CUDA: use mma FA kernel for gqa > 4 on RTX 4000 (llama/15035) 2025-08-18 20:30:45 +03:00
..
cmake cmake : Fix BLAS link interface (ggml/1316) 2025-08-18 20:30:45 +03:00
include ggml : remove old kompute, cann (skip) (#3349) 2025-07-30 16:08:57 +03:00
src CUDA: use mma FA kernel for gqa > 4 on RTX 4000 (llama/15035) 2025-08-18 20:30:45 +03:00
.gitignore whisper : reorganize source code + improve CMake (#2256) 2024-06-26 19:34:09 +03:00
CMakeLists.txt HIP: add GGML_HIP_MMQ_MFMA option to allow disableing the MFMA path. (llama/14930) 2025-08-18 20:30:45 +03:00