whisper.cpp

yulo c6a495ae5d HIP: add fattn-mma-f16 for RDNA4 (llama/18481)	2026-01-30 15:56:40 +02:00

* finish VQ mma
* flash_attn_ext_f16_iter
* KQ_rowsum
* correct exp
* fix scale error
* fix softmax scale
* fix softmax scale
* enable fattn on cpu side
* fix random error
* disable fattn-mma-f16 on rdna3
* fix wrong col for rdna
* use identity mat to transpose
* resolve conflicts
* basic tuning for DeepSeek-R1-Distill-Qwen-1.5B
* fix volta compile error
* align rdna4 policy for fattn
* adjust fattn policy
* adjust kernel selection logic
* update as the review comments
* keep fattn-wmma logic
* adjust kernel selection logic

---------

Co-authored-by: zhang hui <you@example.com>
Co-authored-by: Johannes Gäßler <johannesg@5d6.de>

cmake	ggml: Skip backend library linking code when GGML_BACKEND_DL=ON (llama/15094)	2025-08-18 20:30:45 +03:00
include	ggml-webgpu: Fix GGML_MEM_ALIGN to 8 for emscripten. (llama/18628)	2026-01-14 09:11:59 +02:00
src	HIP: add fattn-mma-f16 for RDNA4 (llama/18481)	2026-01-30 15:56:40 +02:00
.gitignore	…
CMakeLists.txt	ggml : bump version to 0.9.5 (ggml/1410)	2025-12-31 18:27:20 +02:00