whisper.cpp/ggml
R0CKSTAR a5e6a3c953 musa: enable fp16 mma (all) and cublas on qy2 (llama/13842)
* musa: enable fp16 mma (all) and cublas on qy2

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

* Update ggml/src/ggml-cuda/ggml-cuda.cu

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>

* Address review comments

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

* Address review comments

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

* musa: disable MUL_MAT_ID (q2_k × f32) due to precision issues

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

---------

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
2025-07-01 17:54:53 +03:00
cmake           ggml-cpu : rework weak alias on apple targets (llama/14146)  2025-06-18 12:40:34 +03:00
include         ggml-cpu: enable IBM NNPA Vector Intrinsics (llama/14317)    2025-07-01 17:54:53 +03:00
src             musa: enable fp16 mma (all) and cublas on qy2 (llama/13842)  2025-07-01 17:54:53 +03:00
.gitignore      whisper : reorganize source code + improve CMake (#2256)     2024-06-26 19:34:09 +03:00
CMakeLists.txt  ggml-cpu: enable IBM NNPA Vector Intrinsics (llama/14317)    2025-07-01 17:54:53 +03:00