whisper.cpp/ggml
Shawn Gu 07854cd146
opencl: Adreno optimization for MoE - MxFP4 (llama/22301)
* MoE Mxfp4 CLC kernel added, router reorder on GPU

* Pass test-backend-ops for MoE mxfp4 Adreno CLC

* remove putenv in llama-model.cpp

* fix indent style and whitespace

* opencl: remove unnecessary headers

* opencl: do not save cl_program objects

* opencl: remove unnecessary assert

* fix precision issue

---------

Co-authored-by: Li He <lih@qti.qualcomm.com>
2026-05-10 17:26:27 +03:00
cmake           cmake : add FindNCCL.cmake (ggml/0)                         2026-05-02 15:02:42 +03:00
include         CUDA: manage NCCL communicators in context (llama/21891)    2026-04-30 11:29:09 +03:00
src             opencl: Adreno optimization for MoE - MxFP4 (llama/22301)   2026-05-10 17:26:27 +03:00
.gitignore
CMakeLists.txt  ggml : bump version to 0.10.2 (ggml/1474)                   2026-05-02 15:02:42 +03:00