whisper.cpp/ggml
Reese Levine 27ebde6afd ggml webgpu: add support for soft_max, optimize rms_norm (llama/16357)
* Add inplace softmax

* Move rms_norm to split row approach

* Update debug for supports_op

* clean up debug statements

* Update tests/test-backend-ops.cpp

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2025-10-12 11:16:23 +03:00
..
cmake ggml: Skip backend library linking code when GGML_BACKEND_DL=ON (llama/15094) 2025-08-18 20:30:45 +03:00
include ggml webgpu: add support for soft_max, optimize rms_norm (llama/16357) 2025-10-12 11:16:23 +03:00
src ggml webgpu: add support for soft_max, optimize rms_norm (llama/16357) 2025-10-12 11:16:23 +03:00
.gitignore
CMakeLists.txt HIP: Disable ROCWMMA fattn on CDNA when compiled against ROCWMMA 2.0.0 (llama/16221) 2025-10-12 11:16:23 +03:00