whisper.cpp/ggml
Nikhil Jain 490e50056c Implement 2D workgroups for scale, binary, and unary ops (llama/24044)
* Only run webgpu CI on my fork

* Add webgpu only workflow

* Implement 2d workgroups for more operations

* fix

* Fix type

* Move back to global_invocation_id
2026-06-15 10:33:53 +03:00
..
cmake ggml : Parallelize quant LUT init (llama/23595) 2026-05-25 12:26:07 +03:00
include TP: quantized KV cache support (llama/23792) 2026-06-08 14:36:36 +03:00
src Implement 2D workgroups for scale, binary, and unary ops (llama/24044) 2026-06-15 10:33:53 +03:00
.gitignore whisper : reorganize source code + improve CMake (#2256) 2024-06-26 19:34:09 +03:00
CMakeLists.txt ggml : bump version to 0.14.0 (ggml/1533) 2026-06-08 14:36:36 +03:00