whisper.cpp/ggml
neha-ha 5f21fdcbb9
ggml-webgpu: updated matrix-vector multiplication (llama/21738)
* merged properly, but slow q3_k and q5_k with u32 indexing

* Start on new mat-vec

* New format float paths working

* Working q4_0

* Work on remaining legacy q-types

* port k-quants to new matvec

* remove old shader

* Remove old constants, format

* remove accidental file

---------

Co-authored-by: Neha Abbas <nehaabbas@ReeseLevines-MacBook-Pro.local>
Co-authored-by: Reese Levine <reeselevine1@gmail.com>
2026-04-30 11:29:13 +03:00
..
cmake cmake : remove unused file (ggml/1419) 2026-02-08 09:29:10 +02:00
include CUDA: manage NCCL communicators in context (llama/21891) 2026-04-30 11:29:09 +03:00
src ggml-webgpu: updated matrix-vector multiplication (llama/21738) 2026-04-30 11:29:13 +03:00
.gitignore
CMakeLists.txt cmake: remove CMP0194 policy to restore MSVC builds (llama/21934) 2026-04-30 11:29:12 +03:00