* Start work on flash_attn refactor * Refactor * Split k/v quantization * Refactor and abstract quantization logic for flash_attn and mul_mat * Add quantization support to tile path * formatting * Move to functions, add a check |
||
|---|---|---|
| .. | ||
| wgsl-shaders | ||
| CMakeLists.txt | ||
| ggml-webgpu-shader-lib.hpp | ||
| ggml-webgpu.cpp | ||
| pre_wgsl.hpp | ||