* ggml : add CUDA support for ggml_conv * whisper : remove ggml_repeat for conv bias + single backend * cuda : fix im2col kernel * metal : add im2col support + mul mat-vec f16 x f16 * bench-all : add q4 models |
||
|---|---|---|
| .. | ||
| bench-all.sh | ||
| bench-wts.sh | ||
| bench.py | ||
| convert-all.sh | ||
| deploy-wasm.sh | ||
| quantize-all.sh | ||
| sha-all.sh | ||
| sync-ggml.sh | ||