whisper.cpp/ggml/include
Gaurav Garg 1a1900f90c Remove padding and multiple D2D copies for MTP (llama/24086)
* Make ggml_gated_delta_net take only the initial recurrent state (D, 1, n_seqs) and pass the snapshot count K as an op parameter instead of inferring it from state->ne[1].

Remove the padding hack and copy all emitted snapshots into the recurrent cache with a single strided ggml_cpy.

* Make GDN changes in all backends. Address review comments.

* Fix CI build errors
2026-06-15 10:33:53 +03:00
ggml-alloc.h
ggml-backend.h
ggml-blas.h
ggml-cann.h
ggml-cpp.h
ggml-cpu.h
ggml-cuda.h
ggml-hexagon.h
ggml-metal.h
ggml-opencl.h
ggml-openvino.h
ggml-opt.h
ggml-rpc.h
ggml-sycl.h
ggml-virtgpu.h
ggml-vulkan.h
ggml-webgpu.h
ggml-zdnn.h
ggml-zendnn.h
ggml.h
gguf.h