whisper.cpp

Radoslav Gerganov	321f628239	2026-04-30 11:29:00 +03:00

rpc : reuse compute graph buffers (llama/21299)

Reuse the buffer for the ggml context which is used for creating the compute graph on the server side. This partially addresses a memory leak created by the CUDA backend due to using buffer addresses as cache keys.

ref: #21265
ref: #20315

cmake	cmake : remove unused file (ggml/1419)	2026-02-08 09:29:10 +02:00
include	llama: fix llama-model-saver (llama/20503)	2026-03-29 15:04:36 +03:00
src	rpc : reuse compute graph buffers (llama/21299)	2026-04-30 11:29:00 +03:00
.gitignore	…
CMakeLists.txt	ggml : bump version to 0.9.11 (ggml/1456)	2026-04-30 11:29:00 +03:00