whisper.cpp/ggml
Radoslav Gerganov d26d1c8b85
rpc : cache and reuse compute graphs (llama/15405)
Store the last computed graph and reuse it when possible.
Also do not return response from GRAPH_COMPUTE and assume it always
completes successfully. If this this is not the case, the server closes
the connection. This saves us a network round trip to the server.
2025-12-12 17:53:11 +02:00
..
cmake ggml: Skip backend library linking code when GGML_BACKEND_DL=ON (llama/15094) 2025-08-18 20:30:45 +03:00
include rpc : cache and reuse compute graphs (llama/15405) 2025-12-12 17:53:11 +02:00
src rpc : cache and reuse compute graphs (llama/15405) 2025-12-12 17:53:11 +02:00
.gitignore whisper : reorganize source code + improve CMake (#2256) 2024-06-26 19:34:09 +03:00
CMakeLists.txt ggml : remove dirty flag from version string (ggml/1391) 2025-12-12 17:53:00 +02:00