whisper.cpp/ggml
Oliver Simons 3eb4905af1 CUDA: Do not mutate cgraph for fused ADDs (llama/19566)
* Do not mutate cgraph for fused ADDs

1. We should try to minimize in-place changes to the incoming
   ggml_cgraph where possible (those should happen in graph_optimize)
2. Modifying in-place leads to an additional, unnecessary graph capture
   step as we store the properties before modifying the graph in-place
   in the cuda-backend

* Assert ggml_tensor is trivially copyable

* Update ggml/src/ggml-cuda/ggml-cuda.cu

Co-authored-by: Aman Gupta <amangupta052@gmail.com>

---------

Co-authored-by: Aman Gupta <amangupta052@gmail.com>
2026-02-15 21:44:37 +02:00
..
cmake cmake : remove unused file (ggml/1419) 2026-02-08 09:29:10 +02:00
include ggml-virtgpu: make the code thread safe (llama/19204) 2026-02-08 09:29:10 +02:00
src CUDA: Do not mutate cgraph for fused ADDs (llama/19566) 2026-02-15 21:44:37 +02:00
.gitignore
CMakeLists.txt Bump cmake max version (needed for Windows on Snapdragon builds) (llama/19188) 2026-02-08 09:29:10 +02:00