whisper.cpp/ggml/src
Jeff Bolz 71d80aa49e vulkan: don't hold the device mutex while compiling pipelines (llama/23641)
* vulkan: don't hold the device mutex while compiling pipelines

We need to hold a lock while we traverse all pipelines and lazily initialize
them, but we don't need to hold it while the pipeline is being compiled. And
it doesn't need to be the same lock as the device mutex. We call load_shaders
each time a pipeline is needed, so we only need to compile that one pipeline
(and, for example, don't want to end up compiling a pipeline that another
thread should be compiling).

* remove 'needed'
2026-06-08 14:36:36 +03:00
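The locking pattern described in the commit message above can be sketched roughly as follows. This is a minimal illustration, not the ggml-vulkan code: `Device`, `Pipeline`, `PipelineState`, `ensure_pipeline`, `compile_pipeline_slow`, and `compile_concurrently` are all hypothetical names, the sleep stands in for a real shader compile, and having a second requester wait on a condition variable is an assumption about one reasonable way to handle two threads asking for the same pipeline.

```cpp
#include <cassert>
#include <chrono>
#include <condition_variable>
#include <cstddef>
#include <mutex>
#include <thread>
#include <vector>

// Hypothetical pipeline states; the names are illustrative, not ggml-vulkan's.
enum class PipelineState { Uninitialized, Compiling, Ready };

struct Pipeline {
    PipelineState state = PipelineState::Uninitialized;
    int compile_count = 0; // how many times this pipeline was compiled
};

struct Device {
    std::mutex device_mutex;   // guards other device state; never taken below
    std::mutex pipeline_mutex; // separate lock that guards only the pipeline table
    std::condition_variable pipeline_ready;
    std::vector<Pipeline> pipelines;

    explicit Device(std::size_t n) : pipelines(n) {}
};

// Stand-in for a slow shader compile; it runs with no lock held.
static void compile_pipeline_slow() {
    std::this_thread::sleep_for(std::chrono::milliseconds(20));
}

// Make sure the one requested pipeline is compiled, and only that one.
void ensure_pipeline(Device & dev, std::size_t idx) {
    assert(idx < dev.pipelines.size());

    std::unique_lock<std::mutex> lock(dev.pipeline_mutex);
    Pipeline & p = dev.pipelines[idx];

    if (p.state == PipelineState::Compiling) {
        // Another thread already claimed this pipeline: wait for it instead
        // of compiling it a second time (assumed behaviour for this sketch).
        dev.pipeline_ready.wait(lock, [&p] { return p.state == PipelineState::Ready; });
    }
    if (p.state == PipelineState::Ready) {
        return;
    }

    // Claim the pipeline under the lock, then drop the lock for the slow part.
    p.state = PipelineState::Compiling;
    lock.unlock();

    compile_pipeline_slow();

    // Re-take the lock only to publish the result.
    lock.lock();
    p.state = PipelineState::Ready;
    ++p.compile_count;
    lock.unlock();
    dev.pipeline_ready.notify_all();
}

// Spawn one thread per request and report how often each pipeline was compiled.
std::vector<int> compile_concurrently(std::size_t n_pipelines,
                                      const std::vector<std::size_t> & requests) {
    Device dev(n_pipelines);
    std::vector<std::thread> threads;
    for (std::size_t idx : requests) {
        threads.emplace_back([&dev, idx] { ensure_pipeline(dev, idx); });
    }
    for (std::thread & t : threads) {
        t.join();
    }
    std::vector<int> counts;
    for (const Pipeline & p : dev.pipelines) {
        counts.push_back(p.compile_count);
    }
    return counts;
}
```

Calling `compile_concurrently(3, {0, 0, 1, 0, 1})` runs five threads against three pipelines. In this sketch each requested pipeline is compiled once, the unrequested one is never compiled, and threads that want different pipelines do not block each other while compiling.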
ggml-blas vulkan: add get/set tensor 2d functions (llama/22514) 2026-05-01 13:07:35 +03:00
ggml-cann vulkan: add get/set tensor 2d functions (llama/22514) 2026-05-01 13:07:35 +03:00
ggml-cpu ggml : add some lsx support (llama/23798) 2026-06-08 14:36:36 +03:00
ggml-cuda CUDA: Check PTX version on host side to guard PDL dispatch (llama/23530) 2026-06-08 14:36:36 +03:00
ggml-hexagon hexagon: basic/generic op fusion support and RMS_NORM+MUL fusion (llama/23835) 2026-05-29 09:47:30 +03:00
ggml-hip ggml: backend-agnostic tensor parallelism (experimental) (llama/19378) 2026-04-30 11:29:05 +03:00
ggml-metal metal : restore im2col implementation for large kernels (llama/23901) 2026-06-08 14:36:36 +03:00
ggml-musa ggml-cuda: native bf16 flash attention for vec kernel (llama/20525) 2026-03-29 15:04:36 +03:00
ggml-opencl opencl: support bf16 by converting to f16 (llama/23839) 2026-06-08 14:36:36 +03:00
ggml-openvino openvino: driver setup, CI split, thread safety, and NPU optimizations (llama/21944) 2026-04-30 11:29:15 +03:00
ggml-rpc rpc : keep last_graph_uid in the device context (llama/23273) 2026-05-25 12:26:07 +03:00
ggml-sycl Support Q4_1, Q5_0, Q5_1 in Flash-attention (llama/23812) 2026-06-08 14:36:36 +03:00
ggml-virtgpu ggml-virtgpu : include missing mutex header (llama/22810) 2026-05-14 21:26:48 +03:00
ggml-vulkan vulkan: don't hold the device mutex while compiling pipelines (llama/23641) 2026-06-08 14:36:36 +03:00
ggml-webgpu ggml-webgpu: Check earlier for WebGPU required features (llama/23879) 2026-06-08 14:36:36 +03:00
ggml-zdnn vulkan: add get/set tensor 2d functions (llama/22514) 2026-05-01 13:07:35 +03:00
ggml-zendnn ggml-zendnn : fixed naming of matmul function (llama/20964) 2026-05-29 09:47:30 +03:00
CMakeLists.txt ggml : Parallelize quant LUT init (llama/23595) 2026-05-25 12:26:07 +03:00
ggml-alloc.c ggml-alloc: fix out-of-bounds read in ggml_dyn_tallocr_remove_block (ggml/1492) 2026-05-25 12:26:07 +03:00
ggml-backend-dl.cpp hexagon: enable offloading to Hexagon on Windows on Snapdragon (llama/19150) 2026-01-30 15:56:40 +02:00
ggml-backend-dl.h hexagon: enable offloading to Hexagon on Windows on Snapdragon (llama/19150) 2026-01-30 15:56:40 +02:00
ggml-backend-impl.h ggml: backend-agnostic tensor parallelism (experimental) (llama/19378) 2026-04-30 11:29:05 +03:00
ggml-backend-meta.cpp TP: quantized KV cache support (llama/23792) 2026-06-08 14:36:36 +03:00
ggml-backend-reg.cpp ggml : skip already registered backends and devices (llama/22296) 2026-04-30 11:29:21 +03:00
ggml-backend.cpp ggml : Check the right iface method before using the fallback 2d get (llama/23514) 2026-05-25 12:26:07 +03:00
ggml-common.h ggml: add Q1_0 1-bit quantization support (CPU) (llama/21273) 2026-04-30 11:29:01 +03:00
ggml-impl.h ggml: add graph_reused (llama/21764) 2026-04-30 11:29:11 +03:00
ggml-opt.cpp fix: free ctx_copy in ggml_opt_free to plug per-training-session leak (llama/21592) 2026-04-30 11:29:03 +03:00
ggml-quants.c ggml : Parallelize quant LUT init (llama/23595) 2026-05-25 12:26:07 +03:00
ggml-quants.h ggml: add Q1_0 1-bit quantization support (CPU) (llama/21273) 2026-04-30 11:29:01 +03:00
ggml-threading.cpp ggml : build backends as libraries (llama/10256) 2024-11-20 21:00:08 +02:00
ggml-threading.h remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (llama/10797) 2024-12-18 12:52:16 +02:00
ggml.c model : support for DeepseekV32ForCausalLM with generic DeepSeek Sparse Attention (DSA) implementation (llama/23346) 2026-06-08 14:36:36 +03:00
ggml.cpp ggml : Print backtrace on uncaught C++ exceptions (ggml/1232) 2025-05-29 09:56:26 +03:00
gguf.cpp ggml: `gguf_init_from_callback` and `gguf_init_from_buffer` (llama/22341) 2026-05-25 12:44:04 +03:00