whisper.cpp

Jeff Bolz 71d80aa49e vulkan: don't hold the device mutex while compiling pipelines (llama/23641)	2026-06-08 14:36:36 +03:00

* vulkan: don't hold the device mutex while compiling pipelines

  We need to hold a lock while we traverse all pipelines and lazily initialize them, but we don't need to hold it while the pipeline is being compiled. And it doesn't need to be the same lock as the device mutex. We call load_shaders each time a pipeline is needed, so we only need to compile that one pipeline (and, for example, don't want to end up compiling a pipeline that another thread should be compiling).

* remove 'needed'
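
The locking pattern described in the commit message above can be sketched roughly as follows. This is a minimal illustration, not the ggml-vulkan code: `Pipeline`, `PipelineCache`, `ensure_compiled`, `compile`, and `compile_calls` are hypothetical names, the sleep stands in for shader compilation, and the condition-variable wait is an assumed way of making sure a thread does not compile a pipeline that another thread is already compiling.

```cpp
#include <atomic>
#include <chrono>
#include <condition_variable>
#include <cstddef>
#include <mutex>
#include <thread>
#include <vector>

// Hypothetical stand-in for a lazily compiled pipeline (not the ggml-vulkan type).
struct Pipeline {
    bool compiled  = false; // guarded by PipelineCache::mtx
    bool compiling = false; // guarded by PipelineCache::mtx
};

struct PipelineCache {
    std::mutex              mtx; // dedicated lock, separate from any device mutex
    std::condition_variable cv;
    std::vector<Pipeline>   pipelines; // sized once, never resized
    std::atomic<int>        compile_calls{0};

    explicit PipelineCache(std::size_t n) : pipelines(n) {}

    // Placeholder for the expensive compilation step (assumption: it is slow).
    void compile(std::size_t /*idx*/) {
        std::this_thread::sleep_for(std::chrono::milliseconds(10));
        compile_calls.fetch_add(1);
    }

    // Make sure pipeline idx is compiled, compiling it at most once.
    void ensure_compiled(std::size_t idx) {
        std::unique_lock<std::mutex> lock(mtx);
        Pipeline & p = pipelines[idx];
        if (p.compiled) {
            return;
        }
        if (p.compiling) {
            // Another thread owns this compile; wait instead of duplicating it.
            cv.wait(lock, [&p] { return p.compiled; });
            return;
        }
        p.compiling = true;
        lock.unlock();   // the lock is not held during the slow compile
        compile(idx);
        lock.lock();
        p.compiled  = true;
        p.compiling = false;
        cv.notify_all();
    }
};
```

If several threads call `ensure_compiled(0)` at the same time, one of them compiles and the rest wait on the condition variable, while threads asking for a different index can proceed without being blocked by the compile in progress.
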
ggml-blas	vulkan: add get/set tensor 2d functions (llama/22514)	2026-05-01 13:07:35 +03:00
ggml-cann	vulkan: add get/set tensor 2d functions (llama/22514)	2026-05-01 13:07:35 +03:00
ggml-cpu	ggml : add some lsx support (llama/23798)	2026-06-08 14:36:36 +03:00
ggml-cuda	CUDA: Check PTX version on host side to guard PDL dispatch (llama/23530)	2026-06-08 14:36:36 +03:00
ggml-hexagon	hexagon: basic/generic op fusion support and RMS_NORM+MUL fusion (llama/23835)	2026-05-29 09:47:30 +03:00
ggml-hip	ggml: backend-agnostic tensor parallelism (experimental) (llama/19378)	2026-04-30 11:29:05 +03:00
ggml-metal	metal : restore im2col implementation for large kernels (llama/23901)	2026-06-08 14:36:36 +03:00
ggml-musa	ggml-cuda: native bf16 flash attention for vec kernel (llama/20525)	2026-03-29 15:04:36 +03:00
ggml-opencl	opencl: support bf16 by converting to f16 (llama/23839)	2026-06-08 14:36:36 +03:00
ggml-openvino	openvino: driver setup, CI split, thread safety, and NPU optimizations (llama/21944)	2026-04-30 11:29:15 +03:00
ggml-rpc	rpc : keep last_graph_uid in the device context (llama/23273)	2026-05-25 12:26:07 +03:00
ggml-sycl	Support Q4_1, Q5_0, Q5_1 in Flash-attention (llama/23812)	2026-06-08 14:36:36 +03:00
ggml-virtgpu	ggml-virtgpu : include missing mutex header (llama/22810)	2026-05-14 21:26:48 +03:00
ggml-vulkan	vulkan: don't hold the device mutex while compiling pipelines (llama/23641)	2026-06-08 14:36:36 +03:00
ggml-webgpu	ggml-webgpu: Check earlier for WebGPU required features (llama/23879)	2026-06-08 14:36:36 +03:00
ggml-zdnn	vulkan: add get/set tensor 2d functions (llama/22514)	2026-05-01 13:07:35 +03:00
ggml-zendnn	ggml-zendnn : fixed naming of matmul function (llama/20964)	2026-05-29 09:47:30 +03:00
CMakeLists.txt	ggml : Parallelize quant LUT init (llama/23595)	2026-05-25 12:26:07 +03:00
ggml-alloc.c	ggml-alloc: fix out-of-bounds read in ggml_dyn_tallocr_remove_block (ggml/1492)	2026-05-25 12:26:07 +03:00
ggml-backend-dl.cpp	hexagon: enable offloading to Hexagon on Windows on Snapdragon (llama/19150)	2026-01-30 15:56:40 +02:00
ggml-backend-dl.h	hexagon: enable offloading to Hexagon on Windows on Snapdragon (llama/19150)	2026-01-30 15:56:40 +02:00
ggml-backend-impl.h	ggml: backend-agnostic tensor parallelism (experimental) (llama/19378)	2026-04-30 11:29:05 +03:00
ggml-backend-meta.cpp	TP: quantized KV cache support (llama/23792)	2026-06-08 14:36:36 +03:00
ggml-backend-reg.cpp	ggml : skip already registered backends and devices (llama/22296)	2026-04-30 11:29:21 +03:00
ggml-backend.cpp	ggml : Check the right iface method before using the fallback 2d get (llama/23514)	2026-05-25 12:26:07 +03:00
ggml-common.h	ggml: add Q1_0 1-bit quantization support (CPU) (llama/21273)	2026-04-30 11:29:01 +03:00
ggml-impl.h	ggml: add graph_reused (llama/21764)	2026-04-30 11:29:11 +03:00
ggml-opt.cpp	fix: free ctx_copy in ggml_opt_free to plug per-training-session leak (llama/21592)	2026-04-30 11:29:03 +03:00
ggml-quants.c	ggml : Parallelize quant LUT init (llama/23595)	2026-05-25 12:26:07 +03:00
ggml-quants.h	ggml: add Q1_0 1-bit quantization support (CPU) (llama/21273)	2026-04-30 11:29:01 +03:00
ggml-threading.cpp	ggml : build backends as libraries (llama/10256)	2024-11-20 21:00:08 +02:00
ggml-threading.h	remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (llama/10797)	2024-12-18 12:52:16 +02:00
ggml.c	model : support for DeepseekV32ForCausalLM with generic DeepSeek Sparse Attention (DSA) implementation (llama/23346)	2026-06-08 14:36:36 +03:00
ggml.cpp	ggml : Print backtrace on uncaught C++ exceptions (ggml/1232)	2025-05-29 09:56:26 +03:00
gguf.cpp	ggml: `gguf_init_from_callback` and `gguf_init_from_buffer` (llama/22341)	2026-05-25 12:44:04 +03:00