whisper.cpp

Commit Graph

Author	SHA1	Message	Date
Jeff Bolz	21b01a21b6	vulkan: Optimize contiguous copies (llama/10254) * tests: Fix memory bandwidth calculation for perf tests Add a flops calculation for flash attention. Add one GGML_OP_CPY perf test. * vulkan: Optimize contiguous copies Add a variant of the copy shader for when the tensors are contiguous. Avoid the complex addressing calculations, and do four elements per invocation to hide some other overhead. Apply similar changes to the scale shader, since scale is always contiguous. Add a "progress bar" for shader compiles.	2024-11-15 15:21:04 +02:00
Jeff Bolz	b54ce5edc5	vulkan: Throttle the number of shader compiles during the build step. (llama/10222) Fixes #9582 Spawning too many concurrent copies of glslc leads to "Failed to create pipes" errors on Linux. This change applies the same throttling we use for multithreaded pipeline creation.	2024-11-15 15:21:04 +02:00
Changyeon Kim	307712a903	ggml: Add POOL2D OP for GPU acceleration to the Vulkan backend in the MobileVLM model. (llama/9763) * ggml: Add POOL2D OP for GPU ACC to the Vulkan. - The MobileVLM model now supports inference acceleration through GPU by utilizing the Vulkan backend. - A GGML_OP_POOL_2D shader has been added. (Pooling) - The encoding performance of the CLIP model improved from 2.8s on the CPU to 0.7s on the GPU. Signed-off-by: Changyeon Kim <cyzero.kim@samsung.com> * [fix] Correct the incorrect order of the parameters. fix casting to int. Signed-off-by: Changyeon Kim <cyzero.kim@samsung.com> --------- Signed-off-by: Changyeon Kim <cyzero.kim@samsung.com>	2024-11-15 15:21:04 +02:00
Markus Tavenrath	9e715e1b96	Improve Vulkan shader build system (llama/9239) * Improve Vulkan shader builds system - Add dependency to vulkan-shaders-gen to rebuild shaders when changing the shader compilation utility. - Add option to generate debug info for Vulkan shaders to provide shader source to Vulkan shader profiling tools * remove not required self dependency	2024-09-24 19:45:08 +03:00
Georgi Gerganov	d8e24b877d	vulkan : fix build (llama/0) ggml-ci	2024-09-02 15:24:50 +03:00
Georgi Gerganov	82b5c56f63	sync : vulkan (skip) (llama/0)	2024-08-28 13:22:20 +03:00
Georgi Gerganov	9e3c5345cd	sync : ggml vulkan (ggml/0) ggml-ci	2024-08-21 11:07:13 +03:00

7 Commits