whisper.cpp/ggml
uvos 96ab3b2465 CUDA/HIP: Fix fattn-vec-* when device warp size is not 32 (llama/12315)
When fattn-wmma was ported over to warp64 various bits that also touch fattn-vec where converted to
selectable warp size, however the fattn-vec kernels dont work with 64 wide warps for now, so we need
to avoid launching them with parameters for warp64
2025-03-27 11:06:03 +02:00
..
cmake cmake: Comment out GGML_BIN_DIR for now (ggml/1139) 2025-03-27 11:06:03 +02:00
include ggml-cpu: Faster IQ1 mul_mat_vec on AVX2 using BMI2 instructions (llama/12154) 2025-03-08 15:13:01 +02:00
src CUDA/HIP: Fix fattn-vec-* when device warp size is not 32 (llama/12315) 2025-03-27 11:06:03 +02:00
.gitignore whisper : reorganize source code + improve CMake (#2256) 2024-06-26 19:34:09 +03:00
CMakeLists.txt opencl: use OpenCL C standard supported by the device (llama/12221) 2025-03-27 11:06:03 +02:00