whisper.cpp/ggml
fxzjshm c310272fa0 HIP: force max threads per block to be 1024 (llama/11621)
Some old or vendor-forked versions of LLVM still use 256. Explicitly set it to 1024 to align with upstream LLVM.

Signed-off-by: fxzjshm <fxzjshm@163.com>
2025-02-27 08:55:36 +02:00
cmake           cmake : sync cmake scripts                                       2025-02-03 22:00:57 +02:00
include         CUDA: use mma PTX instructions for FlashAttention (llama/11583)  2025-02-03 22:00:57 +02:00
src             HIP: force max threads per block to be 1024 (llama/11621)        2025-02-27 08:55:36 +02:00
.gitignore      whisper : reorganize source code + improve CMake (#2256)         2024-06-26 19:34:09 +03:00
CMakeLists.txt  cmake: Add ability to pass in GGML_BUILD_NUMBER (ggml/1096)      2025-02-04 13:03:03 +02:00