whisper.cpp/ggml
David Huang edd1d8686a HIP: implement FlashAttention via rocWMMA for CDNA and RDNA3+ (llama/12032)
Adds GGML_HIP_ROCWMMA_FATTN and rocwmma header check
Adds rocWMMA support to fattn-wmma-f16
2025-03-08 15:13:01 +02:00
..
cmake cmake : sync cmake scripts 2025-02-03 22:00:57 +02:00
include ggml : upgrade init_tensor API to return a ggml_status (llama/11854) 2025-03-08 15:13:01 +02:00
src HIP: implement FlashAttention via rocWMMA for CDNA and RDNA3+ (llama/12032) 2025-03-08 15:13:01 +02:00
.gitignore whisper : reorganize source code + improve CMake (#2256) 2024-06-26 19:34:09 +03:00
CMakeLists.txt HIP: implement FlashAttention via rocWMMA for CDNA and RDNA3+ (llama/12032) 2025-03-08 15:13:01 +02:00