whisper.cpp

History

Bizhao Shi e35fecc2a1 CANN: Add the basic supports of Flash Attention kernel (llama/13627) * cann: add the basic FA support * cann: update the readme * cann: update the FlashAttention with PSEShift * cann: update the input parameters in FA * cann: update the alibi with max_bias * cann: add the constrints of softcap * cann: update the docs CANN.md * cann: update the docs CANN.md * cann: fix typo of CANN.md * cann: add some comments and update the CANN.md * cann: update the CANN.md * cann: update the inner precise for fusedInferAttention * cann: update the constraints of flash_attn_ext on ggml-cann.cpp * cann: clean the whitespace * cann: clean the whitespace * cann: add a new endline		2025-05-27 18:03:00 +03:00
..
cmake	ggml : sync/merge cmake,riscv,powerpc, add common.cmake (ggml/0)	2025-03-27 11:06:03 +02:00
include	ggml : fix the order of ggml_unary_op (llama/13718)	2025-05-27 18:03:00 +03:00
src	CANN: Add the basic supports of Flash Attention kernel (llama/13627)	2025-05-27 18:03:00 +03:00
.gitignore	whisper : reorganize source code + improve CMake (#2256 )	2024-06-26 19:34:09 +03:00
CMakeLists.txt	sycl: use oneDNN for matrices multiplication (llama/12972)	2025-05-19 14:58:39 +03:00