whisper.cpp/ggml
Akarshan Biswas f7f92d0aab SYCL: Add mrope kernel (llama/13755)
* SYCL: Add mrope kernel

* feat: Optimize rope operations with vectorization

Uses `sycl::vec` to load and store two elements at a time,
significantly improving performance in `rope_norm`,
`rope_neox`, and `rope_multi`. This reduces the number of memory
accesses and leverages SIMD instructions for faster execution.

* Use ceil_div
2025-06-01 15:14:44 +03:00
..
cmake cmake: Factor out CPU architecture detection (llama/13883) 2025-06-01 15:14:44 +03:00
include ggml : add ggml_repeat_4d (llama/13824) 2025-06-01 15:14:44 +03:00
src SYCL: Add mrope kernel (llama/13755) 2025-06-01 15:14:44 +03:00
.gitignore whisper : reorganize source code + improve CMake (#2256) 2024-06-26 19:34:09 +03:00
CMakeLists.txt vulkan: use timestamp queries for GGML_VULKAN_PERF (llama/13817) 2025-06-01 15:14:44 +03:00