whisper.cpp/ggml/src
tamarPal 0e1b6c5fc4 sycl: add ROLL operation support (llama/16665)
* sycl: add ROLL operation support

- Implement ggml_sycl_roll function for F32 tensors
- Add multi-axis roll operation with SYCL kernel
- Support all 4 tensor dimensions with proper shift normalization
- Add roll.cpp and roll.hpp to SYCL backend
- Update backend dispatch and supports_op for GGML_OP_ROLL
- Tests: 17662/17662 pass with identical CPU reference results

* fix: remove trailing whitespace from roll.cpp

- Fix EditorConfig violations in ggml/src/ggml-sycl/roll.cpp
- Remove trailing spaces from lines 6, 11, 28, 47, 58, 60

* ci: retrigger

* sycl: remove wait() calls from ROLL operation

* fix: editorconfig — LF endings + final newline for roll.hpp

---------

Co-authored-by: tamarPal <tamarPal@example.com>
2025-11-09 23:38:03 +02:00
..
ggml-blas rename optimize_graph to graph_optimize (llama/16082) 2025-09-20 13:46:39 +03:00
ggml-cann CANN: format code using .clang-format (llama/15863) 2025-10-22 12:58:11 +03:00
ggml-cpu Revert "ggml : Leverage the existing GGML_F32_VEC helpers to vectorize ggml_v…" (#16723) 2025-11-09 23:38:03 +02:00
ggml-cuda CUDA: support for weight clamp in top-k norm (llama/16702) 2025-11-09 23:38:03 +02:00
ggml-hexagon Add experimental ggml-hexagon backend for the Hexagon NPU (llama/16547) 2025-11-09 23:38:03 +02:00
ggml-hip HIP: fix GPU_TARGETS (llama/16642) 2025-10-22 12:58:11 +03:00
ggml-metal metal : add `CONV_TRANSPOSE_2D` (llama/16542) 2025-10-22 12:58:11 +03:00
ggml-musa CUDA: faster tile FA, add oob checks, more HSs (llama/16492) 2025-10-15 09:29:17 +03:00
ggml-opencl opencl: fix warnings and clean up profiling (llama/16688) 2025-10-22 12:58:11 +03:00
ggml-rpc rpc : report actual free memory (llama/16616) 2025-10-22 12:58:11 +03:00
ggml-sycl sycl: add ROLL operation support (llama/16665) 2025-11-09 23:38:03 +02:00
ggml-vulkan vulkan: deduplicate Microsoft Direct3D12 devices (llama/16689) 2025-11-09 23:38:03 +02:00
ggml-webgpu ggml webgpu: profiling, CI updates, reworking of command submission (llama/16452) 2025-10-12 11:16:23 +03:00
ggml-zdnn zdnn: refactor codebase + add docs (llama/16178) 2025-09-29 15:18:09 +03:00
CMakeLists.txt Add experimental ggml-hexagon backend for the Hexagon NPU (llama/16547) 2025-11-09 23:38:03 +02:00
ggml-alloc.c ggml-alloc : make gallocr prefer chunks that allow memory reuse (llama/16788) 2025-11-09 23:38:03 +02:00
ggml-backend-impl.h rpc : add support for multiple devices (llama/16276) 2025-10-12 11:16:23 +03:00
ggml-backend-reg.cpp Add experimental ggml-hexagon backend for the Hexagon NPU (llama/16547) 2025-11-09 23:38:03 +02:00
ggml-backend.cpp llama: print memory breakdown on exit (llama/15860) 2025-09-29 15:18:10 +03:00
ggml-common.h llama : add gpt-oss (llama/15091) 2025-08-18 20:30:45 +03:00
ggml-impl.h ggml: add ggml_can_fuse_subgraph (llama/16662) 2025-10-22 12:58:11 +03:00
ggml-opt.cpp finetune: SGD optimizer, more CLI args (llama/13873) 2025-08-18 20:30:45 +03:00
ggml-quants.c ggml : fix uninitialized is_on_grid in quantize_row_iq3_xxs_impl (llama/15928) 2025-09-29 15:18:09 +03:00
ggml-quants.h llama : add gpt-oss (llama/15091) 2025-08-18 20:30:45 +03:00
ggml-threading.cpp ggml : build backends as libraries (llama/10256) 2024-11-20 21:00:08 +02:00
ggml-threading.h remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (llama/10797) 2024-12-18 12:52:16 +02:00
ggml.c ggml: add ggml_can_fuse_subgraph (llama/16662) 2025-10-22 12:58:11 +03:00
ggml.cpp ggml : Print backtrace on uncaught C++ exceptions (ggml/1232) 2025-05-29 09:56:26 +03:00
gguf.cpp ggml : prevent integer overflow in gguf tensor size calculation (llama/14595) 2025-07-12 19:23:56 +03:00