| Name | Last commit | Date |
| --- | --- | --- |
| amx | ggml-amx : fix ggml_amx_init() on generic Linux (llama/16049) | 2025-09-20 13:46:39 +03:00 |
| arch | ggml-cpu : fix typo in gemm comments [no ci] (llama/16189) | 2025-09-29 15:18:09 +03:00 |
| cmake | ggml : build backends as libraries (llama/10256) | 2024-11-20 21:00:08 +02:00 |
| kleidiai | kleidiai: fix GGML_ASSERT(*cur_backend_id != -1) failed (llama/15614) | 2025-09-20 13:45:27 +03:00 |
| llamafile | llamafile: PowerPC Sgemm Optimization (llama/15558) | 2025-09-20 13:42:42 +03:00 |
| CMakeLists.txt | ggml-cpu : add check for ARM MATMUL_INT8/i8mm support (llama/15922) | 2025-09-20 13:45:28 +03:00 |
| arch-fallback.h | ggml-cpu: Support Q5_0 and Q5_1 on s390x (llama/15486) | 2025-09-20 13:42:39 +03:00 |
| binary-ops.cpp | cpu: de-duplicate some of the operators and refactor (ggml/1144) | 2025-03-31 14:56:53 +03:00 |
| binary-ops.h | cpu: de-duplicate some of the operators and refactor (ggml/1144) | 2025-03-31 14:56:53 +03:00 |
| common.h | ggml : refactor forward_dup for cpu backend (llama/16062) | 2025-09-20 13:46:39 +03:00 |
| ggml-cpu-impl.h | ggml-cpu: clean up s390x SIMD (llama/15855) | 2025-09-20 13:42:51 +03:00 |
| ggml-cpu.c | ggml-cpu: Respect cpumask settings (llama/16164) | 2025-09-29 15:18:09 +03:00 |
| ggml-cpu.cpp | rename optimize_graph to graph_optimize (llama/16082) | 2025-09-20 13:46:39 +03:00 |
| hbm.cpp | ggml-cpu : split arch-specific implementations (llama/13892) | 2025-06-10 12:40:33 +03:00 |
| hbm.h | ggml-cpu : split arch-specific implementations (llama/13892) | 2025-06-10 12:40:33 +03:00 |
| ops.cpp | ggml : implement set_rows with i32 index (llama/16159) | 2025-09-29 15:18:09 +03:00 |
| ops.h | ggml: add ops for WAN video model (cuda && cpu) (llama/15669) | 2025-09-20 13:42:49 +03:00 |
| quants.c | llama : add gpt-oss (llama/15091) | 2025-08-18 20:30:45 +03:00 |
| quants.h | llama : add gpt-oss (llama/15091) | 2025-08-18 20:30:45 +03:00 |
| repack.cpp | ggml : repack block_iq4_nlx8 (llama/14904) | 2025-08-18 20:30:45 +03:00 |
| repack.h | ggml : repack block_iq4_nlx8 (llama/14904) | 2025-08-18 20:30:45 +03:00 |
| simd-mappings.h | ggml-cpu: drop support for nnpa intrinsics (llama/15821) | 2025-09-20 13:42:50 +03:00 |
| traits.cpp | ggml : fix fallback to CPU for ununsupported ops (llama/15118) | 2025-08-18 20:30:45 +03:00 |
| traits.h | ggml : fix fallback to CPU for ununsupported ops (llama/15118) | 2025-08-18 20:30:45 +03:00 |
| unary-ops.cpp | cpu: de-duplicate some of the operators and refactor (ggml/1144) | 2025-03-31 14:56:53 +03:00 |
| unary-ops.h | cpu: de-duplicate some of the operators and refactor (ggml/1144) | 2025-03-31 14:56:53 +03:00 |
| vec.cpp | ggml-cpu : optimize RVV kernels (llama/15720) | 2025-09-20 13:42:48 +03:00 |
| vec.h | ggml-cpu : optimize RVV kernels (llama/15720) | 2025-09-20 13:42:48 +03:00 |