| .. |
|
amx
|
ggml : fix unaligned access in AMX code (llama/16315)
|
2025-10-12 11:16:23 +03:00 |
|
arch
|
ggml-cpu: use LUT for converting e8->f32 scales on x86 (llama/19288)
|
2026-02-08 09:29:10 +02:00 |
|
cmake
|
…
|
|
|
kleidiai
|
kleidiai: add and integrate SVE 256-bit vector-length kernel (llama/18458)
|
2025-12-31 17:52:09 +02:00 |
|
llamafile
|
ggml-cpu: Enable FP16 MMA kernels on PPC (llama/19060)
|
2026-01-30 15:56:40 +02:00 |
|
spacemit
|
ggml : fix SpaceMit IME array out-of-bounds in task assignment (llama/16629)
|
2025-10-22 12:58:11 +03:00 |
|
CMakeLists.txt
|
kleidiai: add and integrate SVE 256-bit vector-length kernel (llama/18458)
|
2025-12-31 17:52:09 +02:00 |
|
arch-fallback.h
|
ggml-cpu: aarm64: q6_K repack gemm and gemv (and generic) implementations (i8mm) #18860 (llama/18888)
|
2026-01-30 15:56:40 +02:00 |
|
binary-ops.cpp
|
cpu: de-duplicate some of the operators and refactor (ggml/1144)
|
2025-03-31 14:56:53 +03:00 |
|
binary-ops.h
|
cpu: de-duplicate some of the operators and refactor (ggml/1144)
|
2025-03-31 14:56:53 +03:00 |
|
common.h
|
ggml-cpu: Use tiled FA for prompt-processing (llama/19012)
|
2026-01-30 15:56:40 +02:00 |
|
ggml-cpu-impl.h
|
ggml-cpu: FA split across kv for faster TG (llama/19209)
|
2026-02-08 09:29:10 +02:00 |
|
ggml-cpu.c
|
ggml-cpu: use LUT for converting e8->f32 scales on x86 (llama/19288)
|
2026-02-08 09:29:10 +02:00 |
|
ggml-cpu.cpp
|
ggml-cpu: FA split across kv for faster TG (llama/19209)
|
2026-02-08 09:29:10 +02:00 |
|
hbm.cpp
|
ggml-cpu : split arch-specific implementations (llama/13892)
|
2025-06-10 12:40:33 +03:00 |
|
hbm.h
|
ggml-cpu : split arch-specific implementations (llama/13892)
|
2025-06-10 12:40:33 +03:00 |
|
ops.cpp
|
ggml-cpu: FA split across kv for faster TG (llama/19209)
|
2026-02-08 09:29:10 +02:00 |
|
ops.h
|
ggml : add ggml_top_k (llama/17365)
|
2025-12-12 17:53:08 +02:00 |
|
quants.c
|
llama : add gpt-oss (llama/15091)
|
2025-08-18 20:30:45 +03:00 |
|
quants.h
|
llama : add gpt-oss (llama/15091)
|
2025-08-18 20:30:45 +03:00 |
|
repack.cpp
|
ggml-cpu: aarm64: q6_K repack gemm and gemv (and generic) implementations (i8mm) #18860 (llama/18888)
|
2026-01-30 15:56:40 +02:00 |
|
repack.h
|
ggml-cpu: aarm64: q6_K repack gemm and gemv (and generic) implementations (i8mm) #18860 (llama/18888)
|
2026-01-30 15:56:40 +02:00 |
|
simd-mappings.h
|
ggml-cpu: use LUT for converting e8->f32 scales on x86 (llama/19288)
|
2026-02-08 09:29:10 +02:00 |
|
traits.cpp
|
ggml : fix fallback to CPU for ununsupported ops (llama/15118)
|
2025-08-18 20:30:45 +03:00 |
|
traits.h
|
ggml : fix fallback to CPU for ununsupported ops (llama/15118)
|
2025-08-18 20:30:45 +03:00 |
|
unary-ops.cpp
|
ggml : add ops SOFTPLUS, EXPM1, TRI, SOLVE_TRI, CUMSUM (llama/17063)
|
2025-11-17 21:05:46 +02:00 |
|
unary-ops.h
|
ggml : add ops SOFTPLUS, EXPM1, TRI, SOLVE_TRI, CUMSUM (llama/17063)
|
2025-11-17 21:05:46 +02:00 |
|
vec.cpp
|
ggml-cpu: optimize ggml_vec_dot_bf16 for Power9 (llama/18837)
|
2026-01-30 15:56:40 +02:00 |
|
vec.h
|
ggml-cpu: extend support for RVV floating-point kernels (llama/17318)
|
2025-12-31 17:52:09 +02:00 |