Gilad S
2979e5f95f
ggml: `gguf_init_from_callback` and `gguf_init_from_buffer` (llama/22341)
...
* ggml: implement `gguf_init_from_buffer`
* test: `gguf_init_from_buffer`
* fix: memory breakdown for a model loaded with `no_alloc` from a file is consistent with being loaded from a buffer
* fix: use `GGML_UNUSED`
Co-authored-by: Copilot <copilot@github.com>
* fix: remove `total_size` from `gguf_reader`
* fix: file offset calculation, rename `offset` to `data_offset`
Co-authored-by: Copilot <copilot@github.com>
* refactor: extract model loader bug fixes to another PR
* feat: add `gguf_init_from_callback`
* fix: always require a max expected size
* fix: change `gguf_reader_callback_t`'s `output` type to `void *`, change `max_expected_size` and offsets to `uint64_t`
* fix: harden against offset overflow in buffer read
* fix: remove seek behavior from the callback
* feat: `max_chunk_read == 0` means `SIZE_MAX`
* fix: seeking in a gguf file with no tensors
---------
Co-authored-by: Copilot <copilot@github.com>
2026-05-25 12:44:04 +03:00
Johannes Gäßler
3987857d2d
llama: fix llama-model-saver (llama/20503)
...
* llama : add fd-based model loading via llama_model_load_from_fd
* llama : address review feedback for fd-based model loading
* llama : use FILE pointer instead of fd in public API
* llama : use FILE pointer consistently, address review feedback
* fixup
* fix tensor names
* fix llama-model-saver
* roundtrip tests
* fixup
* refactor tests
* fix prints
* fix model saving
* fix CI, disable Chameleon
* print seed
---------
Co-authored-by: Siddhesh2377 <siddheshsonar2377@gmail.com>
2026-03-29 15:04:36 +03:00
Georgi Gerganov
9453b4b9be
gguf : sync (ggml/0)
2026-02-27 20:57:58 +02:00
Miaoqian Lin
48b18f9eb8
ggml : prevent integer overflow in gguf tensor size calculation (llama/14595)
2025-07-12 19:23:56 +03:00
Sigbjørn Skjæret
99764f5767
ggml : do not output unprintable characters on GGUF load failure (llama/14381)
2025-07-01 17:54:53 +03:00
Johannes Gäßler
9589645e72
gguf: fix failure on version == 0 (llama/13956)
2025-06-10 12:40:33 +03:00
Aaron Teo
20f913d119
ggml: check if non-native endian model is being loaded (llama/13943)
...
* gguf: prevent non-native endian models from being loaded
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
* gguf: update error message
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
* gguf: make the non-native endian check more verbose
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
* ggml: move ggml_assert location
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
* ggml: reword the endianness check error message
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
---------
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-10 12:40:33 +03:00
Diego Devesa
0c76acd08a
gguf : use ggml log system (llama/13571)
...
* gguf : use ggml log system
* llama : remove unnecessary new lines in exception messages
2025-05-19 14:58:39 +03:00
R0CKSTAR
6cb2b86581
Fix clang warning in gguf_check_reserved_keys (llama/12686)
...
* Fix clang warning in gguf_check_reserved_keys
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
* Fix typo
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
---------
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
2025-04-02 15:51:57 +03:00
Georgi Gerganov
90171055f3
cmake : add sanitizer flags for llama.cpp (llama/11279)
...
* cmake : add sanitizer flags for llama.cpp
ggml-ci
* tests : fix compile warnings
ggml-ci
* cmake : move sanitizer flags to llama_add_compile_flags
ggml-ci
* cmake : move llama.cpp compile flags to top level lists
ggml-ci
* cmake : apply only sanitizer flags at top level
ggml-ci
* tests : fix gguf context use in same_tensor_data
* gguf-test: tensor data comparison
* dummy : trigger ggml-ci
* unicode : silence gcc warnings
ggml-ci
* ci : use sanitizer builds only in Debug mode
ggml-ci
* cmake : add status messages [no ci]
---------
Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
2025-02-03 22:00:57 +02:00
Johannes Gäßler
d5ef1737d8
GGUF: C++ refactor, backend support, misc fixes (skip) (llama/11030)
...
ggml-ci
2025-01-14 10:38:01 +02:00