LocalAI

mirror of https://github.com/mudler/LocalAI synced 2026-05-24 09:28:23 +00:00

History

Richard Palethorpe c68818a62e fix(llama-cpp): terminate tensor_buft_overrides with sentinel (#9919 ) llama.cpp's model loader asserts back().pattern == nullptr on params.tensor_buft_overrides (and on params.kv_overrides.back().key[0] == 0) before binding them into llama_model_params. PR #8560 attempted to satisfy llama_params_fit's placeholder requirement by pre-filling params.tensor_buft_overrides up to llama_max_tensor_buft_overrides() before the option-parse loop. Any subsequent push_back from override_tensor / draft_cpu_moe / draft_n_cpu_moe / draft_override_tensor then appended real entries after the placeholders, leaving back() with a real pattern and tripping the assert. The draft override vector likewise had no terminator at all. Mirror upstream common/arg.cpp:645-658 instead: real entries are pushed during option parsing, and after parsing we pad the main vector up to ntbo (placeholders land at the end, so back() is always nullptr) and append a single {nullptr, nullptr} to the draft vector when it is non-empty. The existing kv_overrides terminator block already matches upstream and stays. Verified against ggml-org/llama.cpp@5cbaa5e: only tensor_buft_overrides (main + draft) and kv_overrides are sentinel-terminated common_params fields; everything else is size-driven std::vector. Assisted-by: claude-code:claude-opus-4-7 Signed-off-by: Richard Palethorpe <io@richiejp.com>		2026-05-21 12:55:06 +02:00
..
ds4	chore: ⬆️ Update antirez/ds4 to `2606543be7a8c125a32cee37f5d1d85dc78f2fcf` (#9909 )	2026-05-20 21:22:26 +00:00
grpc	fix: speedup `git submodule update` with `--single-branch` (#2847 )	2024-07-13 22:32:25 +02:00
ik-llama-cpp	chore: ⬆️ Update ikawrakow/ik_llama.cpp to `11a1fea9e291f12ce2c803a9d7812c30ca806bcf` (#9914 )	2026-05-20 22:04:06 +00:00
llama-cpp	fix(llama-cpp): terminate tensor_buft_overrides with sentinel (#9919 )	2026-05-21 12:55:06 +02:00
turboquant	chore: ⬆️ Update TheTom/llama-cpp-turboquant to `5aeb2fdbe26cd4c534c6fa15de73cb5749bd0403` (#9740 )	2026-05-13 21:58:33 +02:00