LocalAI

mirror of https://github.com/mudler/LocalAI synced 2026-05-24 09:28:23 +00:00

History

Ettore Di Giacinto 02cc8cbcaa feat(llama.cpp): consolidate options and respect tokenizer template when enabled (#7120 ) * feat(llama.cpp): expose env vars as options for consistency This allows to configure everything in the YAML file of the model rather than have global configurations Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(llama.cpp): respect usetokenizertemplate and use llama.cpp templating system to process messages Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * WIP Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Detect template exists if use tokenizer template is enabled Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Better recognization of chat Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixes to support tool calls while using templates from tokenizer Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop template guessing, fix passing tools to tokenizer Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Extract grammar and other options from chat template, add schema struct Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * WIP Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * WIP Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Automatically set use_jinja Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Cleanups, identify by default gguf models for chat Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>		2025-11-07 21:23:50 +01:00
..
_index.en.md	docs: re-use original permalinks (#1610 )	2024-01-19 19:23:58 +01:00
audio-to-text.md	docs: re-use original permalinks (#1610 )	2024-01-19 19:23:58 +01:00
backends.md	Update backends.md	2025-07-24 16:18:25 +02:00
constrained_grammars.md	docs: rewording	2024-05-25 20:18:25 +02:00
distributed_inferencing.md	fix(docs): Improve responsiveness of tables (#5954 )	2025-08-01 22:13:53 +02:00
embeddings.md	docs: update links in advanced-usage and models documentation (#5994 )	2025-08-08 10:23:42 +02:00
gpt-vision.md	Point to LocalAI-examples repo for llava (#6241 )	2025-09-09 16:40:55 +02:00
GPU-acceleration.md	Fix Typos in Docs (#6204 )	2025-09-05 22:11:21 +02:00
image-generation.md	Update image-generation.md	2025-08-20 10:37:11 +02:00
mcp.md	chore(deps): bump cogito (#6785 )	2025-10-27 10:07:31 +01:00
model-gallery.md	fix(tests): pin to branch for config used in tests (#4721 )	2025-01-31 09:57:58 +01:00
object-detection.md	feat(rfdetr): add object detection API (#5923 )	2025-07-27 22:02:51 +02:00
openai-functions.md	docs: update links in advanced-usage and models documentation (#5994 )	2025-08-08 10:23:42 +02:00
reranker.md	docs: add distributed inferencing docs	2024-05-25 16:12:08 +02:00
stores.md	feat(stores): Vector store backend (#1795 )	2024-03-22 21:14:04 +01:00
text-generation.md	feat(llama.cpp): consolidate options and respect tokenizer template when enabled (#7120 )	2025-11-07 21:23:50 +01:00
text-to-audio.md	feat: Add backend gallery (#5607 )	2025-06-15 14:56:52 +02:00