LocalAI/docs/content/docs/features
Ettore Di Giacinto 02cc8cbcaa
feat(llama.cpp): consolidate options and respect tokenizer template when enabled (#7120)
* feat(llama.cpp): expose env vars as options for consistency

This allows everything to be configured in the model's YAML file rather
than through global configuration

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* feat(llama.cpp): respect usetokenizertemplate and use llama.cpp templating system to process messages

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* WIP

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Detect whether a template exists if use tokenizer template is enabled

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Better recognition of chat

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Fixes to support tool calls while using templates from tokenizer

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Fixups

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Drop template guessing, fix passing tools to tokenizer

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Extract grammar and other options from chat template, add schema struct

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* WIP

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* WIP

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Automatically set use_jinja

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Cleanups, identify by default gguf models for chat

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Update docs

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-07 21:23:50 +01:00
_index.en.md docs: re-use original permalinks (#1610) 2024-01-19 19:23:58 +01:00
audio-to-text.md docs: re-use original permalinks (#1610) 2024-01-19 19:23:58 +01:00
backends.md Update backends.md 2025-07-24 16:18:25 +02:00
constrained_grammars.md docs: rewording 2024-05-25 20:18:25 +02:00
distributed_inferencing.md fix(docs): Improve responsiveness of tables (#5954) 2025-08-01 22:13:53 +02:00
embeddings.md docs: update links in advanced-usage and models documentation (#5994) 2025-08-08 10:23:42 +02:00
gpt-vision.md Point to LocalAI-examples repo for llava (#6241) 2025-09-09 16:40:55 +02:00
GPU-acceleration.md Fix Typos in Docs (#6204) 2025-09-05 22:11:21 +02:00
image-generation.md Update image-generation.md 2025-08-20 10:37:11 +02:00
mcp.md chore(deps): bump cogito (#6785) 2025-10-27 10:07:31 +01:00
model-gallery.md fix(tests): pin to branch for config used in tests (#4721) 2025-01-31 09:57:58 +01:00
object-detection.md feat(rfdetr): add object detection API (#5923) 2025-07-27 22:02:51 +02:00
openai-functions.md docs: update links in advanced-usage and models documentation (#5994) 2025-08-08 10:23:42 +02:00
reranker.md docs: add distributed inferencing docs 2024-05-25 16:12:08 +02:00
stores.md feat(stores): Vector store backend (#1795) 2024-03-22 21:14:04 +01:00
text-generation.md feat(llama.cpp): consolidate options and respect tokenizer template when enabled (#7120) 2025-11-07 21:23:50 +01:00
text-to-audio.md feat: Add backend gallery (#5607) 2025-06-15 14:56:52 +02:00