LocalAI/docs/content/advanced
LocalAI [bot] f73a158153
docs: Document GPU auto-fit mode limitations and trade-offs (closes #8562) (#8954)
* docs: Add documentation about GPU auto-fit mode limitations (closes #8562)

- Document the default gpu_layers behavior (9999999) that disables auto-fit
- Explain the trade-off between auto-fit and VRAM threshold unloading
- Add recommendations for users who want to enable gpu_layers: -1
- Note known issues with tensor_buft_override buffer errors
- Link to issue #8562 for future improvements

Signed-off-by: team-coding-agent-1 <team-coding-agent-1@localai.dev>

* Apply suggestion from @mudler

Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>

---------

Signed-off-by: team-coding-agent-1 <team-coding-agent-1@localai.dev>
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
Co-authored-by: team-coding-agent-1 <team-coding-agent-1@localai.dev>
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
2026-03-12 13:35:31 +01:00
_index.en.md | feat: docs revamp (#7313) | 2025-11-19 22:21:20 +01:00
_index.md | feat: Expand section index pages with comprehensive navigation (M7) (#8929) | 2026-03-10 07:34:44 +01:00
advanced-usage.md | feat: docs revamp (#7313) | 2025-11-19 22:21:20 +01:00
fine-tuning.md | feat: docs revamp (#7313) | 2025-11-19 22:21:20 +01:00
model-configuration.md | docs: Document GPU auto-fit mode limitations and trade-offs (closes #8562) (#8954) | 2026-03-12 13:35:31 +01:00
reverse-proxy-tls.md | docs: add TLS reverse proxy configuration guide (#8673) | 2026-02-28 23:02:17 +01:00
vram-management.md | feat: disable force eviction (#7725) | 2025-12-25 14:26:18 +01:00