mirror of
https://github.com/mudler/LocalAI
synced 2026-04-21 13:27:21 +00:00
* docs: Add documentation about GPU auto-fit mode limitations (closes #8562) - Document the default gpu_layers behavior (9999999) that disables auto-fit - Explain the trade-off between auto-fit and VRAM threshold unloading - Add recommendations for users who want to enable gpu_layers: -1 - Note known issues with tensor_buft_override buffer errors - Link to issue #8562 for future improvements Signed-off-by: team-coding-agent-1 <team-coding-agent-1@localai.dev> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: team-coding-agent-1 <team-coding-agent-1@localai.dev> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: team-coding-agent-1 <team-coding-agent-1@localai.dev> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com> |
||
|---|---|---|
| .. | ||
| _index.en.md | ||
| _index.md | ||
| advanced-usage.md | ||
| fine-tuning.md | ||
| model-configuration.md | ||
| reverse-proxy-tls.md | ||
| vram-management.md | ||