LocalAI/core/config
Ettore Di Giacinto fc5b9ebfcc
feat(loader): enhance single active backend to support LRU eviction (#7535)
* feat(loader): refactor single active backend support to LRU

This changeset introduces LRU management of loaded backends. Users can
set now a maximum number of models to be loaded concurrently, and, when
setting LocalAI in single active backend mode we set LRU to 1 for
backward compatibility.

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* chore: add tests

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Update docs

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Fixups

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-12-12 12:28:38 +01:00
..
application_config.go feat(loader): enhance single active backend to support LRU eviction (#7535) 2025-12-12 12:28:38 +01:00
config_suite_test.go dependencies(grpcio): bump to fix CI issues (#2362) 2024-05-21 14:33:47 +02:00
gallery.go refactor: gallery inconsistencies (#2647) 2024-06-24 17:32:12 +02:00
gguf.go feat(llama.cpp): consolidate options and respect tokenizer template when enabled (#7120) 2025-11-07 21:23:50 +01:00
guesser.go feat(backends): add system backend, refactor (#6059) 2025-08-14 19:38:26 +02:00
model_config.go fix(config): make syncKnownUsecasesFromString idempotent (#7493) 2025-12-09 21:08:22 +01:00
model_config_filter.go fix(config): make syncKnownUsecasesFromString idempotent (#7493) 2025-12-09 21:08:22 +01:00
model_config_loader.go fix(downloader): do not download model files if not necessary (#7492) 2025-12-09 19:08:10 +01:00
model_config_test.go feat(importer): unify importing code with CLI (#7299) 2025-11-19 20:52:11 +01:00
model_test.go feat: Add Agentic MCP support with a new chat/completion endpoint (#6381) 2025-10-05 17:51:41 +02:00