LocalAI/pkg
Ettore Di Giacinto fc5b9ebfcc
feat(loader): enhance single active backend to support LRU eviction (#7535)
* feat(loader): refactor single active backend support to LRU

This changeset introduces LRU management of loaded backends. Users can
set now a maximum number of models to be loaded concurrently, and, when
setting LocalAI in single active backend mode we set LRU to 1 for
backward compatibility.

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* chore: add tests

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Update docs

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Fixups

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-12-12 12:28:38 +01:00
..
audio feat: Realtime API support reboot (#5392) 2025-05-25 22:25:05 +02:00
concurrency chore: update jobresult_test.go (#4124) 2024-11-12 08:52:18 +01:00
downloader fix(downloader): do not download model files if not necessary (#7492) 2025-12-09 19:08:10 +01:00
functions feat: import models via URI (#7245) 2025-11-12 20:48:56 +01:00
grpc feat: respect context and add request cancellation (#7187) 2025-11-09 18:19:19 +01:00
huggingface-api feat(hf-api): return files in nested directories (#7396) 2025-11-30 09:06:54 +01:00
langchain feat(llama.cpp): do not specify backends to autoload and add llama.cpp variants (#2232) 2024-05-04 17:56:12 +02:00
model feat(loader): enhance single active backend to support LRU eviction (#7535) 2025-12-12 12:28:38 +01:00
oci feat(ui): allow to cancel ops (#7264) 2025-11-13 18:41:47 +01:00
signals chore: update cogito and simplify MCP logics (#6413) 2025-10-09 12:36:45 +02:00
sound feat: Realtime API support reboot (#5392) 2025-05-25 22:25:05 +02:00
store chore: fix go.mod module (#2635) 2024-06-23 08:24:36 +00:00
system feat: add cuda13 images (#7404) 2025-12-02 14:24:35 +01:00
utils feat(rfdetr): add object detection API (#5923) 2025-07-27 22:02:51 +02:00
xio feat(ui): allow to cancel ops (#7264) 2025-11-13 18:41:47 +01:00
xsync chore: fix go.mod module (#2635) 2024-06-23 08:24:36 +00:00
xsysinfo feat: improve RAM estimation by using values from summary (#5525) 2025-06-05 19:16:26 +02:00