LocalAI/core/http
Ettore Di Giacinto c844b7ac58
feat: disable force eviction (#7725)
* feat: allow to set forcing backends eviction while requests are in flight

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* feat: try to make the request sit and retry if eviction couldn't be done

Otherwise calls that in order to pass would need to shutdown other
backends would just fail.

In this way instead we make the request sit and retry eviction until it
succeeds. The thresholds can be configured by the user.

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* add tests

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* expose settings to CLI

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Update docs

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-12-25 14:26:18 +01:00
..
endpoints feat: disable force eviction (#7725) 2025-12-25 14:26:18 +01:00
middleware chore(refactor): move logging to common package based on slog (#7668) 2025-12-21 19:33:13 +01:00
routes chore(refactor): move logging to common package based on slog (#7668) 2025-12-21 19:33:13 +01:00
static fix(chat-ui): model selection toggle and new chat (#7574) 2025-12-14 22:29:11 +01:00
views feat: disable force eviction (#7725) 2025-12-25 14:26:18 +01:00
app.go chore(logging): be consistent and do not emit logs from echo (#7710) 2025-12-24 09:22:27 +01:00
app_test.go chore(refactor): move logging to common package based on slog (#7668) 2025-12-21 19:33:13 +01:00
explorer.go chore(refactor): move logging to common package based on slog (#7668) 2025-12-21 19:33:13 +01:00
http_suite_test.go fix(llama-cpp): correctly calculate embeddings (#6259) 2025-09-13 23:11:54 +02:00
openai_mapping_test.go feat(api): OpenAI video create enpoint integration (#6777) 2025-10-27 10:06:57 +01:00
openai_videos_test.go feat: migrate to echo and enable cancellation of non-streaming requests (#7270) 2025-11-14 22:57:53 +01:00
render.go feat: migrate to echo and enable cancellation of non-streaming requests (#7270) 2025-11-14 22:57:53 +01:00