LocalAI/backend/python/vllm-omni
Ettore Di Giacinto 59108fbe32
feat: add distributed mode (#9124)
* feat: add distributed mode (experimental)

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* fix data races, mutexes, transactions

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* refactorings

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* fixups

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* fix events and tool stream in agent chat

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* use ginkgo

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* refactoring and consolidation

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* refactoring and consolidation

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* refactoring and consolidation

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* refactoring and consolidation

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* refactoring and consolidation

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* refactoring and consolidation

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* refactoring and consolidation

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* refactoring and consolidation

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* fix(cron): compute correctly time boundaries avoiding re-triggering

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* enhancements, refactorings

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* do not flood of healthy checks

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* do not list obvious backends as text backends

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* tests fixups

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* refactoring and consolidation

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Drop redundant healthcheck

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* enhancements, refactorings

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2026-03-30 00:47:27 +02:00
..
backend.py feat: add distributed mode (#9124) 2026-03-30 00:47:27 +02:00
install.sh feat(vllm-omni): add new backend (#8188) 2026-01-24 22:23:30 +01:00
Makefile feat(vllm-omni): add new backend (#8188) 2026-01-24 22:23:30 +01:00
requirements-after.txt feat(vllm-omni): add new backend (#8188) 2026-01-24 22:23:30 +01:00
requirements-cublas12-after.txt feat(vllm-omni): add new backend (#8188) 2026-01-24 22:23:30 +01:00
requirements-cublas12.txt feat(vllm-omni): add new backend (#8188) 2026-01-24 22:23:30 +01:00
requirements-hipblas.txt feat(vllm-omni): add new backend (#8188) 2026-01-24 22:23:30 +01:00
requirements.txt feat(vllm-omni): add new backend (#8188) 2026-01-24 22:23:30 +01:00
run.sh feat(vllm-omni): add new backend (#8188) 2026-01-24 22:23:30 +01:00
test.py feat(vllm-omni): add new backend (#8188) 2026-01-24 22:23:30 +01:00
test.sh feat(vllm-omni): add new backend (#8188) 2026-01-24 22:23:30 +01:00