LocalAI/core/config
Ettore Di Giacinto a026277ab9
feat(mlx-distributed): add new MLX-distributed backend (#8801)
* feat(mlx-distributed): add new MLX-distributed backend

Add new MLX distributed backend with support for both TCP and RDMA for
model sharding.

This implementation ties in the discovery implementation already in
place, and re-uses the same P2P mechanism for the TCP MLX-distributed
inferencing.

The Auto-parallel implementation is inspired by Exo's
ones (who have been added to acknowledgement for the great work!)

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* expose a CLI to facilitate backend starting

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* feat: make manual rank0 configurable via model configs

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Add missing features from mlx backend

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Apply suggestion from @mudler

Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
2026-03-09 17:29:32 +01:00
..
application_config.go feat(mlx-distributed): add new MLX-distributed backend (#8801) 2026-03-09 17:29:32 +01:00
application_config_test.go feat(watchdog): add Memory resource reclaimer (#7583) 2025-12-16 09:15:18 +01:00
config_suite_test.go dependencies(grpcio): bump to fix CI issues (#2362) 2024-05-21 14:33:47 +02:00
gallery.go refactor: gallery inconsistencies (#2647) 2024-06-24 17:32:12 +02:00
gguf.go feat(functions): add peg-based parsing and allow backends to return tool calls directly (#8838) 2026-03-08 22:21:57 +01:00
guesser.go chore(refactor): move logging to common package based on slog (#7668) 2025-12-21 19:33:13 +01:00
model_config.go fix: Add vllm-omni backend to video generation model detection (#8659) (#8781) 2026-03-05 01:04:47 +01:00
model_config_filter.go fix(config): make syncKnownUsecasesFromString idempotent (#7493) 2025-12-09 21:08:22 +01:00
model_config_loader.go fix: allow reranking models configured with known_usecases (#8681) 2026-03-02 19:00:18 +01:00
model_config_test.go fix(realtime): Use user provided voice and allow pipeline models to have no backend (#8415) 2026-02-11 14:18:05 +01:00
model_test.go fix(realtime): Use user provided voice and allow pipeline models to have no backend (#8415) 2026-02-11 14:18:05 +01:00
runtime_settings.go feat: add agentic management (#8820) 2026-03-07 00:03:08 +01:00