LocalAI/core/http/endpoints
Ettore Di Giacinto a026277ab9
feat(mlx-distributed): add new MLX-distributed backend (#8801)
* feat(mlx-distributed): add new MLX-distributed backend

Add new MLX distributed backend with support for both TCP and RDMA for
model sharding.

This implementation ties in the discovery implementation already in
place, and re-uses the same P2P mechanism for the TCP MLX-distributed
inferencing.

The Auto-parallel implementation is inspired by Exo's
ones (who have been added to acknowledgement for the great work!)

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* expose a CLI to facilitate backend starting

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* feat: make manual rank0 configurable via model configs

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Add missing features from mlx backend

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Apply suggestion from @mudler

Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
2026-03-09 17:29:32 +01:00
..
anthropic feat(functions): add peg-based parsing and allow backends to return tool calls directly (#8838) 2026-03-08 22:21:57 +01:00
elevenlabs feat(audio): set audio content type (#8416) 2026-02-05 19:14:12 +01:00
explorer feat: migrate to echo and enable cancellation of non-streaming requests (#7270) 2025-11-14 22:57:53 +01:00
jina chore(refactor): move logging to common package based on slog (#7668) 2025-12-21 19:33:13 +01:00
localai feat(mlx-distributed): add new MLX-distributed backend (#8801) 2026-03-09 17:29:32 +01:00
mcp chore(refactor): move logging to common package based on slog (#7668) 2025-12-21 19:33:13 +01:00
openai feat(functions): add peg-based parsing and allow backends to return tool calls directly (#8838) 2026-03-08 22:21:57 +01:00
openresponses feat(functions): add peg-based parsing and allow backends to return tool calls directly (#8838) 2026-03-08 22:21:57 +01:00