LocalAI/docs/content
Ettore Di Giacinto a026277ab9
feat(mlx-distributed): add new MLX-distributed backend (#8801)
* feat(mlx-distributed): add new MLX-distributed backend

Add new MLX distributed backend with support for both TCP and RDMA for
model sharding.

This implementation ties in the discovery implementation already in
place, and re-uses the same P2P mechanism for the TCP MLX-distributed
inferencing.

The Auto-parallel implementation is inspired by Exo's
ones (who have been added to acknowledgement for the great work!)

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* expose a CLI to facilitate backend starting

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* feat: make manual rank0 configurable via model configs

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Add missing features from mlx backend

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Apply suggestion from @mudler

Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
2026-03-09 17:29:32 +01:00
..
advanced feat: pass-by metadata to predict options (#8795) 2026-03-05 22:50:10 +01:00
features feat(mlx-distributed): add new MLX-distributed backend (#8801) 2026-03-09 17:29:32 +01:00
getting-started feat: Create comprehensive troubleshooting guide (M1 task) (#8856) 2026-03-08 21:58:32 +01:00
installation docs: add CDI driver config for NVIDIA GPU in containers (fix #8108) (#8677) 2026-02-28 08:42:53 +01:00
reference feat: Add --data-path CLI flag for persistent data separation (#8888) 2026-03-09 14:11:15 +01:00
_index.md chore(docs): center video 2025-12-08 16:59:11 +01:00
faq.md feat: docs revamp (#7313) 2025-11-19 22:21:20 +01:00
integrations.md chore(docs): Add Crush and VoxInput to the integrations (#7924) 2026-01-08 21:39:25 +01:00
overview.md feat: docs revamp (#7313) 2025-11-19 22:21:20 +01:00
whats-new.md feat: docs revamp (#7313) 2025-11-19 22:21:20 +01:00