mirror of
https://github.com/mudler/LocalAI
synced 2026-05-24 09:28:23 +00:00
* feat(mlx-distributed): add new MLX-distributed backend Add new MLX distributed backend with support for both TCP and RDMA for model sharding. This implementation ties in the discovery implementation already in place, and re-uses the same P2P mechanism for the TCP MLX-distributed inferencing. The Auto-parallel implementation is inspired by Exo's ones (who have been added to acknowledgement for the great work!) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * expose a CLI to facilitate backend starting Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: make manual rank0 configurable via model configs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add missing features from mlx backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> |
||
|---|---|---|
| .. | ||
| disabled | ||
| backend.yml | ||
| backend_build.yml | ||
| backend_build_darwin.yml | ||
| backend_pr.yml | ||
| build-test.yaml | ||
| bump_deps.yaml | ||
| bump_docs.yaml | ||
| checksum_checker.yaml | ||
| deploy-explorer.yaml | ||
| gallery-agent.yaml | ||
| generate_grpc_cache.yaml | ||
| generate_intel_image.yaml | ||
| image-pr.yml | ||
| image.yml | ||
| image_build.yml | ||
| notify-releases.yaml | ||
| release.yaml | ||
| secscan.yaml | ||
| stalebot.yml | ||
| test-extra.yml | ||
| test.yml | ||
| tests-e2e.yml | ||
| update_swagger.yaml | ||
| yaml-check.yml | ||