LocalAI

mirror of https://github.com/mudler/LocalAI synced 2026-05-24 09:28:23 +00:00

History

Ettore Di Giacinto a026277ab9 feat(mlx-distributed): add new MLX-distributed backend (#8801 ) * feat(mlx-distributed): add new MLX-distributed backend Add new MLX distributed backend with support for both TCP and RDMA for model sharding. This implementation ties in the discovery implementation already in place, and re-uses the same P2P mechanism for the TCP MLX-distributed inferencing. The Auto-parallel implementation is inspired by Exo's ones (who have been added to acknowledgement for the great work!) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * expose a CLI to facilitate backend starting Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: make manual rank0 configurable via model configs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add missing features from mlx backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>		2026-03-09 17:29:32 +01:00
..
_index.en.md	feat: Add documentation for undocumented API endpoints (#8852 )	2026-03-08 17:59:33 +01:00
agents.md	feat: Add --data-path CLI flag for persistent data separation (#8888 )	2026-03-09 14:11:15 +01:00
audio-to-text.md	fix: rename json_verbose to verbose_json (#8627 )	2026-02-23 17:57:06 +00:00
backend-monitor.md	feat: Add documentation for undocumented API endpoints (#8852 )	2026-03-08 17:59:33 +01:00
backends.md	feat(voxtral): add voxtral backend (#8451 )	2026-02-09 09:12:05 +01:00
constrained_grammars.md	feat: docs revamp (#7313 )	2025-11-19 22:21:20 +01:00
distributed_inferencing.md	feat: docs revamp (#7313 )	2025-11-19 22:21:20 +01:00
embeddings.md	feat: docs revamp (#7313 )	2025-11-19 22:21:20 +01:00
gpt-vision.md	feat: docs revamp (#7313 )	2025-11-19 22:21:20 +01:00
GPU-acceleration.md	docs: expand GPU acceleration guide with L4T, multi-GPU, monitoring, and troubleshooting (#8858 )	2026-03-08 21:59:57 +01:00
image-generation.md	feat: docs revamp (#7313 )	2025-11-19 22:21:20 +01:00
mcp.md	feat: Add LOCALAI_DISABLE_MCP environment variable to disable MCP support (#8816 )	2026-03-06 20:44:03 +01:00
mlx-distributed.md	feat(mlx-distributed): add new MLX-distributed backend (#8801 )	2026-03-09 17:29:32 +01:00
model-gallery.md	feat(ui): add model size estimation (#8684 )	2026-02-28 23:03:47 +01:00
object-detection.md	feat: docs revamp (#7313 )	2025-11-19 22:21:20 +01:00
openai-functions.md	feat: docs revamp (#7313 )	2025-11-19 22:21:20 +01:00
openai-realtime.md	feat(realtime): Add audio conversations (#6245 )	2026-01-29 08:44:53 +01:00
p2p.md	feat: Add documentation for undocumented API endpoints (#8852 )	2026-03-08 17:59:33 +01:00
reranker.md	feat: docs revamp (#7313 )	2025-11-19 22:21:20 +01:00
runtime-settings.md	feat: add agentic management (#8820 )	2026-03-07 00:03:08 +01:00
sound-generation.md	feat: Add documentation for undocumented API endpoints (#8852 )	2026-03-08 17:59:33 +01:00
stores.md	feat: docs revamp (#7313 )	2025-11-19 22:21:20 +01:00
text-generation.md	feat: pass-by metadata to predict options (#8795 )	2026-03-05 22:50:10 +01:00
text-to-audio.md	feat(qwen-tts): Support using multiple voices (#8757 )	2026-03-04 09:47:21 +01:00
video-generation.md	feat: Add documentation for undocumented API endpoints (#8852 )	2026-03-08 17:59:33 +01:00
voice-activity-detection.md	feat: Add documentation for undocumented API endpoints (#8852 )	2026-03-08 17:59:33 +01:00