LocalAI/core/schema
Ettore Di Giacinto a026277ab9
feat(mlx-distributed): add new MLX-distributed backend (#8801)
* feat(mlx-distributed): add new MLX-distributed backend

Add new MLX distributed backend with support for both TCP and RDMA for
model sharding.

This implementation ties in the discovery implementation already in
place, and re-uses the same P2P mechanism for the TCP MLX-distributed
inferencing.

The Auto-parallel implementation is inspired by Exo's
ones (who have been added to acknowledgement for the great work!)

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* expose a CLI to facilitate backend starting

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* feat: make manual rank0 configurable via model configs

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Add missing features from mlx backend

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Apply suggestion from @mudler

Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
2026-03-09 17:29:32 +01:00
..
agent_jobs.go feat(agent-jobs): add multimedia support (#7398) 2025-11-30 14:09:25 +01:00
anthropic.go chore: re-enable e2e tests, fixups anthropic API tools support (#8296) 2026-01-30 12:41:50 +01:00
anthropic_test.go chore: re-enable e2e tests, fixups anthropic API tools support (#8296) 2026-01-30 12:41:50 +01:00
backend.go feat: Add backend gallery (#5607) 2025-06-15 14:56:52 +02:00
elevenlabs.go feat(musicgen): add ace-step and UI interface (#8396) 2026-02-05 12:04:53 +01:00
gallery-model.schema.json [gallery] add JSON schema for gallery model specification (#7890) 2026-01-06 22:10:43 +01:00
jina.go fix(reranker): tests and top_n check fix #7212 (#7284) 2025-11-16 17:53:23 +01:00
localai.go feat(mlx-distributed): add new MLX-distributed backend (#8801) 2026-03-09 17:29:32 +01:00
message.go feat(api): support 'reasoning' api field (#7959) 2026-01-10 19:06:12 +01:00
message_test.go feat(llama.cpp): consolidate options and respect tokenizer template when enabled (#7120) 2025-11-07 21:23:50 +01:00
openai.go fix: rename json_verbose to verbose_json (#8627) 2026-02-23 17:57:06 +00:00
openresponses.go feat: add WebSocket mode support for the response api (#8676) 2026-03-06 10:36:59 +00:00
prediction.go feat: add support to logitbias and logprobs (#7283) 2025-11-16 13:27:36 +01:00
request.go feat: import models via URI (#7245) 2025-11-12 20:48:56 +01:00
schema_suite_test.go feat(llama.cpp): consolidate options and respect tokenizer template when enabled (#7120) 2025-11-07 21:23:50 +01:00
tokenize.go feat: Centralized Request Processing middleware (#3847) 2025-02-10 12:06:16 +01:00
transcription.go feat(whisperx): add whisperx backend for transcription with speaker diarization (#8299) 2026-02-02 16:33:12 +01:00