mirror of https://github.com/mudler/LocalAI, synced 2026-05-24 09:28:23 +00:00
* feat: Add backend gallery. This PR adds support for managing backends in the same way as models. A backend gallery is now available for installing and removing extra backends. It can be configured like a model gallery, and API calls allow backends to be installed and removed at runtime as well as during the startup phase of LocalAI.
* Add backends docs
* wip: Backend Dockerfile for python backends
* feat: drop extras images, build python backends separately
* fixup on all backends
* test CI
* Tweaks
* Drop old backends leftovers
* Fixup CI
* Move dockerfile upper
* Fix proto
* Feature dropped for consistency - we prefer model galleries
* Add missing packages in the build image
* exllama is only available on cublas
* pin torch on chatterbox
* Fixups to index
* CI
* Debug CI
* Install accelerator deps
* Add target arch
* Add cuda minor version
* Use self-hosted runners
* ci: use quay for test images
* fixups for vllm and chatterbox
* Small fixups on CI
* chatterbox is only available for nvidia
* Simplify CI builds
* Adapt test, use qwen3
* chore(model gallery): add jina-reranker-v1-tiny-en-gguf
* fix(gguf-parser): recover from potential panics that can happen while reading ggufs with gguf-parser
* Use reranker from llama.cpp in AIO images
* Limit concurrent jobs

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>

| | | |
|---|---|---|
| alpaca.yaml | ||
| arch-function.yaml | ||
| cerbero.yaml | ||
| chatml-hercules.yaml | ||
| chatml.yaml | ||
| codellama.yaml | ||
| command-r.yaml | ||
| deephermes.yaml | ||
| deepseek-r1.yaml | ||
| deepseek.yaml | ||
| dreamshaper.yaml | ||
| falcon3.yaml | ||
| flux-ggml.yaml | ||
| flux.yaml | ||
| gemma.yaml | ||
| granite.yaml | ||
| granite3-2.yaml | ||
| hermes-2-pro-mistral.yaml | ||
| hermes-vllm.yaml | ||
| index.yaml | ||
| llama3-instruct.yaml | ||
| llama3.1-instruct-grammar.yaml | ||
| llama3.1-instruct.yaml | ||
| llama3.1-reflective.yaml | ||
| llama3.2-fcall.yaml | ||
| llama3.2-quantized.yaml | ||
| llava.yaml | ||
| mathstral.yaml | ||
| mistral-0.3.yaml | ||
| moondream.yaml | ||
| mudler.yaml | ||
| noromaid.yaml | ||
| openvino.yaml | ||
| parler-tts.yaml | ||
| phi-2-chat.yaml | ||
| phi-2-orange.yaml | ||
| phi-3-chat.yaml | ||
| phi-3-vision.yaml | ||
| phi-4-chat-fcall.yaml | ||
| phi-4-chat.yaml | ||
| piper.yaml | ||
| qwen-fcall.yaml | ||
| qwen3-openbuddy.yaml | ||
| qwen3.yaml | ||
| rerankers.yaml | ||
| rwkv.yaml | ||
| sd-ggml.yaml | ||
| sentencetransformers.yaml | ||
| smolvlm.yaml | ||
| stablediffusion3.yaml | ||
| tuluv2.yaml | ||
| vicuna-chat.yaml | ||
| virtual.yaml | ||
| vllm.yaml | ||
| whisper-base.yaml | ||
| wizardlm2.yaml | ||