LocalAI

mirror of https://github.com/mudler/LocalAI synced 2026-05-24 09:28:23 +00:00

History

Ettore Di Giacinto 2d64269763 feat: Add backend gallery (#5607 ) * feat: Add backend gallery This PR add support to manage backends as similar to models. There is now available a backend gallery which can be used to install and remove extra backends. The backend gallery can be configured similarly as a model gallery, and API calls allows to install and remove new backends in runtime, and as well during the startup phase of LocalAI. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add backends docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * wip: Backend Dockerfile for python backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: drop extras images, build python backends separately Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixup on all backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * test CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Tweaks Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop old backends leftovers Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixup CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Move dockerfile upper Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fix proto Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Feature dropped for consistency - we prefer model galleries Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add missing packages in the build image Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * exllama is ponly available on cublas Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * pin torch on chatterbox Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups to index Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Debug CI * Install accellerators deps Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add target arch * Add cuda minor version Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Use self-hosted runners Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * ci: use quay for test images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups for vllm and chatterbox Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small fixups on CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chatterbox is only available for nvidia Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Simplify CI builds Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Adapt test, use qwen3 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(model gallery): add jina-reranker-v1-tiny-en-gguf Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(gguf-parser): recover from potential panics that can happen while reading ggufs with gguf-parser Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Use reranker from llama.cpp in AIO images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Limit concurrent jobs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>		2025-06-15 14:56:52 +02:00
..
alpaca.yaml	models(gallery): add leetwizard (#3093 )	2024-07-31 10:43:45 +02:00
arch-function.yaml	models(gallery): add versatillama-llama-3.2-3b-instruct-abliterated (#3771 )	2024-10-09 16:58:34 +02:00
cerbero.yaml	fix: yamlint warnings and errors (#2131 )	2024-04-25 17:25:56 +00:00
chatml-hercules.yaml	models(gallery): add hercules and helpingAI (#2376 )	2024-05-22 22:42:41 +02:00
chatml.yaml	fix(chatml): add endoftext stopword	2025-03-01 21:16:10 +01:00
codellama.yaml	fix: yamlint warnings and errors (#2131 )	2024-04-25 17:25:56 +00:00
command-r.yaml	models(gallery): add mistral-0.3 and command-r, update functions (#2388 )	2024-05-23 19:16:08 +02:00
deephermes.yaml	fix(deephermes): correct typo	2025-03-01 17:07:12 +01:00
deepseek-r1.yaml	chore(model gallery): update deepseek-r1 prompt template (#4686 )	2025-01-25 09:04:38 +01:00
deepseek.yaml	feat: models(gallery): add deepseek-v2-lite (#2658 )	2024-07-13 17:09:59 -04:00
dreamshaper.yaml	fix: yamlint warnings and errors (#2131 )	2024-04-25 17:25:56 +00:00
falcon3.yaml	chore(model gallery): add falcon3-1b-instruct (#4423 )	2024-12-18 10:12:06 +01:00
flux-ggml.yaml	fix(flux): Set CFG=1 so that prompts are followed (#5378 )	2025-05-16 17:53:54 +02:00
flux.yaml	fix(flux): Set CFG=1 so that prompts are followed (#5378 )	2025-05-16 17:53:54 +02:00
gemma.yaml	fix(gemma): improve prompt for tool calls (#5142 )	2025-04-08 10:12:42 +02:00
granite.yaml	models(gallery): add granite-3.0-1b-a400m-instruct (#3994 )	2024-10-28 19:33:52 +01:00
granite3-2.yaml	chore(model gallery): add ibm-granite_granite-3.2-8b-instruct (#4927 )	2025-03-02 10:19:27 +01:00
hermes-2-pro-mistral.yaml	models(gallery): add hermes-3 (#3252 )	2024-08-16 00:02:21 +02:00
hermes-vllm.yaml	chore(model-gallery): add more quants for popular models (#3365 )	2024-08-24 00:29:24 +02:00
index.yaml	feat: Add backend gallery (#5607 )	2025-06-15 14:56:52 +02:00
llama3-instruct.yaml	Update llama3-instruct.yaml	2024-07-27 15:30:13 +02:00
llama3.1-instruct-grammar.yaml	Update llama3.1-instruct-grammar.yaml	2024-07-27 15:30:01 +02:00
llama3.1-instruct.yaml	Update llama3.1-instruct.yaml	2024-07-27 15:29:50 +02:00
llama3.1-reflective.yaml	models(gallery): add llama3.1-reflective config	2024-09-20 17:35:06 +02:00
llama3.2-fcall.yaml	chore(model gallery): small fixups to llama3.2-fcall template	2025-02-03 17:58:57 +01:00
llama3.2-quantized.yaml	chore(model gallery): add specific message templates for llama3.2 based models (#4707 )	2025-01-29 10:19:48 +01:00
llava.yaml	fix: yamlint warnings and errors (#2131 )	2024-04-25 17:25:56 +00:00
mathstral.yaml	models(gallery): add mathstral-7b-v0.1-imat (#2901 )	2024-07-17 18:19:54 +02:00
mistral-0.3.yaml	models(gallery): add mistral-0.3 and command-r, update functions (#2388 )	2024-05-23 19:16:08 +02:00
moondream.yaml	chore(gallery): do not specify backend with moondream	2024-10-10 19:54:07 +02:00
mudler.yaml	models(gallery): add LocalAI-Llama3-8b-Function-Call-v0.2-GGUF (#2355 )	2024-05-20 00:59:17 +02:00
noromaid.yaml	fix: yamlint warnings and errors (#2131 )	2024-04-25 17:25:56 +00:00
openvino.yaml	gallery: Added some OpenVINO models (#2249 )	2024-05-06 10:52:05 +02:00
parler-tts.yaml	fix: yamlint warnings and errors (#2131 )	2024-04-25 17:25:56 +00:00
phi-2-chat.yaml	fix: yamlint warnings and errors (#2131 )	2024-04-25 17:25:56 +00:00
phi-2-orange.yaml	fix: yamlint warnings and errors (#2131 )	2024-04-25 17:25:56 +00:00
phi-3-chat.yaml	models(gallery): add cream-phi-13b (#2417 )	2024-05-26 20:11:57 +02:00
phi-3-vision.yaml	fix(phi3-vision): add multimodal template (#3944 )	2024-10-23 15:34:45 +02:00
phi-4-chat-fcall.yaml	chore(model gallery): add LocalAI-functioncall-phi-4-v0.3 (#4599 )	2025-01-14 09:27:18 +01:00
phi-4-chat.yaml	chore(model gallery): add phi-4 (#4562 )	2025-01-08 23:26:25 +01:00
piper.yaml	fix: yamlint warnings and errors (#2131 )	2024-04-25 17:25:56 +00:00
qwen-fcall.yaml	chore(model gallery): add localai-functioncall-qwen2.5-7b-v0.5 (#4796 )	2025-02-10 12:07:35 +01:00
qwen3-openbuddy.yaml	chore(model gallery): add openbuddy_openbuddy-r1-0528-distill-qwen3-32b-preview0-qat (#5631 )	2025-06-11 11:27:30 +02:00
qwen3.yaml	chore(model gallery): add qwen3-30b-a3b (#5269 )	2025-04-29 09:44:44 +02:00
rerankers.yaml	fix: yamlint warnings and errors (#2131 )	2024-04-25 17:25:56 +00:00
rwkv.yaml	fix(rwkv model): add stoptoken (#4283 )	2024-11-28 09:34:35 +01:00
sd-ggml.yaml	chore(model gallery): add sd-3.5-large-ggml (#4647 )	2025-01-20 19:04:23 +01:00
sentencetransformers.yaml	fix: yamlint warnings and errors (#2131 )	2024-04-25 17:25:56 +00:00
smolvlm.yaml	chore(model gallery): add smolvlm-256m-instruct (#5412 )	2025-05-20 11:15:09 +02:00
stablediffusion3.yaml	feat(sd-3): add stablediffusion 3 support (#2591 )	2024-06-18 15:09:39 +02:00
tuluv2.yaml	models(gallery): add archangel_sft_pythia2-8b (#2933 )	2024-07-20 16:17:34 +02:00
vicuna-chat.yaml	models(gallery): add apollo2-9b (#3860 )	2024-10-17 10:16:52 +02:00
virtual.yaml	fix: yamlint warnings and errors (#2131 )	2024-04-25 17:25:56 +00:00
vllm.yaml	feat(vllm): Additional vLLM config options (Disable logging, dtype, and Per-Prompt media limits) (#4855 )	2025-02-18 19:27:58 +01:00
whisper-base.yaml	models(gallery): add all whisper variants (#2462 )	2024-06-01 20:04:03 +02:00
wizardlm2.yaml	models(gallery): add wizardlm2 (#2209 )	2024-05-02 18:31:02 +02:00