LocalAI

mirror of https://github.com/mudler/LocalAI synced 2026-04-21 21:37:21 +00:00

Author	SHA1	Message	Date
Richard Palethorpe	c1d0b10b14	chore(docs): Document using a local model gallery (#8426 ) Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-02-06 10:28:41 +01:00
Ettore Di Giacinto	53276d28e7	feat(musicgen): add ace-step and UI interface (#8396 ) * feat(musicgen): add ace-step and UI interface Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Correctly handle model dir Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop auto-download Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add to models, fixup UIs icons Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * l4t13 is incompatbile Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * avoid pinning version for cuda12 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop l4t12 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-02-05 12:04:53 +01:00
Jonas Bernard	5ac50c9348	fix(docs): Promote DEBUG=false in production docker compose (#8390 ) fix(docs): Use DEBUG=false in production docker compose Signed-off-by: Jonas Bernard <public.jbernard@web.de>	2026-02-04 09:35:32 +01:00
Andres	b6459ddd57	feat(api): Add transcribe response format request parameter & adjust STT backends (#8318 ) * WIP response format implementation for audio transcriptions (cherry picked from commit e271dd764bbc13846accf3beb8b6522153aa276f) Signed-off-by: Andres Smith <andressmithdev@pm.me> * Rework transcript response_format and add more formats (cherry picked from commit 6a93a8f63e2ee5726bca2980b0c9cf4ef8b7aeb8) Signed-off-by: Andres Smith <andressmithdev@pm.me> * Add test and replace go-openai package with official openai go client (cherry picked from commit f25d1a04e46526429c89db4c739e1e65942ca893) Signed-off-by: Andres Smith <andressmithdev@pm.me> * Fix faster-whisper backend and refactor transcription formatting to also work on CLI Signed-off-by: Andres Smith <andressmithdev@pm.me> (cherry picked from commit 69a93977d5e113eb7172bd85a0f918592d3d2168) Signed-off-by: Andres Smith <andressmithdev@pm.me> --------- Signed-off-by: Andres Smith <andressmithdev@pm.me> Co-authored-by: nanoandrew4 <nanoandrew4@gmail.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-02-01 17:33:17 +01:00
Ettore Di Giacinto	68dd9765a0	feat(tts): add support for streaming mode (#8291 ) * feat(tts): add support for streaming mode Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Send first audio, make sure it's 16 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-30 11:58:01 +01:00
Richard Palethorpe	dd8e74a486	feat(realtime): Add audio conversations (#6245 ) * feat(realtime): Add audio conversations Signed-off-by: Richard Palethorpe <io@richiejp.com> * chore(realtime): Vendor the updated API and modify for server side Signed-off-by: Richard Palethorpe <io@richiejp.com> * feat(realtime): Update to the GA realtime API Signed-off-by: Richard Palethorpe <io@richiejp.com> * chore: Document realtime API and add docs to AGENTS.md Signed-off-by: Richard Palethorpe <io@richiejp.com> * feat: Filter reasoning from spoken output Signed-off-by: Richard Palethorpe <io@richiejp.com> * fix(realtime): Send delta and done events for tool calls and audio transcripts Ensure that content is sent in both deltas and done events for function call arguments and audio transcripts. This fixes compatibility with clients that rely on delta events for parsing. 💘 Generated with Crush Signed-off-by: Richard Palethorpe <io@richiejp.com> * fix(realtime): Improve tool call handling and error reporting - Refactor Model interface to accept []types.ToolUnion and *types.ToolChoiceUnion instead of JSON strings, eliminating unnecessary marshal/unmarshal cycles - Fix Parameters field handling: support both map[string]any and JSON string formats - Add PredictConfig() method to Model interface for accessing model configuration - Add comprehensive debug logging for tool call parsing and function config - Add missing return statement after prediction error (critical bug fix) - Add warning logs for NoAction function argument parsing failures - Improve error visibility throughout generateResponse function 💘 Generated with Crush Assisted-by: Claude Sonnet 4.5 via Crush <crush@charm.land> Signed-off-by: Richard Palethorpe <io@richiejp.com> --------- Signed-off-by: Richard Palethorpe <io@richiejp.com>	2026-01-29 08:44:53 +01:00
Ettore Di Giacinto	6804ce1c39	chore(docs): change MEMORY_FILE_PATH to MEMORY_INDEX_PATH Updated MEMORY_FILE_PATH to MEMORY_INDEX_PATH in memory configuration. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-01-25 22:14:11 +01:00
Ettore Di Giacinto	26a374b717	chore: drop bark which is unmaintained (#8207 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-25 09:26:40 +01:00
Ettore Di Giacinto	05904c77f5	chore(exllama): drop backend now almost deprecated (#8186 ) exllama2 development has stalled and only old architectures are supported. exllamav3 is still in development, meanwhile cleaning up exllama2 from the gallery. Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-24 08:57:37 +01:00
LocalAI [bot]	a1e3acc590	docs: ⬆️ update docs version mudler/LocalAI (#8182 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-23 22:03:47 +01:00
Ettore Di Giacinto	923ebbb344	feat(qwen-tts): add Qwen-tts backend (#8163 ) * feat(qwen-tts): add Qwen-tts backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update intel deps Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop flash-attn for cuda13 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-23 15:18:41 +01:00
Ettore Di Giacinto	c491c6ca90	feat(openresponses): Support reasoning blocks (#8133 ) * feat(openresponses): support reasoning blocks Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * allow to disable reasoning, refactor common logic Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add option to only strip reasoning Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add configurations for custom reasoning tokens Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-21 00:11:45 +01:00
Ettore Di Giacinto	4bf2f8bbd8	chore(docs): update docs with Anthropic API and openresponses Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-20 09:25:24 +01:00
LocalAI [bot]	54c5a2d9ea	docs: ⬆️ update docs version mudler/LocalAI (#8120 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-19 21:18:24 +00:00
Ettore Di Giacinto	44d78b4d15	chore(doc): put alert on install.sh until is fixed (#8042 ) See: https://github.com/mudler/LocalAI/issues/8032 Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-14 22:08:48 +01:00
Ettore Di Giacinto	64d0a96ba3	feat(ui): add video gen UI (#8020 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-14 11:43:32 +01:00
Ettore Di Giacinto	a6ff354c86	feat(tts): add pocket-tts backend (#8018 ) * feat(pocket-tts): add new backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add to the gallery Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-13 23:35:19 +01:00
Richard Palethorpe	98f28bf583	chore(docs): Add Crush and VoxInput to the integrations (#7924 ) * chore(docs): Add Crush and VoxInput to the integrations Signed-off-by: Richard Palethorpe <io@richiejp.com> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Richard Palethorpe <io@richiejp.com> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-01-08 21:39:25 +01:00
Richard Palethorpe	e6ba26c3e7	chore: Update to Ubuntu24.04 (cont #7423 ) (#7769 ) * ci(workflows): bump GitHub Actions images to Ubuntu 24.04 Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * ci(workflows): remove CUDA 11.x support from GitHub Actions (incompatible with ubuntu:24.04) Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * ci(workflows): bump GitHub Actions CUDA support to 12.9 Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * build(docker): bump base image to ubuntu:24.04 and adjust Vulkan SDK/packages Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * fix(backend): correct context paths for Python backends in workflows, Makefile and Dockerfile Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * chore(make): disable parallel backend builds to avoid race conditions Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * chore(make): export CUDA_MAJOR_VERSION and CUDA_MINOR_VERSION for override Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * build(backend): update backend Dockerfiles to Ubuntu 24.04 Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * chore(backend): add ROCm env vars and default AMDGPU_TARGETS for hipBLAS builds Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * chore(chatterbox): bump ROCm PyTorch to 2.9.1+rocm6.4 and update index URL; align hipblas requirements Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * chore: add local-ai-launcher to .gitignore Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * ci(workflows): fix backends GitHub Actions workflows after rebase Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * build(docker): use build-time UBUNTU_VERSION variable Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * chore(docker): remove libquadmath0 from requirements-stage base image Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * chore(make): add backends/vllm to .NOTPARALLEL to prevent parallel builds Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * fix(docker): correct CUDA installation steps in backend Dockerfiles Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * chore(backend): update ROCm to 6.4 and align Python hipblas requirements Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * ci(workflows): switch GitHub Actions runners to Ubuntu-24.04 for CUDA on arm64 builds Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * build(docker): update base image and backend Dockerfiles for Ubuntu 24.04 compatibility on arm64 Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * build(backend): increase timeout for uv installs behind slow networks on backend/Dockerfile.python Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * ci(workflows): switch GitHub Actions runners to Ubuntu-24.04 for vibevoice backend Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * ci(workflows): fix failing GitHub Actions runners Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * fix: Allow FROM_SOURCE to be unset, use upstream Intel images etc. Signed-off-by: Richard Palethorpe <io@richiejp.com> * chore(build): rm all traces of CUDA 11 Signed-off-by: Richard Palethorpe <io@richiejp.com> * chore(build): Add Ubuntu codename as an argument Signed-off-by: Richard Palethorpe <io@richiejp.com> --------- Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> Signed-off-by: Richard Palethorpe <io@richiejp.com> Co-authored-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com>	2026-01-06 15:26:42 +01:00
Ettore Di Giacinto	d38811560c	chore(docs): add opencode, GHA, and realtime voice assistant examples Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-03 22:03:43 +01:00
Ettore Di Giacinto	c844b7ac58	feat: disable force eviction (#7725 ) * feat: allow to set forcing backends eviction while requests are in flight Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: try to make the request sit and retry if eviction couldn't be done Otherwise calls that in order to pass would need to shutdown other backends would just fail. In this way instead we make the request sit and retry eviction until it succeeds. The thresholds can be configured by the user. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * add tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * expose settings to CLI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-12-25 14:26:18 +01:00
Ettore Di Giacinto	bf2f95c684	chore(docs): update docs with cuda 13 instructions and the new vibevoice backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-12-25 10:00:07 +01:00
LocalAI [bot]	94069f2751	docs: ⬆️ update docs version mudler/LocalAI (#7716 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-12-24 21:06:02 +00:00
Mikhail Khludnev	53b0530275	docs: Add `langchain-localai` integration package to documentation (#7677 ) Add `langchain-localai` integration package to documentation Signed-off-by: Mikhail Khludnev <mkhludnev@users.noreply.github.com>	2025-12-21 21:02:14 +01:00
Ettore Di Giacinto	2387b266d8	chore(llama.cpp): Add Missing llama.cpp Options to gRPC Server (#7584 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-12-15 21:55:20 +01:00
Ettore Di Giacinto	fc5b9ebfcc	feat(loader): enhance single active backend to support LRU eviction (#7535 ) * feat(loader): refactor single active backend support to LRU This changeset introduces LRU management of loaded backends. Users can set now a maximum number of models to be loaded concurrently, and, when setting LocalAI in single active backend mode we set LRU to 1 for backward compatibility. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: add tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-12-12 12:28:38 +01:00
Ettore Di Giacinto	00a05208bc	chore(docs): center video Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-12-08 16:59:11 +01:00
Ettore Di Giacinto	a27d0d151f	Embed YouTube video in documentation Added an embedded YouTube video to the documentation. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-12-08 16:53:20 +01:00
Igor B. Poretsky	96e123d53a	Messages output fix (#7424 ) The internal echo command in sh does not support "-e" and "-E" options and interprets backslash escape sequences by default. So we prefer the external echo command when it is available.	2025-12-04 11:30:02 +01:00
LocalAI [bot]	4c41f96157	docs: ⬆️ update docs version mudler/LocalAI (#7381 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-11-27 21:49:31 +01:00
Igor B. Poretsky	a8eb1c421b	Clean data directory (#7378 ) It seems to be no point to copy /etc/skel content to newly created data directory.	2025-11-27 17:48:32 +01:00
Igor B. Poretsky	d27a281783	Correct user deletion with all its data (#7368 ) Actually it is not necessary to remove particularly the local-ai data directory before user deletion. It will be accomplished automatically by the userdel command. But it is crucial to remove additional users from the local-ai group to allow userdel command to delete the group itself.	2025-11-27 17:47:55 +01:00
Igor B. Poretsky	c411fe09fb	Conventional way of adding extra apt repository (#7362 )	2025-11-27 17:46:26 +01:00
Igor B. Poretsky	acbcb44dbc	Initialize sudo reference before its first actual use (#7367 ) Unfortunately, in my previous pr I missed the fact that uninstall procedure uses sudo as well. La colpa mia.	2025-11-27 15:20:46 +01:00
Igor B. Poretsky	ab022172a9	chore: switch from /usr/share to /var/lib for data storage (#7361 ) * More appropriate place for data storing The /usr/share subtree in Linux is used for data that generally are not supposed to change. Conventional places for changeable data are usually located under /var, so /var/lib seems to be a reasonable default here. * Data paths consistency fix * Directory name consistency fix	2025-11-27 09:18:28 +01:00
Igor B. Poretsky	c0d1d0211f	fix: Initialize sudo reference before its first actual use (#7360 )	2025-11-26 16:03:42 +01:00
Igor B. Poretsky	f617bec686	fix: double sudo invocation fix in the install script (#7359 ) Double sudo invocation fix in the install script	2025-11-26 16:03:10 +01:00
Ettore Di Giacinto	71ed03102f	feat(ui): add chat history (#7325 ) * feat(chat): add history and management Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Display in progress chats Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fetch available context size as we switch chat Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add search Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Display MCP toggle correctly Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Re-ordering Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Re-style Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Stable ordering Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Display token/sec correctly Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Visual changes Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Display chat time Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-11-24 11:48:24 +01:00
Ettore Di Giacinto	dd2828241c	chore(docs): add documentation about import (#7315 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-11-20 23:07:36 +01:00
Copilot	16e5689162	feat(importers): Add diffuser backend importer with ginkgo tests and UI support (#7316 ) * Initial plan * Add diffuser backend importer with ginkgo tests Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> * Finalize diffuser backend importer implementation Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> * Add diffuser preferences to model-editor import section Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> * Use gopkg.in/yaml.v3 for consistency in diffuser importer Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-11-20 22:38:30 +01:00
Ettore Di Giacinto	2dd42292dc	feat(ui): runtime settings (#7320 ) * feat(ui): add watchdog settings Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Do not re-read env Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Some refactor, move other settings to runtime (p2p) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add API Keys handling Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Allow to disable runtime settings Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Documentation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * show MCP toggle in index Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop context default Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-11-20 22:37:20 +01:00
Ettore Di Giacinto	53d51671d7	Update Docker installation recommendation wording Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-11-20 17:27:48 +01:00
Ettore Di Giacinto	95b6c9bb5a	Update docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-11-19 22:25:33 +01:00
Ettore Di Giacinto	2cc4809b0d	feat: docs revamp (#7313 ) * docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small enhancements Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Enhancements * Default to zen-dark Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-11-19 22:21:20 +01:00
Ettore Di Giacinto	18d11396cd	chore(docs): improve documentation and split into sections bigger topics (#7292 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-11-17 18:39:21 +01:00
Copilot	34bc1bda1e	fix(api): SSE streaming format to comply with specification (#7182 ) * Initial plan * Fix SSE streaming format to comply with specification - Replace json.Encoder with json.Marshal for explicit formatting - Use explicit \n\n for all SSE messages (instead of relying on implicit newlines) - Change %v to %s format specifier for proper string formatting - Fix error message streaming to include proper SSE format - Ensure consistency between chat.go and completion.go endpoints Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> * Add proper error handling for JSON marshal failures in streaming - Handle json.Marshal errors explicitly in error response paths - Add fallback simple error message if marshal fails - Prevents sending 'data: <nil>' on marshal failures - Addresses code review feedback Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> * Fix SSE streaming format to comply with specification Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> * Fix finish_reason field to use pointer for proper null handling - Change FinishReason from string to *string in Choice schema - Streaming chunks now omit finish_reason (null) instead of empty string - Final chunks properly set finish_reason to "stop", "tool_calls", etc. - Remove empty content from initial streaming chunks (only send role) - Final streaming chunk sends empty delta with finish_reason - Addresses OpenAI API compliance issues causing client failures Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> Improve code consistency for string pointer creation - Use consistent pattern: declare variable then take address - Remove inline anonymous function for better readability - Addresses code review feedback Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> * Move common finish reasons to constants - Create constants.go with FinishReasonStop, FinishReasonToolCalls, FinishReasonFunctionCall - Replace all string literals with constants in chat.go, completion.go, realtime.go - Improves code maintainability and prevents typos Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> * Make it build Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fix finish_reason to always be present with null or string value - Remove omitempty from FinishReason field in Choice struct - Explicitly set FinishReason to nil for all streaming chunks - Ensures finish_reason appears as null in JSON for streaming chunks - Final chunks still properly set finish_reason to "stop", "tool_calls", etc. - Complies with OpenAI API specification example Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@localai.io>	2025-11-09 22:00:27 +01:00
Ettore Di Giacinto	02cc8cbcaa	feat(llama.cpp): consolidate options and respect tokenizer template when enabled (#7120 ) * feat(llama.cpp): expose env vars as options for consistency This allows to configure everything in the YAML file of the model rather than have global configurations Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(llama.cpp): respect usetokenizertemplate and use llama.cpp templating system to process messages Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * WIP Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Detect template exists if use tokenizer template is enabled Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Better recognization of chat Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixes to support tool calls while using templates from tokenizer Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop template guessing, fix passing tools to tokenizer Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Extract grammar and other options from chat template, add schema struct Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * WIP Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * WIP Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Automatically set use_jinja Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Cleanups, identify by default gguf models for chat Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-11-07 21:23:50 +01:00
Ettore Di Giacinto	79247a5d17	Clarify note about DMGs not being signed by Apple Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-11-04 12:09:28 +01:00
Ettore Di Giacinto	46b7a4c5f2	Add macOS DMG download information Added download link and note for macOS DMG installation. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-11-04 12:09:07 +01:00
Ettore Di Giacinto	436e2d91d0	Enhance overview with Docker and installer details Added Docker instructions and clarified one-liner installer for Linux. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-11-04 12:08:03 +01:00
Ettore Di Giacinto	a86fdc4087	Update binaries.md with macOS download instructions Added download instructions for macOS DMG file and updated command for Linux and macOS. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-11-04 12:06:56 +01:00
LocalAI [bot]	e485bdf9ab	docs: ⬆️ update docs version mudler/LocalAI (#6996 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-11-01 21:08:08 +00:00
Ettore Di Giacinto	238aad666e	chore(deps): bump cogito (#6785 ) chore(deps): Bump cogito Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-10-27 10:07:31 +01:00
Chakib Benziane	32c0ab3a7f	fix: properly terminate llama.cpp kv_overrides array with empty key + updated doc (#6672 ) * fix: properly terminate kv_overrides array with empty key The llama model loading function expects KV overrides to be terminated with an empty key (key[0] == 0). Previously, the kv_overrides vector was not being properly terminated, causing an assertion failure. This commit ensures that after parsing all KV override strings, we add a final terminating entry with an empty key to satisfy the C-style array termination requirement. This fixes the assertion error and allows the model to load correctly with custom KV overrides. Fixes #6643 - Also included a reference to the usage of the `overrides` option in the advanced-usage section. Signed-off-by: blob42 <contact@blob42.xyz> * doc: document the `overrides` option --------- Signed-off-by: blob42 <contact@blob42.xyz>	2025-10-23 09:31:55 +02:00
Ettore Di Giacinto	a22f6a499d	feat(mcp): add planning and reevaluation (#6541 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-10-18 18:26:32 +02:00
Ettore Di Giacinto	e963e16bc5	Remove model size guidance from FAQ Removed redundant information about model sizes in the WebUI. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-10-10 21:50:56 +02:00
Ettore Di Giacinto	1e9b115251	chore(docs): enhancements and clarifications (#6433 ) chore(docs): Small enhancements Fixes: https://github.com/mudler/LocalAI/issues/6250 Relates to: https://github.com/mudler/LocalAI/issues/6251 Fixes: https://github.com/mudler/LocalAI/issues/6249 Fixes: https://github.com/mudler/LocalAI/issues/6250 Fixes: https://github.com/mudler/LocalAI/issues/6253 Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-10-10 21:31:11 +02:00
Ettore Di Giacinto	cb0ed55d89	feat(neutts): add backend (#6404 ) * feat(neutts): add backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(ci): add images to CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(gallery): add Neutts Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Make it work with quantized versions Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-10-09 21:51:28 +02:00
Ettore Di Giacinto	c38564e22c	Update docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-10-07 16:25:03 +02:00
Ettore Di Giacinto	183559bb98	chore(docs): add MCP example (#6405 ) docs update Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-10-07 11:42:28 +02:00
Ettore Di Giacinto	85e27ec74c	feat: add agent options to model config (#6383 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-10-05 21:54:04 +02:00
Ettore Di Giacinto	698205a2f3	Add links to Awesome MCPs and MCPs by mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-10-05 21:27:44 +02:00
Ettore Di Giacinto	930553ef60	Update mcp.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-10-05 18:21:02 +02:00
Ettore Di Giacinto	60b6472fa0	feat: Add Agentic MCP support with a new chat/completion endpoint (#6381 ) * WIP - add endpoint Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Rename Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Wire the Completion API Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Try to make it functional Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Almost functional Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Bump golang versions used in tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add description of the tool Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Make it working Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small optimizations Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Cleanup/refactor Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-10-05 17:51:41 +02:00
LocalAI [bot]	530c174fd3	docs: ⬆️ update docs version mudler/LocalAI (#6378 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-10-03 23:26:09 +02:00
LocalAI [bot]	357bf571a3	docs: ⬆️ update docs version mudler/LocalAI (#6318 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-09-21 08:40:00 +02:00
LocalAI [bot]	f7f26b8efa	docs: ⬆️ update docs version mudler/LocalAI (#6315 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-09-20 09:41:58 +02:00
LocalAI [bot]	d3c5c02837	docs: ⬆️ update docs version mudler/LocalAI (#6307 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-09-18 23:48:02 +02:00
LocalAI [bot]	542f07ab2d	docs: ⬆️ update docs version mudler/LocalAI (#6305 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-09-17 21:06:50 +00:00
Gianluca Boiano	d0e99562af	chore(aio): upgrade minicpm-v model to latest 4.5 (#6262 ) chore(aio): upgrade vision model to MiniCPM-V 4.5 Signed-off-by: Gianluca Boiano <morf3089@gmail.com>	2025-09-14 15:04:58 +02:00
Mauro Morales	59311d8b1e	Point to LocalAI-examples repo for llava (#6241 ) Signed-off-by: Mauro Morales <contact@mauromorales.com>	2025-09-09 16:40:55 +02:00
Ettore Di Giacinto	0b528458d8	chore(docs): add MacOS dmg download button (#6233 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-09-09 00:19:37 +02:00
Ettore Di Giacinto	e905e90dd7	Add MLX-audio entry to compatibility table Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-09-08 09:54:01 +02:00
Aliz Fara	9911ec84a3	Fix Typos in Docs (#6204 ) Signed-off-by: alizfara112 <alizfaraafa@gmail.com>	2025-09-05 22:11:21 +02:00
LocalAI [bot]	326f6e5ccb	docs: ⬆️ update docs version mudler/LocalAI (#6201 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-09-04 21:03:02 +00:00
Ettore Di Giacinto	43e0437db6	Revise GPU usage recommendations in documentation Updated recommendations for GPU usage on Xorg. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-09-01 22:20:41 +02:00
Ettore Di Giacinto	195aa22e77	chore(docs): update list of supported backends (#6134 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-08-24 20:09:19 +02:00
Ettore Di Giacinto	c899e90277	Update image-generation.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-08-20 10:37:11 +02:00
LocalAI [bot]	b70ee45fff	docs: ⬆️ update docs version mudler/LocalAI (#6046 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-08-12 22:05:50 +02:00
lnnt	7d41551e10	docs: update links in advanced-usage and models documentation (#5994 ) * docs: update links in advanced-usage and models documentation * docs: update links in advanced-usage and models documentation	2025-08-08 10:23:42 +02:00
LocalAI [bot]	e83652489c	docs: ⬆️ update docs version mudler/LocalAI (#5967 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-08-04 21:00:23 +00:00
LocalAI [bot]	a1e1942d83	docs: ⬆️ update docs version mudler/LocalAI (#5956 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-08-01 22:14:23 +02:00
Dedy F. Setyawan	787302b204	fix(docs): Improve responsiveness of tables (#5954 ) Signed-off-by: Dedy F. Setyawan <dedyfajars@gmail.com>	2025-08-01 22:13:53 +02:00
Richard Palethorpe	c07bc55fee	fix(intel): Set GPU vendor on Intel images and cleanup (#5945 ) Signed-off-by: Richard Palethorpe <io@richiejp.com>	2025-07-31 19:44:46 +02:00
LocalAI [bot]	9d7ec09ec0	docs: ⬆️ update docs version mudler/LocalAI (#5929 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-07-28 21:03:44 +00:00
Ettore Di Giacinto	949e5b9be8	feat(rfdetr): add object detection API (#5923 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-07-27 22:02:51 +02:00
LocalAI [bot]	078c22f485	docs: ⬆️ update docs version mudler/LocalAI (#5920 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-07-26 20:58:54 +00:00
Ettore Di Giacinto	6ef3852de5	chore(docs): fixup tag Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-07-26 21:25:07 +02:00
Ettore Di Giacinto	a8057b952c	fix(cuda): be consistent with image tag naming (#5916 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-07-26 08:30:59 +02:00
Ettore Di Giacinto	fd5c1d916f	chore(docs): add documentation on backend detection override (#5915 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-07-26 08:18:31 +02:00
LocalAI [bot]	a760f7ff39	docs: ⬆️ update docs version mudler/LocalAI (#5912 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-07-25 22:15:16 +02:00
Ettore Di Giacinto	facf7625f3	fix(vulkan): use correct image suffix (#5911 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-07-25 19:20:20 +02:00
Ettore Di Giacinto	3973e6e5da	fix(install.sh): update to use the new binary naming (#5903 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-07-25 10:43:22 +02:00
Ettore Di Giacinto	deda3a4972	Update build documentation Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-07-24 22:53:08 +02:00
Ettore Di Giacinto	a28f27604a	Update backends.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-07-24 16:18:25 +02:00
Nathaniel Hyson	4db1b80278	Update quickstart.md (#5898 ) Fixed spelling mistake Signed-off-by: Nathaniel Hyson <Shinrai@users.noreply.github.com>	2025-07-24 15:04:02 +02:00
Ettore Di Giacinto	5f7ece3e94	fix(p2p): adapt to backend changes, general improvements (#5889 ) The binary is now named "llama-cpp-rpc-server" for p2p workers. We also decrease the default token rotation interval, in this way peer discovery is much more responsive. Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-07-23 12:40:32 +02:00
Ettore Di Giacinto	98e5291afc	feat: refactor build process, drop embedded backends (#5875 ) * feat: split remaining backends and drop embedded backends - Drop silero-vad, huggingface, and stores backend from embedded binaries - Refactor Makefile and Dockerfile to avoid building grpc backends - Drop golang code that was used to embed backends - Simplify building by using goreleaser Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(gallery): be specific with llama-cpp backend templates Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(docs): update Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(ci): minor fixes Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: drop all ffmpeg references Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: run protogen-go Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Always enable p2p mode Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update gorelease file Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(stores): do not always load Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fix linting issues Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Simplify Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Mac OS fixup Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-07-22 16:31:04 +02:00
Dedy F. Setyawan	a1d061c835	fix(docs): Resolve logo overlap on tablet view (#5853 ) * fix(docs): Resolve logo overlap on tablet view Signed-off-by: Dedy F. Setyawan <dedyfajars@gmail.com> * fix(docs): Adjust header logo size Signed-off-by: Dedy F. Setyawan <dedyfajars@gmail.com> * refactor(docs): Rework header logo sizing implementation Signed-off-by: Dedy F. Setyawan <dedyfajars@gmail.com> --------- Signed-off-by: Dedy F. Setyawan <dedyfajars@gmail.com>	2025-07-18 15:55:44 +02:00
Ettore Di Giacinto	7e1f2657d5	Update GPU-acceleration.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-07-06 19:03:34 +02:00

461 commits