LocalAI

mirror of https://github.com/mudler/LocalAI synced 2026-05-24 09:28:23 +00:00

Author	SHA1	Message	Date
LocalAI [bot]	a1e3acc590	docs: ⬆️ update docs version mudler/LocalAI (#8182 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-23 22:03:47 +01:00
Ettore Di Giacinto	923ebbb344	feat(qwen-tts): add Qwen-tts backend (#8163 ) * feat(qwen-tts): add Qwen-tts backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update intel deps Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop flash-attn for cuda13 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-23 15:18:41 +01:00
Ettore Di Giacinto	c491c6ca90	feat(openresponses): Support reasoning blocks (#8133 ) * feat(openresponses): support reasoning blocks Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * allow to disable reasoning, refactor common logic Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add option to only strip reasoning Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add configurations for custom reasoning tokens Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-21 00:11:45 +01:00
Ettore Di Giacinto	4bf2f8bbd8	chore(docs): update docs with Anthropic API and openresponses Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-20 09:25:24 +01:00
LocalAI [bot]	54c5a2d9ea	docs: ⬆️ update docs version mudler/LocalAI (#8120 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2026-01-19 21:18:24 +00:00
Ettore Di Giacinto	44d78b4d15	chore(doc): put alert on install.sh until is fixed (#8042 ) See: https://github.com/mudler/LocalAI/issues/8032 Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-14 22:08:48 +01:00
Ettore Di Giacinto	64d0a96ba3	feat(ui): add video gen UI (#8020 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-14 11:43:32 +01:00
Ettore Di Giacinto	a6ff354c86	feat(tts): add pocket-tts backend (#8018 ) * feat(pocket-tts): add new backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add to the gallery Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-13 23:35:19 +01:00
Richard Palethorpe	98f28bf583	chore(docs): Add Crush and VoxInput to the integrations (#7924 ) * chore(docs): Add Crush and VoxInput to the integrations Signed-off-by: Richard Palethorpe <io@richiejp.com> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Richard Palethorpe <io@richiejp.com> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2026-01-08 21:39:25 +01:00
Richard Palethorpe	e6ba26c3e7	chore: Update to Ubuntu24.04 (cont #7423 ) (#7769 ) * ci(workflows): bump GitHub Actions images to Ubuntu 24.04 Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * ci(workflows): remove CUDA 11.x support from GitHub Actions (incompatible with ubuntu:24.04) Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * ci(workflows): bump GitHub Actions CUDA support to 12.9 Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * build(docker): bump base image to ubuntu:24.04 and adjust Vulkan SDK/packages Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * fix(backend): correct context paths for Python backends in workflows, Makefile and Dockerfile Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * chore(make): disable parallel backend builds to avoid race conditions Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * chore(make): export CUDA_MAJOR_VERSION and CUDA_MINOR_VERSION for override Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * build(backend): update backend Dockerfiles to Ubuntu 24.04 Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * chore(backend): add ROCm env vars and default AMDGPU_TARGETS for hipBLAS builds Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * chore(chatterbox): bump ROCm PyTorch to 2.9.1+rocm6.4 and update index URL; align hipblas requirements Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * chore: add local-ai-launcher to .gitignore Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * ci(workflows): fix backends GitHub Actions workflows after rebase Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * build(docker): use build-time UBUNTU_VERSION variable Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * chore(docker): remove libquadmath0 from requirements-stage base image Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * chore(make): add backends/vllm to .NOTPARALLEL to prevent parallel builds Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * fix(docker): correct CUDA installation steps in backend Dockerfiles Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * chore(backend): update ROCm to 6.4 and align Python hipblas requirements Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * ci(workflows): switch GitHub Actions runners to Ubuntu-24.04 for CUDA on arm64 builds Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * build(docker): update base image and backend Dockerfiles for Ubuntu 24.04 compatibility on arm64 Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * build(backend): increase timeout for uv installs behind slow networks on backend/Dockerfile.python Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * ci(workflows): switch GitHub Actions runners to Ubuntu-24.04 for vibevoice backend Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * ci(workflows): fix failing GitHub Actions runners Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> * fix: Allow FROM_SOURCE to be unset, use upstream Intel images etc. Signed-off-by: Richard Palethorpe <io@richiejp.com> * chore(build): rm all traces of CUDA 11 Signed-off-by: Richard Palethorpe <io@richiejp.com> * chore(build): Add Ubuntu codename as an argument Signed-off-by: Richard Palethorpe <io@richiejp.com> --------- Signed-off-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com> Signed-off-by: Richard Palethorpe <io@richiejp.com> Co-authored-by: Alessandro Sturniolo <alessandro.sturniolo@gmail.com>	2026-01-06 15:26:42 +01:00
Ettore Di Giacinto	d38811560c	chore(docs): add opencode, GHA, and realtime voice assistant examples Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2026-01-03 22:03:43 +01:00
Ettore Di Giacinto	c844b7ac58	feat: disable force eviction (#7725 ) * feat: allow to set forcing backends eviction while requests are in flight Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: try to make the request sit and retry if eviction couldn't be done Otherwise calls that in order to pass would need to shutdown other backends would just fail. In this way instead we make the request sit and retry eviction until it succeeds. The thresholds can be configured by the user. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * add tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * expose settings to CLI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-12-25 14:26:18 +01:00
Ettore Di Giacinto	bf2f95c684	chore(docs): update docs with cuda 13 instructions and the new vibevoice backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-12-25 10:00:07 +01:00
LocalAI [bot]	94069f2751	docs: ⬆️ update docs version mudler/LocalAI (#7716 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-12-24 21:06:02 +00:00
Mikhail Khludnev	53b0530275	docs: Add `langchain-localai` integration package to documentation (#7677 ) Add `langchain-localai` integration package to documentation Signed-off-by: Mikhail Khludnev <mkhludnev@users.noreply.github.com>	2025-12-21 21:02:14 +01:00
Ettore Di Giacinto	2387b266d8	chore(llama.cpp): Add Missing llama.cpp Options to gRPC Server (#7584 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-12-15 21:55:20 +01:00
Ettore Di Giacinto	fc5b9ebfcc	feat(loader): enhance single active backend to support LRU eviction (#7535 ) * feat(loader): refactor single active backend support to LRU This changeset introduces LRU management of loaded backends. Users can set now a maximum number of models to be loaded concurrently, and, when setting LocalAI in single active backend mode we set LRU to 1 for backward compatibility. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: add tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-12-12 12:28:38 +01:00
Ettore Di Giacinto	00a05208bc	chore(docs): center video Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-12-08 16:59:11 +01:00
Ettore Di Giacinto	a27d0d151f	Embed YouTube video in documentation Added an embedded YouTube video to the documentation. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-12-08 16:53:20 +01:00
Igor B. Poretsky	96e123d53a	Messages output fix (#7424 ) The internal echo command in sh does not support "-e" and "-E" options and interprets backslash escape sequences by default. So we prefer the external echo command when it is available.	2025-12-04 11:30:02 +01:00
LocalAI [bot]	4c41f96157	docs: ⬆️ update docs version mudler/LocalAI (#7381 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-11-27 21:49:31 +01:00
Igor B. Poretsky	a8eb1c421b	Clean data directory (#7378 ) It seems to be no point to copy /etc/skel content to newly created data directory.	2025-11-27 17:48:32 +01:00
Igor B. Poretsky	d27a281783	Correct user deletion with all its data (#7368 ) Actually it is not necessary to remove particularly the local-ai data directory before user deletion. It will be accomplished automatically by the userdel command. But it is crucial to remove additional users from the local-ai group to allow userdel command to delete the group itself.	2025-11-27 17:47:55 +01:00
Igor B. Poretsky	c411fe09fb	Conventional way of adding extra apt repository (#7362 )	2025-11-27 17:46:26 +01:00
Igor B. Poretsky	acbcb44dbc	Initialize sudo reference before its first actual use (#7367 ) Unfortunately, in my previous pr I missed the fact that uninstall procedure uses sudo as well. La colpa mia.	2025-11-27 15:20:46 +01:00
Igor B. Poretsky	ab022172a9	chore: switch from /usr/share to /var/lib for data storage (#7361 ) * More appropriate place for data storing The /usr/share subtree in Linux is used for data that generally are not supposed to change. Conventional places for changeable data are usually located under /var, so /var/lib seems to be a reasonable default here. * Data paths consistency fix * Directory name consistency fix	2025-11-27 09:18:28 +01:00
Igor B. Poretsky	c0d1d0211f	fix: Initialize sudo reference before its first actual use (#7360 )	2025-11-26 16:03:42 +01:00
Igor B. Poretsky	f617bec686	fix: double sudo invocation fix in the install script (#7359 ) Double sudo invocation fix in the install script	2025-11-26 16:03:10 +01:00
Ettore Di Giacinto	71ed03102f	feat(ui): add chat history (#7325 ) * feat(chat): add history and management Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Display in progress chats Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fetch available context size as we switch chat Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add search Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Display MCP toggle correctly Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Re-ordering Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Re-style Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Stable ordering Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Display token/sec correctly Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Visual changes Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Display chat time Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-11-24 11:48:24 +01:00
Ettore Di Giacinto	dd2828241c	chore(docs): add documentation about import (#7315 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-11-20 23:07:36 +01:00
Copilot	16e5689162	feat(importers): Add diffuser backend importer with ginkgo tests and UI support (#7316 ) * Initial plan * Add diffuser backend importer with ginkgo tests Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> * Finalize diffuser backend importer implementation Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> * Add diffuser preferences to model-editor import section Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> * Use gopkg.in/yaml.v3 for consistency in diffuser importer Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-11-20 22:38:30 +01:00
Ettore Di Giacinto	2dd42292dc	feat(ui): runtime settings (#7320 ) * feat(ui): add watchdog settings Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Do not re-read env Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Some refactor, move other settings to runtime (p2p) Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add API Keys handling Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Allow to disable runtime settings Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Documentation Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * show MCP toggle in index Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop context default Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-11-20 22:37:20 +01:00
Ettore Di Giacinto	53d51671d7	Update Docker installation recommendation wording Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-11-20 17:27:48 +01:00
Ettore Di Giacinto	95b6c9bb5a	Update docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-11-19 22:25:33 +01:00
Ettore Di Giacinto	2cc4809b0d	feat: docs revamp (#7313 ) * docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small enhancements Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Enhancements * Default to zen-dark Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-11-19 22:21:20 +01:00
Ettore Di Giacinto	18d11396cd	chore(docs): improve documentation and split into sections bigger topics (#7292 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-11-17 18:39:21 +01:00
Copilot	34bc1bda1e	fix(api): SSE streaming format to comply with specification (#7182 ) * Initial plan * Fix SSE streaming format to comply with specification - Replace json.Encoder with json.Marshal for explicit formatting - Use explicit \n\n for all SSE messages (instead of relying on implicit newlines) - Change %v to %s format specifier for proper string formatting - Fix error message streaming to include proper SSE format - Ensure consistency between chat.go and completion.go endpoints Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> * Add proper error handling for JSON marshal failures in streaming - Handle json.Marshal errors explicitly in error response paths - Add fallback simple error message if marshal fails - Prevents sending 'data: <nil>' on marshal failures - Addresses code review feedback Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> * Fix SSE streaming format to comply with specification Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> * Fix finish_reason field to use pointer for proper null handling - Change FinishReason from string to string in Choice schema - Streaming chunks now omit finish_reason (null) instead of empty string - Final chunks properly set finish_reason to "stop", "tool_calls", etc. - Remove empty content from initial streaming chunks (only send role) - Final streaming chunk sends empty delta with finish_reason - Addresses OpenAI API compliance issues causing client failures Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> Improve code consistency for string pointer creation - Use consistent pattern: declare variable then take address - Remove inline anonymous function for better readability - Addresses code review feedback Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> * Move common finish reasons to constants - Create constants.go with FinishReasonStop, FinishReasonToolCalls, FinishReasonFunctionCall - Replace all string literals with constants in chat.go, completion.go, realtime.go - Improves code maintainability and prevents typos Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> * Make it build Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fix finish_reason to always be present with null or string value - Remove omitempty from FinishReason field in Choice struct - Explicitly set FinishReason to nil for all streaming chunks - Ensures finish_reason appears as null in JSON for streaming chunks - Final chunks still properly set finish_reason to "stop", "tool_calls", etc. - Complies with OpenAI API specification example Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@localai.io>	2025-11-09 22:00:27 +01:00
Ettore Di Giacinto	02cc8cbcaa	feat(llama.cpp): consolidate options and respect tokenizer template when enabled (#7120 ) * feat(llama.cpp): expose env vars as options for consistency This allows to configure everything in the YAML file of the model rather than have global configurations Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(llama.cpp): respect usetokenizertemplate and use llama.cpp templating system to process messages Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * WIP Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Detect template exists if use tokenizer template is enabled Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Better recognization of chat Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixes to support tool calls while using templates from tokenizer Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop template guessing, fix passing tools to tokenizer Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Extract grammar and other options from chat template, add schema struct Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * WIP Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * WIP Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Automatically set use_jinja Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Cleanups, identify by default gguf models for chat Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-11-07 21:23:50 +01:00
Ettore Di Giacinto	79247a5d17	Clarify note about DMGs not being signed by Apple Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-11-04 12:09:28 +01:00
Ettore Di Giacinto	46b7a4c5f2	Add macOS DMG download information Added download link and note for macOS DMG installation. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-11-04 12:09:07 +01:00
Ettore Di Giacinto	436e2d91d0	Enhance overview with Docker and installer details Added Docker instructions and clarified one-liner installer for Linux. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-11-04 12:08:03 +01:00
Ettore Di Giacinto	a86fdc4087	Update binaries.md with macOS download instructions Added download instructions for macOS DMG file and updated command for Linux and macOS. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-11-04 12:06:56 +01:00
LocalAI [bot]	e485bdf9ab	docs: ⬆️ update docs version mudler/LocalAI (#6996 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-11-01 21:08:08 +00:00
Ettore Di Giacinto	238aad666e	chore(deps): bump cogito (#6785 ) chore(deps): Bump cogito Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-10-27 10:07:31 +01:00
Chakib Benziane	32c0ab3a7f	fix: properly terminate llama.cpp kv_overrides array with empty key + updated doc (#6672 ) * fix: properly terminate kv_overrides array with empty key The llama model loading function expects KV overrides to be terminated with an empty key (key[0] == 0). Previously, the kv_overrides vector was not being properly terminated, causing an assertion failure. This commit ensures that after parsing all KV override strings, we add a final terminating entry with an empty key to satisfy the C-style array termination requirement. This fixes the assertion error and allows the model to load correctly with custom KV overrides. Fixes #6643 - Also included a reference to the usage of the `overrides` option in the advanced-usage section. Signed-off-by: blob42 <contact@blob42.xyz> * doc: document the `overrides` option --------- Signed-off-by: blob42 <contact@blob42.xyz>	2025-10-23 09:31:55 +02:00
Ettore Di Giacinto	a22f6a499d	feat(mcp): add planning and reevaluation (#6541 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-10-18 18:26:32 +02:00
Ettore Di Giacinto	e963e16bc5	Remove model size guidance from FAQ Removed redundant information about model sizes in the WebUI. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-10-10 21:50:56 +02:00
Ettore Di Giacinto	1e9b115251	chore(docs): enhancements and clarifications (#6433 ) chore(docs): Small enhancements Fixes: https://github.com/mudler/LocalAI/issues/6250 Relates to: https://github.com/mudler/LocalAI/issues/6251 Fixes: https://github.com/mudler/LocalAI/issues/6249 Fixes: https://github.com/mudler/LocalAI/issues/6250 Fixes: https://github.com/mudler/LocalAI/issues/6253 Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-10-10 21:31:11 +02:00
Ettore Di Giacinto	cb0ed55d89	feat(neutts): add backend (#6404 ) * feat(neutts): add backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(ci): add images to CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(gallery): add Neutts Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Make it work with quantized versions Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * Apply suggestion from @mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-10-09 21:51:28 +02:00
Ettore Di Giacinto	c38564e22c	Update docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-10-07 16:25:03 +02:00
Ettore Di Giacinto	183559bb98	chore(docs): add MCP example (#6405 ) docs update Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-10-07 11:42:28 +02:00
Ettore Di Giacinto	85e27ec74c	feat: add agent options to model config (#6383 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-10-05 21:54:04 +02:00
Ettore Di Giacinto	698205a2f3	Add links to Awesome MCPs and MCPs by mudler Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-10-05 21:27:44 +02:00
Ettore Di Giacinto	930553ef60	Update mcp.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-10-05 18:21:02 +02:00
Ettore Di Giacinto	60b6472fa0	feat: Add Agentic MCP support with a new chat/completion endpoint (#6381 ) * WIP - add endpoint Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Rename Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Wire the Completion API Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Try to make it functional Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Almost functional Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Bump golang versions used in tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add description of the tool Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Make it working Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small optimizations Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Cleanup/refactor Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-10-05 17:51:41 +02:00
LocalAI [bot]	530c174fd3	docs: ⬆️ update docs version mudler/LocalAI (#6378 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-10-03 23:26:09 +02:00
LocalAI [bot]	357bf571a3	docs: ⬆️ update docs version mudler/LocalAI (#6318 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-09-21 08:40:00 +02:00
LocalAI [bot]	f7f26b8efa	docs: ⬆️ update docs version mudler/LocalAI (#6315 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-09-20 09:41:58 +02:00
LocalAI [bot]	d3c5c02837	docs: ⬆️ update docs version mudler/LocalAI (#6307 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-09-18 23:48:02 +02:00
LocalAI [bot]	542f07ab2d	docs: ⬆️ update docs version mudler/LocalAI (#6305 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-09-17 21:06:50 +00:00
Gianluca Boiano	d0e99562af	chore(aio): upgrade minicpm-v model to latest 4.5 (#6262 ) chore(aio): upgrade vision model to MiniCPM-V 4.5 Signed-off-by: Gianluca Boiano <morf3089@gmail.com>	2025-09-14 15:04:58 +02:00
Mauro Morales	59311d8b1e	Point to LocalAI-examples repo for llava (#6241 ) Signed-off-by: Mauro Morales <contact@mauromorales.com>	2025-09-09 16:40:55 +02:00
Ettore Di Giacinto	0b528458d8	chore(docs): add MacOS dmg download button (#6233 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-09-09 00:19:37 +02:00
Ettore Di Giacinto	e905e90dd7	Add MLX-audio entry to compatibility table Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-09-08 09:54:01 +02:00
Aliz Fara	9911ec84a3	Fix Typos in Docs (#6204 ) Signed-off-by: alizfara112 <alizfaraafa@gmail.com>	2025-09-05 22:11:21 +02:00
LocalAI [bot]	326f6e5ccb	docs: ⬆️ update docs version mudler/LocalAI (#6201 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-09-04 21:03:02 +00:00
Ettore Di Giacinto	43e0437db6	Revise GPU usage recommendations in documentation Updated recommendations for GPU usage on Xorg. Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-09-01 22:20:41 +02:00
Ettore Di Giacinto	195aa22e77	chore(docs): update list of supported backends (#6134 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-08-24 20:09:19 +02:00
Ettore Di Giacinto	c899e90277	Update image-generation.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-08-20 10:37:11 +02:00
LocalAI [bot]	b70ee45fff	docs: ⬆️ update docs version mudler/LocalAI (#6046 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-08-12 22:05:50 +02:00
lnnt	7d41551e10	docs: update links in advanced-usage and models documentation (#5994 ) * docs: update links in advanced-usage and models documentation * docs: update links in advanced-usage and models documentation	2025-08-08 10:23:42 +02:00
LocalAI [bot]	e83652489c	docs: ⬆️ update docs version mudler/LocalAI (#5967 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-08-04 21:00:23 +00:00
LocalAI [bot]	a1e1942d83	docs: ⬆️ update docs version mudler/LocalAI (#5956 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-08-01 22:14:23 +02:00
Dedy F. Setyawan	787302b204	fix(docs): Improve responsiveness of tables (#5954 ) Signed-off-by: Dedy F. Setyawan <dedyfajars@gmail.com>	2025-08-01 22:13:53 +02:00
Richard Palethorpe	c07bc55fee	fix(intel): Set GPU vendor on Intel images and cleanup (#5945 ) Signed-off-by: Richard Palethorpe <io@richiejp.com>	2025-07-31 19:44:46 +02:00
LocalAI [bot]	9d7ec09ec0	docs: ⬆️ update docs version mudler/LocalAI (#5929 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-07-28 21:03:44 +00:00
Ettore Di Giacinto	949e5b9be8	feat(rfdetr): add object detection API (#5923 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-07-27 22:02:51 +02:00
LocalAI [bot]	078c22f485	docs: ⬆️ update docs version mudler/LocalAI (#5920 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-07-26 20:58:54 +00:00
Ettore Di Giacinto	6ef3852de5	chore(docs): fixup tag Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-07-26 21:25:07 +02:00
Ettore Di Giacinto	a8057b952c	fix(cuda): be consistent with image tag naming (#5916 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-07-26 08:30:59 +02:00
Ettore Di Giacinto	fd5c1d916f	chore(docs): add documentation on backend detection override (#5915 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-07-26 08:18:31 +02:00
LocalAI [bot]	a760f7ff39	docs: ⬆️ update docs version mudler/LocalAI (#5912 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-07-25 22:15:16 +02:00
Ettore Di Giacinto	facf7625f3	fix(vulkan): use correct image suffix (#5911 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-07-25 19:20:20 +02:00
Ettore Di Giacinto	3973e6e5da	fix(install.sh): update to use the new binary naming (#5903 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-07-25 10:43:22 +02:00
Ettore Di Giacinto	deda3a4972	Update build documentation Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-07-24 22:53:08 +02:00
Ettore Di Giacinto	a28f27604a	Update backends.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-07-24 16:18:25 +02:00
Nathaniel Hyson	4db1b80278	Update quickstart.md (#5898 ) Fixed spelling mistake Signed-off-by: Nathaniel Hyson <Shinrai@users.noreply.github.com>	2025-07-24 15:04:02 +02:00
Ettore Di Giacinto	5f7ece3e94	fix(p2p): adapt to backend changes, general improvements (#5889 ) The binary is now named "llama-cpp-rpc-server" for p2p workers. We also decrease the default token rotation interval, in this way peer discovery is much more responsive. Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-07-23 12:40:32 +02:00
Ettore Di Giacinto	98e5291afc	feat: refactor build process, drop embedded backends (#5875 ) * feat: split remaining backends and drop embedded backends - Drop silero-vad, huggingface, and stores backend from embedded binaries - Refactor Makefile and Dockerfile to avoid building grpc backends - Drop golang code that was used to embed backends - Simplify building by using goreleaser Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(gallery): be specific with llama-cpp backend templates Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(docs): update Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(ci): minor fixes Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore: drop all ffmpeg references Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: run protogen-go Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Always enable p2p mode Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update gorelease file Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(stores): do not always load Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fix linting issues Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Simplify Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Mac OS fixup Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-07-22 16:31:04 +02:00
Dedy F. Setyawan	a1d061c835	fix(docs): Resolve logo overlap on tablet view (#5853 ) * fix(docs): Resolve logo overlap on tablet view Signed-off-by: Dedy F. Setyawan <dedyfajars@gmail.com> * fix(docs): Adjust header logo size Signed-off-by: Dedy F. Setyawan <dedyfajars@gmail.com> * refactor(docs): Rework header logo sizing implementation Signed-off-by: Dedy F. Setyawan <dedyfajars@gmail.com> --------- Signed-off-by: Dedy F. Setyawan <dedyfajars@gmail.com>	2025-07-18 15:55:44 +02:00
Ettore Di Giacinto	7e1f2657d5	Update GPU-acceleration.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-07-06 19:03:34 +02:00
LocalAI [bot]	df7ed49889	docs: ⬆️ update docs version mudler/LocalAI (#5781 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-07-02 22:45:21 +00:00
LocalAI [bot]	61376c0fa7	docs: ⬆️ update docs version mudler/LocalAI (#5775 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-07-01 22:45:24 +00:00
Dedy F. Setyawan	9f957d547d	fix(docs): Improve Header Responsiveness - Hide "Star us on GitHub!" on Mobile (#5770 )	2025-07-01 12:15:16 +02:00
LocalAI [bot]	f8ff6fa1fd	docs: ⬆️ update docs version mudler/LocalAI (#5752 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-06-28 22:17:49 +02:00
LocalAI [bot]	665562b850	docs: ⬆️ update docs version mudler/LocalAI (#5741 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-06-27 22:23:43 +02:00
Ettore Di Giacinto	e1cc7ee107	fix(ci): enable tag-latest to auto (#5738 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-06-27 18:17:01 +02:00
Ettore Di Giacinto	6644af10c6	feat: ⚠️ reduce images size and stop bundling sources (#5721 ) feat: reduce images size and stop bundling sources Do not copy sources anymore, and reduce packages of the base images by not using builder images. If needed to rebuild, just build the container image from scratch by following the docs. We will slowly try to migrate all backends to the gallery to keep the core small. This PR is a breaking change, it also sets the base folders to /models and /backends instead of /build/models and /build/backends. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-06-26 18:41:38 +02:00
Ettore Di Giacinto	7c4a2e9b85	chore(ci): ⚠️ fix latest tag by using docker meta action (#5722 ) chore(ci): fix latest tag by using docker meta action Also uniform tagging names Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-06-26 18:40:25 +02:00
kilavvy	b68d6e8088	Docs: Fix typos (#5709 ) * Update GPU-acceleration.md Signed-off-by: kilavvy <140459108+kilavvy@users.noreply.github.com> * Update image-generation.md Signed-off-by: kilavvy <140459108+kilavvy@users.noreply.github.com> --------- Signed-off-by: kilavvy <140459108+kilavvy@users.noreply.github.com>	2025-06-23 18:15:06 +02:00
Ettore Di Giacinto	3796558aeb	Update quickstart.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-06-21 20:11:57 +02:00
Ettore Di Giacinto	79abe0ad77	Drop latest references to extras images	2025-06-20 15:51:16 +02:00
Ettore Di Giacinto	8131d11d1f	Update quickstart.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-06-19 22:42:38 +02:00
LocalAI [bot]	beb01c91f3	docs: ⬆️ update docs version mudler/LocalAI (#5690 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-06-19 22:13:16 +02:00
Ettore Di Giacinto	1ccd64ff6a	chore: drop extras references from docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-06-19 22:04:28 +02:00
Ettore Di Giacinto	49d026a229	Update backends.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-06-19 19:47:09 +02:00
leopardracer	f9b968e19d	Fix Typos and Improve Clarity in GPU Acceleration Documentation (#5688 ) Update GPU-acceleration.md Signed-off-by: leopardracer <136604165+leopardracer@users.noreply.github.com>	2025-06-19 15:41:13 +02:00
Maxim Evtush	add8fc35a2	Fix Typos in Documentation and Python Comments (#5658 ) * Update istftnet.py Signed-off-by: Maxim Evtush <154841002+maximevtush@users.noreply.github.com> * Update GPU-acceleration.md Signed-off-by: Maxim Evtush <154841002+maximevtush@users.noreply.github.com> --------- Signed-off-by: Maxim Evtush <154841002+maximevtush@users.noreply.github.com>	2025-06-18 22:11:13 +02:00
Ettore Di Giacinto	80b3139fa0	Update landing.yaml Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-06-18 19:48:17 +02:00
Ettore Di Giacinto	867db3f888	chore(docs): add backend url Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-06-17 22:35:21 +02:00
Ettore Di Giacinto	b79aa31398	chore: move backends docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-06-17 22:26:40 +02:00
FT	1f29b5f38e	Fix Typos and Improve Documentation Clarity (#5648 ) * Update p2p.go Signed-off-by: FT <140458077+zeevick10@users.noreply.github.com> * Update GPU-acceleration.md Signed-off-by: FT <140458077+zeevick10@users.noreply.github.com> --------- Signed-off-by: FT <140458077+zeevick10@users.noreply.github.com>	2025-06-15 16:04:44 +02:00
Ettore Di Giacinto	2d64269763	feat: Add backend gallery (#5607 ) * feat: Add backend gallery This PR add support to manage backends as similar to models. There is now available a backend gallery which can be used to install and remove extra backends. The backend gallery can be configured similarly as a model gallery, and API calls allows to install and remove new backends in runtime, and as well during the startup phase of LocalAI. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add backends docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * wip: Backend Dockerfile for python backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: drop extras images, build python backends separately Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixup on all backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * test CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Tweaks Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Drop old backends leftovers Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixup CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Move dockerfile upper Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fix proto Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Feature dropped for consistency - we prefer model galleries Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add missing packages in the build image Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * exllama is ponly available on cublas Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * pin torch on chatterbox Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups to index Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Debug CI * Install accellerators deps Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add target arch * Add cuda minor version Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Use self-hosted runners Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * ci: use quay for test images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups for vllm and chatterbox Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Small fixups on CI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chatterbox is only available for nvidia Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Simplify CI builds Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Adapt test, use qwen3 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * chore(model gallery): add jina-reranker-v1-tiny-en-gguf Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(gguf-parser): recover from potential panics that can happen while reading ggufs with gguf-parser Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Use reranker from llama.cpp in AIO images Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Limit concurrent jobs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-06-15 14:56:52 +02:00
Gavin Mogan	63116a2c6a	docs: Update docs metadata headers so when mentioned on slack it doesn't say hugo (#5642 ) Update docs metadata headers so when mentioned on slack it doesn't say hugo Signed-off-by: Gavin Mogan <github@gavinmogan.com>	2025-06-13 19:54:57 +02:00
Gavin Mogan	cbd61dccd4	fix(install.sh): vulkan docker tag (#5589 ) vulkan docker tag is not prefixed with gpu ``` regctl tag ls localai/localai \| grep 2.29 \| grep vulkan v2.29.0-vulkan ``` Signed-off-by: Gavin Mogan <github@gavinmogan.com>	2025-06-05 08:12:16 +02:00
David Thole	38c5d16b57	feat(docs): updating the documentation on fine tuning and advanced guide. (#5420 ) updating the documentation on fine tuning and advanced guide. This mirrors how modern version of llama.cpp operate	2025-05-21 19:11:00 +02:00
omahs	0f365ac204	fix: typos (#5376 ) Signed-off-by: omahs <73983677+omahs@users.noreply.github.com>	2025-05-16 12:45:48 +02:00
Ettore Di Giacinto	e52c66c76e	chore(docs/install.sh): image changes (#5354 ) chore(docs): image changes Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-05-14 19:28:30 +02:00
LocalAI [bot]	029f97c2a2	docs: ⬆️ update docs version mudler/LocalAI (#5363 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-05-14 01:54:34 +00:00
Ettore Di Giacinto	0e8af53a5b	chore: update quickstart Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-05-01 22:36:33 +02:00
Simon Redman	88857696d4	fix(CUDA): Add note for how to run CUDA with SELinux (#5259 ) * Add note to help run nvidia containers with SELinux * Use correct CUDA container references as noted in the dockerhub overview * Clean trailing whitespaces	2025-04-28 09:00:52 +02:00
Mohit Gaur	b6e3dc5f02	docs: update docs for DisableWebUI flag (#5256 ) Signed-off-by: Mohit Gaur <56885276+Mohit-Gaur@users.noreply.github.com>	2025-04-27 16:02:02 +02:00
Alessandro Pirastru	69667521e2	fix(install/gpu):Fix docker not being able to leverage the GPU on systems that have SELinux Enforced (#5252 ) * Update installation script for improved compatibility and clarity - Renamed VERSION to LOCALAI_VERSION to avoid conflicts with system variables. - Enhanced NVIDIA and CUDA repository installation for DNF5 compatibility. - Adjusted default Fedora version handling for CUDA installation. - Updated Docker image tag handling to use LOCALAI_VERSION consistently. - Improved logging messages for repository and LocalAI binary downloads. - Added a temporary bypass for nvidia-smi installation on Fedora Cloud Edition. * feat: Add SELinux configuration for NVIDIA GPU support in containers - Introduced `enable_selinux_container_booleans` function to handle SELinux configuration changes for GPU access. - Included user confirmation prompt to enable SELinux `container_use_devices` boolean due to security implications. - Added NVIDIA Container Runtime to Docker runtimes and restarted Docker to ensure proper GPU support. - Applied SELinux adjustments conditionally for Fedora, RHEL, CentOS, Rocky, and openSUSE distributions. Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com> * fix: Correct SELinux boolean parsing and add loop break - Fixed incorrect parsing of `container_use_devices` boolean by changing the awk field from `$2` to `$3` to retrieve the correct value. - Added a `break` statement after enabling the SELinux boolean to prevent unnecessary loop iterations after user prompt. Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com> * fix: typo in install.sh Signed-off-by: Alessandro Pirastru <57262788+Bloodis94@users.noreply.github.com> --------- Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com> Signed-off-by: Alessandro Pirastru <57262788+Bloodis94@users.noreply.github.com>	2025-04-27 16:01:29 +02:00
Simon Redman	a65e012aa2	docs(Vulkan): Add GPU docker documentation for Vulkan (#5255 ) Add GPU docker documentation for Vulkan	2025-04-27 09:20:26 +02:00
Ettore Di Giacinto	2c9279a542	feat(video-gen): add endpoint for video generation (#5247 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-04-26 18:05:01 +02:00
Alessandro Pirastru	a0244e3fb4	feat(install): added complete process for installing nvidia drivers on fedora without pulling X11 (#5246 ) * Update installation script for improved compatibility and clarity - Renamed VERSION to LOCALAI_VERSION to avoid conflicts with system variables. - Enhanced NVIDIA and CUDA repository installation for DNF5 compatibility. - Adjusted default Fedora version handling for CUDA installation. - Updated Docker image tag handling to use LOCALAI_VERSION consistently. - Improved logging messages for repository and LocalAI binary downloads. - Added a temporary bypass for nvidia-smi installation on Fedora Cloud Edition. * Enhance log functions with ANSI color formatting - Added ANSI escape codes for improved log styling: light blue for info, orange for warnings, and red for errors. - Updated all log functions (`info`, `warn`, `fatal`) to include bold and colored output. Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com> * feat: Enhance log functions with ANSI color formatting - Added ANSI escape codes for improved log styling: light blue for info, orange for warnings, and red for errors. - Updated all log functions (`info`, `warn`, `fatal`) to include bold and colored output. Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com> * chore: ⬆️ Update ggml-org/llama.cpp to `ecda2ec4b347031a9b8a89ee2efc664ce63f599c` (#5238) ⬆️ Update ggml-org/llama.cpp Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> * fix(stablediffusion-ggml): Build with DSD CUDA, HIP and Metal flags (#5236) Signed-off-by: Richard Palethorpe <io@richiejp.com> * feat(install): enhance script with choice functions and logs - Added custom `choice_info`, `choice_warn`, and `choice_fatal` functions for interactive input logging. - Adjusted Docker volume creation message for better clarity. - Included NVIDIA driver check log for improved feedback to users. - Added consistent logging before starting LocalAI Docker containers across configurations. Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com> * feat(install): add Fedora NVIDIA driver installation option - Introduced a new function to install NVIDIA kernel drivers on Fedora using akmod packages. - Added user prompt to choose between installing drivers automatically or exiting for manual setup. - Integrated the new function into the existing Fedora-specific CUDA toolkit installation workflow. Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com> * fix(install): correct repository ID for DNF5 configuration - Update repository ID from 'nome-repo' to 'nvidia-cuda' for DNF5. - Ensures the correct repository name matches expected configuration. - Fix prevents potential misconfiguration during installation process. Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com> * feat(install): enhance NVIDIA driver handling on Fedora - fixed `install_cuda_driver_yum` function call in `install_fedora_nvidia_kernel_drivers` - Added `cuda-toolkit` for Fedora installations, as recommended by RPM Fusion. - Adjusted driver repository commands for compatibility with DNF5. - Improved URL and version handling for package manager installations. Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com> * Refactor NVIDIA driver installation process in install.sh - Removed redundant empty lines for cleaner formatting. - Standardized URL formatting by removing unnecessary quotes around URLs. - Reverted logic by removing Fedora-specific exclusions for cuda-toolkit and using `cuda-drivers` universally. - Refined repository addition for `dnf` by explicitly setting `id` and `name` parameters for clarity and accuracy. - Fixed minor formatting inconsistencies in parameter passing. Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com> * feat: Update NVIDIA module installation warning in install script - Clarified that Akmod installation may inhibit the reboot command. - Added a cautionary note to the warning to inform users of potential risks. Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com> * Update NVIDIA driver installation warning message - Clarify prerequisites by noting the need for rpmfusion free/nonfree repos. - Improve formatting of the warning box for better readability. - Inform users that the script will install missing repos if necessary. Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com> --------- Signed-off-by: Alessandro Pirastru <alessandro.pirastru.94@gmail.com> Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Signed-off-by: Richard Palethorpe <io@richiejp.com> Co-authored-by: LocalAI [bot] <139863280+localai-bot@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> Co-authored-by: Richard Palethorpe <io@richiejp.com>	2025-04-26 09:44:40 +02:00
Alessandro Pirastru	1ae0b896fa	fix: installation script compatibility with fedora 41 and later, fedora headless unclear errors (#5239 ) Update installation script for improved compatibility and clarity - Renamed VERSION to LOCALAI_VERSION to avoid conflicts with system variables. - Enhanced NVIDIA and CUDA repository installation for DNF5 compatibility. - Adjusted default Fedora version handling for CUDA installation. - Updated Docker image tag handling to use LOCALAI_VERSION consistently. - Improved logging messages for repository and LocalAI binary downloads. - Added a temporary bypass for nvidia-smi installation on Fedora Cloud Edition.	2025-04-24 09:34:25 +02:00
Ettore Di Giacinto	cc3df759f8	chore(docs): improve installer.sh docs (#5232 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-04-21 22:11:43 +02:00
Ettore Di Giacinto	61cc76c455	chore(autogptq): drop archived backend (#5214 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-04-19 15:52:29 +02:00
Ettore Di Giacinto	72693b3917	feat(install.sh): allow to uninstall with --uninstall (#5202 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-04-17 16:32:23 +02:00
LocalAI [bot]	161c9fe2db	docs: ⬆️ update docs version mudler/LocalAI (#5191 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-04-16 22:13:49 +02:00
Ettore Di Giacinto	7547463f81	Update quickstart.md Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2025-04-16 08:48:55 +02:00
Ettore Di Giacinto	56f44d448c	chore(docs): decrease logo size, minor enhancements Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-04-15 22:00:51 +02:00
Ettore Di Giacinto	4f239bac89	feat: rebrand - LocalAGI and LocalRecall joins the LocalAI stack family (#5159 ) * wip Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update lotusdocs and hugo Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * rephrasing Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Latest fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Adjust readme section Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-04-15 17:51:24 +02:00
LocalAI [bot]	f09b33f2ef	docs: ⬆️ update docs version mudler/LocalAI (#5104 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-03-31 22:48:03 +02:00
dependabot[bot]	d88ec1209e	chore(deps): Bump docs/themes/hugo-theme-relearn from `4a4b60e` to `9a020e7` (#4988 ) chore(deps): Bump docs/themes/hugo-theme-relearn Bumps [docs/themes/hugo-theme-relearn](https://github.com/McShelby/hugo-theme-relearn) from `4a4b60e` to `9a020e7`. - [Release notes](https://github.com/McShelby/hugo-theme-relearn/releases) - [Commits](`4a4b60ef04...9a020e7ead`) --- updated-dependencies: - dependency-name: docs/themes/hugo-theme-relearn dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-03-11 09:39:04 +01:00
dependabot[bot]	8664b1c7a2	chore(deps): Bump docs/themes/hugo-theme-relearn from `02bba0f` to `4a4b60e` (#4934 ) chore(deps): Bump docs/themes/hugo-theme-relearn Bumps [docs/themes/hugo-theme-relearn](https://github.com/McShelby/hugo-theme-relearn) from `02bba0f` to `4a4b60e`. - [Release notes](https://github.com/McShelby/hugo-theme-relearn/releases) - [Commits](`02bba0f199...4a4b60ef04`) --- updated-dependencies: - dependency-name: docs/themes/hugo-theme-relearn dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-03-03 19:56:41 +00:00
dependabot[bot]	5a5f3a899a	chore(deps): Bump docs/themes/hugo-theme-relearn from `66bc366` to `02bba0f` (#4898 ) chore(deps): Bump docs/themes/hugo-theme-relearn Bumps [docs/themes/hugo-theme-relearn](https://github.com/McShelby/hugo-theme-relearn) from `66bc366` to `02bba0f`. - [Release notes](https://github.com/McShelby/hugo-theme-relearn/releases) - [Commits](`66bc366c47...02bba0f199`) --- updated-dependencies: - dependency-name: docs/themes/hugo-theme-relearn dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-02-25 09:50:46 +01:00
Ettore Di Giacinto	ac4991b069	chore(docs): update sponsor logo Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-02-20 15:31:41 +01:00
Ettore Di Giacinto	f3ae94ca70	chore: update Image generation docs and examples (#4841 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-02-17 16:51:06 +01:00
LocalAI [bot]	20119fc580	docs: ⬆️ update docs version mudler/LocalAI (#4834 ) ⬆️ Update docs version mudler/LocalAI Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2025-02-15 22:45:11 +00:00
Ettore Di Giacinto	09941c0bfb	chore(docs): update license year Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-02-15 18:17:15 +01:00
Ettore Di Giacinto	5f64cc6328	Revert "chore(deps): Bump docs/themes/lotusdocs from `f5785a2` to `975da91`" (#4808 ) Revert "chore(deps): Bump docs/themes/lotusdocs from `f5785a2` to `975da91` (…" This reverts commit `e57b750ca3`.	2025-02-11 10:05:57 +01:00
dependabot[bot]	e57b750ca3	chore(deps): Bump docs/themes/lotusdocs from `f5785a2` to `975da91` (#4801 ) Bumps [docs/themes/lotusdocs](https://github.com/colinwilson/lotusdocs) from `f5785a2` to `975da91`. - [Release notes](https://github.com/colinwilson/lotusdocs/releases) - [Commits](`f5785a2399...975da91e83`) --- updated-dependencies: - dependency-name: docs/themes/lotusdocs dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-02-10 22:27:14 +00:00
Ettore Di Giacinto	7f90ff7aec	chore(llama-ggml): drop deprecated backend (#4775 ) The GGML format is now dead, since in the next version of LocalAI we already bring many breaking compatibility changes, taking the occasion also to drop ggml support (pre-gguf). Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-02-06 18:36:23 +01:00
Ettore Di Giacinto	28a1310890	chore(docs): enhance visibility Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-02-05 19:50:32 +01:00
Ettore Di Giacinto	2a702e9ca4	chore(docs): small updates Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-02-05 19:49:11 +01:00
Ettore Di Giacinto	3ecaea1b6e	chore(docs): update sponsors in the website Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-02-05 19:41:55 +01:00
dependabot[bot]	96cb407ee0	chore(deps): Bump docs/themes/hugo-theme-relearn from `5bcb9fe` to `66bc366` (#4750 ) chore(deps): Bump docs/themes/hugo-theme-relearn Bumps [docs/themes/hugo-theme-relearn](https://github.com/McShelby/hugo-theme-relearn) from `5bcb9fe` to `66bc366`. - [Release notes](https://github.com/McShelby/hugo-theme-relearn/releases) - [Commits](`5bcb9fe5e6...66bc366c47`) --- updated-dependencies: - dependency-name: docs/themes/hugo-theme-relearn dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-02-04 08:57:19 +01:00
Ettore Di Giacinto	af41436f1b	fix(tests): pin to branch for config used in tests (#4721 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2025-01-31 09:57:58 +01:00

1 2 3 4 5 ...

502 commits