LocalAI

mirror of https://github.com/mudler/LocalAI synced 2026-05-24 09:28:23 +00:00

Ettore Di Giacinto 3280b9a287
fix(distributed): per-replica backend logs (store aggregation + UI)

The multi-replica refactor (PR #9583) changed the worker's process key from `modelID` to `modelID#replicaIndex`, but the BackendLogStore kept the bare-modelID lookup. Result: every distributed deployment lost backend logs in the Nodes UI, single-replica ones included, since even the default capacity of 1 produces a `#0` suffix.

Two changes wired together:

* pkg/model: BackendLogStore.GetLines/Subscribe now treat a modelID without `#` as a model prefix and merge across all `modelID#N` replica buffers (timestamp-sorted for GetLines; fan-in for Subscribe). Calls with a full `modelID#N` key resolve exactly. ListModels strips replica suffixes and deduplicates so the listing surfaces one entry per loaded model.
* react-ui: per-replica log streams as the default. Loaded Models table disambiguates each row with a `rep N` pill (only when the node hosts >1 replica of a model). Each row's "View logs" link routes to the per-replica process key so operators see only that replica's output. The logs page renders the replica context as a chip in the title and surfaces a segmented control (`Replica 0 / 1 / … / All merged`) when the model has multiple replicas; the merged segment uses the bare-modelID URL (delegating to the store's prefix aggregation) for the side-by-side comparison case. Single-replica deployments see no extra UI.

Tests added first (TDD): the regression set in backend_log_store_test.go reproduces the bug at the exact failure point: GetLines/ListModels/Subscribe assertions all fail against the broken code and all pass against the fix. TestSubscribe_PerReplicaFilter pins the exact-key path so a future change can't silently break it.

Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Assisted-by: claude-code:opus-4-7 [Edit] [Skill:critique] [Skill:audit] [Skill:polish] [Skill:distill]		2026-04-27 20:55:24 +00:00
audio	feat: add distributed mode (#9124)	2026-03-30 00:47:27 +02:00
concurrency	feat: add distributed mode (#9124)	2026-03-30 00:47:27 +02:00
downloader	feat: add biometrics UI (#9524)	2026-04-24 08:50:34 +02:00
functions	feat: add distributed mode (#9124)	2026-03-30 00:47:27 +02:00
grpc	feat: voice recognition (#9500)	2026-04-23 12:07:14 +02:00
huggingface-api	fix(importer): emit all shards for multi-part GGUF models (#9513)	2026-04-23 15:00:02 +02:00
model	fix(distributed): per-replica backend logs (store aggregation + UI)	2026-04-27 20:55:24 +00:00
oci	feat: backend versioning, upgrade detection and auto-upgrade (#9315)	2026-04-11 22:31:15 +02:00
reasoning	fix(reasoning): suppress partial tag tokens during autoparser warm-up	2026-04-04 20:45:57 +00:00
sanitize	feat: add distributed mode (#9124)	2026-03-30 00:47:27 +02:00
signals	feat: add distributed mode (#9124)	2026-03-30 00:47:27 +02:00
sound	feat: add distributed mode (#9124)	2026-03-30 00:47:27 +02:00
store	chore: fix go.mod module (#2635)	2024-06-23 08:24:36 +00:00
system	feat(importer): expand importer flow to almost all backends (#9466)	2026-04-22 22:42:37 +02:00
utils	feat: Add Sherpa ONNX backend for ASR and TTS (#8523)	2026-04-24 14:40:06 +02:00
vram	feat(api): Allow coding agents to interactively discover how to control and configure LocalAI (#9084)	2026-04-04 15:14:35 +02:00
xio	feat(ui): allow to cancel ops (#7264)	2025-11-13 18:41:47 +01:00
xsync	chore: fix go.mod module (#2635)	2024-06-23 08:24:36 +00:00
xsysinfo	fix(distributed): correct VRAM/RAM reporting on NVIDIA unified-memory hosts (#9545)	2026-04-24 22:02:23 +02:00