Mirror of https://github.com/mudler/LocalAI, synced 2026-04-21 21:37:21 +00:00
The C++ PEG parser needs a few tokens before it can identify the reasoning format (e.g. `<|channel>thought\n` for Gemma 4). During this warm-up, the gRPC layer was sending raw partial tag tokens to Go, which leaked into the reasoning field.

- Clear `reply.message` in gRPC when the autoparser is active but has produced no diffs yet, matching the llama.cpp server behavior of only emitting classified output
- Prefer C++ autoparser chat deltas for reasoning/content in all streaming paths, falling back to Go-side extraction for backends without an autoparser (e.g. vLLM)
- Override the non-streaming no-tools result with chat delta content when available
- Guard `PrependThinkingTokenIfNeeded` against partial tag prefixes during streaming accumulation
- Reorder the default thinking tokens so `<|channel>thought` is checked before `<|think|>` (Gemma 4 templates contain both)
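The partial-tag guard described above can be sketched as a small prefix check: while the accumulated stream could still grow into a known thinking tag, the handler holds the text back instead of emitting it. This is a minimal illustration, not the actual LocalAI implementation; the `thinkingTags` contents and the `isPartialTagPrefix` helper name are assumptions, with the two tags and their ordering taken from the commit message.

```go
package main

import (
	"fmt"
	"strings"
)

// Known reasoning-start tags, ordered so that <|channel>thought is checked
// before <|think|>, since Gemma 4 templates contain both (per the commit).
// The exact tag set here is an assumption for illustration.
var thinkingTags = []string{"<|channel>thought\n", "<|think|>"}

// isPartialTagPrefix reports whether s is a proper prefix of some thinking
// tag, i.e. the stream could still grow into a full tag. A streaming handler
// can use this to buffer the text rather than leak a partial tag into the
// reasoning or content field.
func isPartialTagPrefix(s string) bool {
	if s == "" {
		return false
	}
	for _, tag := range thinkingTags {
		if len(s) < len(tag) && strings.HasPrefix(tag, s) {
			return true
		}
	}
	return false
}

func main() {
	fmt.Println(isPartialTagPrefix("<|chan"))    // true: could become <|channel>thought
	fmt.Println(isPartialTagPrefix("<|think|>")) // false: a complete tag, not a partial one
	fmt.Println(isPartialTagPrefix("Hello"))     // false: plain content, emit immediately
}
```

A complete tag (or any text that can no longer match a tag prefix) falls through the guard and is processed normally, which is what keeps ordinary content flowing without extra latency.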
Directory listing:

- auth
- endpoints
- middleware
- react-ui
- routes
- static
- views
- app.go
- app_test.go
- explorer.go
- http_suite_test.go
- openresponses_test.go
- render.go