LocalAI/core/http/endpoints/openai
Ettore Di Giacinto 031a36c995
feat: inferencing default, automatic tool parsing fallback and wire min_p (#9092)
* feat: wire min_p

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* feat: inferencing defaults

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* chore(refactor): re-use iterative parser

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* chore: generate automatically inference defaults from unsloth

Instead of trying to re-invent the wheel and maintain here the inference
defaults, prefer to consume unsloth ones, and contribute there as
necessary.

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* chore: apply defaults also to models installed via gallery

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* chore: be consistent and apply fallback to all endpoint

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2026-03-22 00:57:15 +01:00
..
types feat(realtime): WebRTC support (#8790) 2026-03-13 21:37:15 +01:00
chat.go feat: inferencing default, automatic tool parsing fallback and wire min_p (#9092) 2026-03-22 00:57:15 +01:00
chat_test.go chore: refactor endpoints to use same inferencing path, add automatic retrial mechanism in case of errors (#9029) 2026-03-16 21:31:02 +01:00
completion.go feat(functions): add peg-based parsing and allow backends to return tool calls directly (#8838) 2026-03-08 22:21:57 +01:00
constants.go fix(api): SSE streaming format to comply with specification (#7182) 2025-11-09 22:00:27 +01:00
edit.go feat(functions): add peg-based parsing and allow backends to return tool calls directly (#8838) 2026-03-08 22:21:57 +01:00
embeddings.go chore(refactor): move logging to common package based on slog (#7668) 2025-12-21 19:33:13 +01:00
image.go Fix image upload processing and img2img pipeline in diffusers backend (#8879) 2026-03-11 08:05:50 +01:00
image_test.go Fix image upload processing and img2img pipeline in diffusers backend (#8879) 2026-03-11 08:05:50 +01:00
inference.go chore: refactor endpoints to use same inferencing path, add automatic retrial mechanism in case of errors (#9029) 2026-03-16 21:31:02 +01:00
inference_test.go chore: refactor endpoints to use same inferencing path, add automatic retrial mechanism in case of errors (#9029) 2026-03-16 21:31:02 +01:00
inpainting.go feat(UI): image generation improvements (#7804) 2025-12-31 21:59:46 +01:00
inpainting_test.go feat(realtime): WebRTC support (#8790) 2026-03-13 21:37:15 +01:00
list.go feat: add users and authentication support (#9061) 2026-03-19 21:40:51 +01:00
openai_suite_test.go Fix image upload processing and img2img pipeline in diffusers backend (#8879) 2026-03-11 08:05:50 +01:00
realtime.go feat(realtime): WebRTC support (#8790) 2026-03-13 21:37:15 +01:00
realtime_model.go feat: pass-by metadata to predict options (#8795) 2026-03-05 22:50:10 +01:00
realtime_transport.go feat(realtime): WebRTC support (#8790) 2026-03-13 21:37:15 +01:00
realtime_transport_webrtc.go feat(realtime): WebRTC support (#8790) 2026-03-13 21:37:15 +01:00
realtime_transport_ws.go feat(realtime): WebRTC support (#8790) 2026-03-13 21:37:15 +01:00
realtime_webrtc.go feat(realtime): WebRTC support (#8790) 2026-03-13 21:37:15 +01:00
transcription.go feat(api): Add transcribe response format request parameter & adjust STT backends (#8318) 2026-02-01 17:33:17 +01:00