LocalAI/pkg
Ettore Di Giacinto d9c1db2b87
feat: add (experimental) fine-tuning support with TRL (#9088)
* feat: add fine-tuning endpoint

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* feat(experimental): add fine-tuning endpoint and TRL support

This changeset defines new gRPC signatures for fine-tuning backends and
adds the TRL backend as the initial fine-tuning engine. This implementation also
supports exporting to GGUF and automatically importing the exported model into
LocalAI after fine-tuning.

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* commit TRL backend, stop by killing process

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* move fine-tune to generic features

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* add evals, reorder menu

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Fix tests

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2026-03-21 02:08:02 +01:00
audio feat(realtime): WebRTC support (#8790) 2026-03-13 21:37:15 +01:00
concurrency chore: update jobresult_test.go (#4124) 2024-11-12 08:52:18 +01:00
downloader chore(size): display size of HF models and allow to specify it from the gallery (#8907) 2026-03-09 17:38:14 +01:00
format feat(api): Add transcribe response format request parameter & adjust STT backends (#8318) 2026-02-01 17:33:17 +01:00
functions chore(deps): bump llama-cpp to 'a0bbcdd9b6b83eeeda6f1216088f42c33d464e38' (#9079) 2026-03-20 08:12:21 +01:00
grpc feat: add (experimental) fine-tuning support with TRL (#9088) 2026-03-21 02:08:02 +01:00
huggingface-api feat(hf-api): return files in nested directories (#7396) 2025-11-30 09:06:54 +01:00
langchain Remove HuggingFace backend support (#8971) 2026-03-13 01:09:30 +01:00
model feat(ui): Per model backend logs and various fixes (#9028) 2026-03-18 08:31:26 +01:00
oci feat(ui): allow to cancel ops (#7264) 2025-11-13 18:41:47 +01:00
reasoning chore: refactor endpoints to use same inferencing path, add automatic retrial mechanism in case of errors (#9029) 2026-03-16 21:31:02 +01:00
signals chore: update cogito and simplify MCP logics (#6413) 2025-10-09 12:36:45 +02:00
sound feat(realtime): WebRTC support (#8790) 2026-03-13 21:37:15 +01:00
store chore: fix go.mod module (#2635) 2024-06-23 08:24:36 +00:00
system fix: gate CUDA directory checks on GPU vendor to prevent false CUDA detection (#8942) 2026-03-12 07:53:39 +01:00
utils Add sample_rate support to TTS API via post-processing resampling (#8650) 2026-02-25 16:36:27 +01:00
vram chore(size): display size of HF models and allow to specify it from the gallery (#8907) 2026-03-09 17:38:14 +01:00
xio feat(ui): allow to cancel ops (#7264) 2025-11-13 18:41:47 +01:00
xsync chore: fix go.mod module (#2635) 2024-06-23 08:24:36 +00:00
xsysinfo fix: drop gguf VRAM estimation (now redundant) (#8325) 2026-02-01 17:33:28 +01:00