LocalAI/backend/python/trl
Ettore Di Giacinto d9c1db2b87
feat: add (experimental) fine-tuning support with TRL (#9088)
* feat: add fine-tuning endpoint

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* feat(experimental): add fine-tuning endpoint and TRL support

This changeset defines new gRPC signatures for fine-tuning backends and
adds the TRL backend as the initial fine-tuning engine. The implementation
also supports exporting to GGUF and automatically importing the exported
model into LocalAI after fine-tuning.

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* commit TRL backend, stop by killing process

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* move fine-tune to generic features

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* add evals, reorder menu

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* Fix tests

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2026-03-21 02:08:02 +01:00
backend.py feat: add (experimental) fine-tuning support with TRL (#9088) 2026-03-21 02:08:02 +01:00
install.sh feat: add (experimental) fine-tuning support with TRL (#9088) 2026-03-21 02:08:02 +01:00
Makefile feat: add (experimental) fine-tuning support with TRL (#9088) 2026-03-21 02:08:02 +01:00
requirements-cpu.txt feat: add (experimental) fine-tuning support with TRL (#9088) 2026-03-21 02:08:02 +01:00
requirements-cublas12.txt feat: add (experimental) fine-tuning support with TRL (#9088) 2026-03-21 02:08:02 +01:00
requirements-cublas13.txt feat: add (experimental) fine-tuning support with TRL (#9088) 2026-03-21 02:08:02 +01:00
requirements.txt feat: add (experimental) fine-tuning support with TRL (#9088) 2026-03-21 02:08:02 +01:00
reward_functions.py feat: add (experimental) fine-tuning support with TRL (#9088) 2026-03-21 02:08:02 +01:00
run.sh feat: add (experimental) fine-tuning support with TRL (#9088) 2026-03-21 02:08:02 +01:00
test.py feat: add (experimental) fine-tuning support with TRL (#9088) 2026-03-21 02:08:02 +01:00
test.sh feat: add (experimental) fine-tuning support with TRL (#9088) 2026-03-21 02:08:02 +01:00
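
The commit notes above mention that a fine-tuning job is stopped "by killing process". A minimal sketch of that idea, assuming the training run lives in a child process, could look like the following. The `TrainingJob` class and its methods are hypothetical names used only for illustration and are not taken from backend.py; the `time.sleep` command stands in for a real trainer script.

```python
import subprocess
import sys


class TrainingJob:
    """Hypothetical wrapper that runs a training command in a child process."""

    def __init__(self, argv):
        self.argv = argv      # command line of the (assumed) trainer script
        self.proc = None      # set once the job has been started

    def start(self):
        # Launch the trainer as a separate OS process so it can be killed
        # without taking the parent server process down with it.
        self.proc = subprocess.Popen(self.argv)

    def is_running(self):
        # poll() returns None while the child is still alive.
        return self.proc is not None and self.proc.poll() is None

    def stop(self, timeout=5.0):
        # Stop the job by killing the process; no graceful checkpointing here.
        if not self.is_running():
            return
        self.proc.kill()
        # Reap the child so it does not linger as a zombie.
        self.proc.wait(timeout=timeout)


# Stand-in for a long-running trainer: a child that just sleeps.
job = TrainingJob([sys.executable, "-c", "import time; time.sleep(60)"])
job.start()
job.stop()
```

Killing the process is blunt but simple: any partially written checkpoint is left as-is, so a caller that needs resumable training would have to add its own signalling on top of this.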