LocalAI

mirror of https://github.com/mudler/LocalAI synced 2026-05-24 09:28:23 +00:00

Ettore Di Giacinto 8ccf5b2044 feat(speculative-sampling): allow to specify a draft model in the model config (#1052)	2023-09-14 17:44:16 +02:00

Description

This PR fixes #1013. It adds `draft_model` and `n_draft` to the model YAML config in order to load models with speculative sampling. This should be compatible as well with grammars.

example:

```yaml
backend: llama
context_size: 1024
name: my-model-name
parameters:
  model: foo-bar
n_draft: 16
draft_model: model-name
```

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

assets	feat: Update gpt4all, support multiple implementations in runtime (#472)	2023-06-01 23:38:52 +02:00
backend	feat(speculative-sampling): allow to specify a draft model in the model config (#1052)	2023-09-14 17:44:16 +02:00
gallery	feat: Model Gallery Endpoint Refactor / Mutable Galleries Endpoints (#991)	2023-09-02 09:00:44 +02:00
grammar	feat: update integer, number and string rules - allow primitives as root types (#862)	2023-08-03 23:32:30 +02:00
grpc	feat(speculative-sampling): allow to specify a draft model in the model config (#1052)	2023-09-14 17:44:16 +02:00
langchain	feat: add LangChainGo Huggingface backend (#446)	2023-06-01 12:00:06 +02:00
model	feat: backend monitor shutdown endpoint, process based (#938)	2023-08-23 18:38:37 +02:00
stablediffusion	feat: support upscaled image generation with esrgan (#509)	2023-06-05 17:21:38 +02:00
utils	feat: Model Gallery Endpoint Refactor / Mutable Galleries Endpoints (#991)	2023-09-02 09:00:44 +02:00