mirror of
https://github.com/mudler/LocalAI
synced 2026-05-24 09:28:23 +00:00
Update the vLLM backend to support structured output: - Import GuidedDecodingParams from vllm.sampling_params - Handle JSONSchema: parse and pass as GuidedDecodingParams(json_schema=...) - Handle json_object response format: GuidedDecodingParams(json_object=True) - Fall back to Grammar (GBNF) via GuidedDecodingParams(grammar=...) - Remove phantom GuidedDecoding mapping (field doesn't exist in proto) - Fix missing 'import time' and 'import json' for load_video and schema parsing Priority: JSONSchema > json_object > Grammar (GBNF fallback) Ref: #6857 Signed-off-by: eureka928 <meobius123@gmail.com> |
||
|---|---|---|
| .. | ||
| backend.py | ||
| install.sh | ||
| Makefile | ||
| package.sh | ||
| README.md | ||
| requirements-after.txt | ||
| requirements-cpu-after.txt | ||
| requirements-cpu.txt | ||
| requirements-cublas12-after.txt | ||
| requirements-cublas12.txt | ||
| requirements-hipblas-after.txt | ||
| requirements-hipblas.txt | ||
| requirements-install.txt | ||
| requirements-intel-after.txt | ||
| requirements-intel.txt | ||
| requirements.txt | ||
| run.sh | ||
| test.py | ||
| test.sh | ||
Creating a separate environment for the vllm project
make vllm