mirror of
https://github.com/mudler/LocalAI
synced 2026-05-24 09:28:23 +00:00
- Handle both StructuredOutputsParams (vLLM latest) and GuidedDecodingParams (vLLM <=0.8.x) with graceful fallback - Use the correct SamplingParams field name for each version (structured_outputs vs guided_decoding) - Use 'json' parameter (not 'json_schema') matching both APIs - Re-add grammar (GBNF/BNF) passthrough — both vLLM APIs accept a 'grammar' parameter handled by xgrammar which supports GBNF - Priority: JSONSchema > json_object > Grammar Ref: #6857 Signed-off-by: eureka928 <meobius123@gmail.com> |
||
|---|---|---|
| .. | ||
| backend.py | ||
| install.sh | ||
| Makefile | ||
| package.sh | ||
| README.md | ||
| requirements-after.txt | ||
| requirements-cpu-after.txt | ||
| requirements-cpu.txt | ||
| requirements-cublas12-after.txt | ||
| requirements-cublas12.txt | ||
| requirements-hipblas-after.txt | ||
| requirements-hipblas.txt | ||
| requirements-install.txt | ||
| requirements-intel-after.txt | ||
| requirements-intel.txt | ||
| requirements.txt | ||
| run.sh | ||
| test.py | ||
| test.sh | ||
Creating a separate environment for the vllm project
make vllm