mirror of https://github.com/mudler/LocalAI
synced 2026-04-21 13:27:21 +00:00
Both ubuntu-latest and bigger-runner have inconsistent CPU baselines: some instances support the AVX-512 VNNI/BF16 instructions the prebuilt vllm 0.14.1+cpu wheel was compiled with, while others SIGILL on import of vllm.model_executor.models.registry. The libnuma packaging fix doesn't help when the wheel itself can't be loaded. FROM_SOURCE=true compiles vllm against the actual host CPU and works everywhere, but takes 30-50 minutes per run, which is too slow for a smoke test on every PR. Comment out the job for now.

The test itself is intact and passes locally; run it via `make test-extra-backend-vllm` on a host with the required SIMD baseline.

Re-enable when:
- we have a self-hosted runner label with guaranteed AVX-512 VNNI/BF16, or
- vllm publishes a CPU wheel with a wider baseline, or
- we set up a Docker layer cache that makes FROM_SOURCE acceptable.

The detect-changes vllm output, the test harness changes (tests/ e2e-backends + tools cap), the make target (test-extra-backend-vllm), package.sh, and the Dockerfile/install.sh plumbing all stay in place.
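A minimal shell sketch of the local-run precondition described above: check whether a host's `/proc/cpuinfo` flags include the AVX-512 features the prebuilt wheel is assumed to need before invoking the make target. The helper name `missing_simd_flags` is hypothetical and not part of the repo; the flag names are as Linux reports them.

```shell
# Hypothetical helper (illustration only): given the contents of a
# cpuinfo "flags" line, print the required AVX-512 features that are
# absent. Assumes the prebuilt vllm CPU wheel needs avx512_vnni and
# avx512_bf16, per the commit message above.
missing_simd_flags() {
    flags_line="$1"
    missing=""
    for f in avx512_vnni avx512_bf16; do
        case " $flags_line " in
            *" $f "*) ;;                    # flag present, nothing to record
            *) missing="$missing $f" ;;     # flag absent
        esac
    done
    echo "$missing"
}

# Example gate before running the smoke test (Linux-only path assumed):
# if [ -z "$(missing_simd_flags "$(grep -m1 '^flags' /proc/cpuinfo)")" ]; then
#     make test-extra-backend-vllm
# fi
```

A runner could use the same check to decide between the prebuilt wheel and a FROM_SOURCE=true build instead of failing with SIGILL at import time.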
- ci
- gallery-agent
- ISSUE_TEMPLATE
- workflows
- bump_deps.sh
- bump_docs.sh
- check_and_update.py
- checksum_checker.sh
- dependabot.yml
- FUNDING.yml
- labeler.yml
- PULL_REQUEST_TEMPLATE.md
- release.yml
- stale.yml