mirror of
https://github.com/mudler/LocalAI
synced 2026-04-21 21:37:21 +00:00
* feat(backends): add sglang Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(sglang): force AVX-512 CXXFLAGS and disable CI e2e job sgl-kernel's shm.cpp uses __m512 AVX-512 intrinsics unconditionally; -march=native fails on CI runners without AVX-512 in /proc/cpuinfo. Force -march=sapphirerapids so the build always succeeds, matching sglang upstream's docker/xeon.Dockerfile recipe. The resulting binary still requires an AVX-512 capable CPU at runtime, so disable tests-sglang-grpc in test-extra.yml for the same reason tests-vllm-grpc is disabled. Local runs with make test-extra-backend-sglang still work on hosts with the right SIMD baseline. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(sglang): patch CMakeLists.txt instead of CXXFLAGS for AVX-512 CXXFLAGS with -march=sapphirerapids was being overridden by add_compile_options(-march=native) in sglang's CPU CMakeLists.txt, since CMake appends those flags after CXXFLAGS. Sed-patch the CMakeLists.txt directly after cloning to replace -march=native. --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> |
||
|---|---|---|
| .. | ||
| disabled | ||
| backend.yml | ||
| backend_build.yml | ||
| backend_build_darwin.yml | ||
| backend_pr.yml | ||
| build-test.yaml | ||
| bump-inference-defaults.yml | ||
| bump_deps.yaml | ||
| bump_docs.yaml | ||
| checksum_checker.yaml | ||
| deploy-explorer.yaml | ||
| gallery-agent.yaml | ||
| generate_grpc_cache.yaml | ||
| generate_intel_image.yaml | ||
| gh-pages.yml | ||
| image-pr.yml | ||
| image.yml | ||
| image_build.yml | ||
| notify-releases.yaml | ||
| release.yaml | ||
| secscan.yaml | ||
| stalebot.yml | ||
| test-extra.yml | ||
| test.yml | ||
| tests-e2e.yml | ||
| tests-ui-e2e.yml | ||
| update_swagger.yaml | ||
| yaml-check.yml | ||