LocalAI

mirror of https://github.com/mudler/LocalAI synced 2026-04-21 21:37:21 +00:00

Ettore Di Giacinto b4e30692a2 feat(backends): add sglang (#9359)	2026-04-16 22:40:56 +02:00

* feat(backends): add sglang

  Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* fix(sglang): force AVX-512 CXXFLAGS and disable CI e2e job

  sgl-kernel's shm.cpp uses __m512 AVX-512 intrinsics unconditionally; -march=native fails on CI runners without AVX-512 in /proc/cpuinfo. Force -march=sapphirerapids so the build always succeeds, matching sglang upstream's docker/xeon.Dockerfile recipe.

  The resulting binary still requires an AVX-512 capable CPU at runtime, so disable tests-sglang-grpc in test-extra.yml for the same reason tests-vllm-grpc is disabled. Local runs with make test-extra-backend-sglang still work on hosts with the right SIMD baseline.

  Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

* fix(sglang): patch CMakeLists.txt instead of CXXFLAGS for AVX-512

  CXXFLAGS with -march=sapphirerapids was being overridden by add_compile_options(-march=native) in sglang's CPU CMakeLists.txt, since CMake appends those flags after CXXFLAGS. Sed-patch the CMakeLists.txt directly after cloning to replace -march=native.

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

disabled	chore(ci): disable CI actions	2026-03-02 14:48:00 +01:00
backend.yml	feat(backends): add sglang (#9359)	2026-04-16 22:40:56 +02:00
backend_build.yml	chore(deps): bump docker/login-action from 3 to 4 (#8918)	2026-03-09 22:30:11 +01:00
backend_build_darwin.yml	chore(deps): bump docker/metadata-action from 5 to 6 (#8917)	2026-03-09 22:27:02 +01:00
backend_pr.yml	Change runner from macOS-14 to macos-latest	2025-12-13 10:11:27 +01:00
build-test.yaml	chore(deps): bump actions/upload-artifact from 6 to 7 (#8730)	2026-03-02 21:43:39 +01:00
bump-inference-defaults.yml	chore(deps): bump peter-evans/create-pull-request from 7 to 8 (#9114)	2026-03-24 08:50:50 +01:00
bump_deps.yaml	feat(backend): add turboquant llama.cpp-fork backend (#9355)	2026-04-15 01:25:04 +02:00
bump_docs.yaml	fix(api)!: Stop model prior to deletion (#8422)	2026-02-06 09:22:10 +01:00
checksum_checker.yaml	fix(api)!: Stop model prior to deletion (#8422)	2026-02-06 09:22:10 +01:00
deploy-explorer.yaml	fix(api)!: Stop model prior to deletion (#8422)	2026-02-06 09:22:10 +01:00
gallery-agent.yaml	fix(ci): small fixups	2026-04-14 09:27:27 +00:00
generate_grpc_cache.yaml	chore(deps): bump docker/build-push-action from 6 to 7 (#8919)	2026-03-09 22:29:51 +01:00
generate_intel_image.yaml	chore(deps): bump docker/login-action from 3 to 4 (#8918)	2026-03-09 22:30:11 +01:00
gh-pages.yml	chore(deps): bump actions/upload-pages-artifact from 4 to 5 (#9337)	2026-04-13 21:53:47 +02:00
image-pr.yml	feat(rocm): bump to 7.x (#9323)	2026-04-12 08:51:30 +02:00
image.yml	feat(rocm): bump to 7.x (#9323)	2026-04-12 08:51:30 +02:00
image_build.yml	chore: drop AIO images (#9004)	2026-03-14 17:49:36 +01:00
notify-releases.yaml	fix(api)!: Stop model prior to deletion (#8422)	2026-02-06 09:22:10 +01:00
release.yaml	chore(deps): bump softprops/action-gh-release from 2 to 3 (#9336)	2026-04-13 21:53:28 +02:00
secscan.yaml	Revert "chore(deps): bump securego/gosec from 2.22.9 to 2.22.11" (#7789)	2025-12-30 09:58:13 +01:00
stalebot.yml	chore(deps): bump actions/stale from 10.1.1 to 10.2.0 (#8633)	2026-02-23 23:27:20 +01:00
test-extra.yml	feat(backends): add sglang (#9359)	2026-04-16 22:40:56 +02:00
test.yml	feat: add distributed mode (#9124)	2026-03-30 00:47:27 +02:00
tests-e2e.yml	feat(realtime): WebRTC support (#8790)	2026-03-13 21:37:15 +01:00
tests-ui-e2e.yml	chore(deps): bump actions/upload-artifact from 4 to 7 (#9030)	2026-03-17 11:42:49 +01:00
update_swagger.yaml	fix(api)!: Stop model prior to deletion (#8422)	2026-02-06 09:22:10 +01:00
yaml-check.yml	chore(backend gallery): add description for remaining backends (#5679)	2025-06-17 22:21:44 +02:00