LocalAI

mirror of https://github.com/mudler/LocalAI synced 2026-05-24 09:28:23 +00:00

History

LocalAI [bot] 86a7f6c9fa ci: close GC race + cascade-skip + darwin grpc gaps from v4.2.1 (#9781 ) * ci: close the GC race + cascade-skip + darwin grpc gaps from v4.2.1 v4.2.1's backend.yml run (#25701862853) exposed three independent issues on top of the singletons fix shipped in `ea001995`. Address all three plus two related cleanups: 1. quay GC race in backend-merge-jobs-multiarch (12/37 merges failed with "manifest not found"). Even after PR #9746 split multi/single-arch merges, the multiarch matrix itself takes ~2h to drain at max-parallel: 8, and the earliest per-arch digests (push-by-digest, no tag) get reaped by quay's GC before the merge runs. The split bounded the race for multiarch; it doesn't eliminate it. Anchor each per-arch digest immediately to a tag in the internal ci-cache image (`keepalive-<run_id><tag-suffix>-<platform-tag>`). Quay won't GC tagged manifests. backend_merge.yml deletes the keepalive tags via quay REST API after publishing the user-facing manifest list. Cleanup is best-effort: if the quay token is not OAuth-scoped the merge does NOT fail, the orphan tags just persist. 2. cascade-skip on backend-merge-jobs-singlearch. v4.2.1 had 2 failed and 2 cancelled singlearch builds (out of 199); GHA's default `needs:` semantics cascade-skipped the entire singlearch merge matrix, so zero singleton tags were applied even though 197 singletons built successfully. Wrap the merge `if:` in `!cancelled() && ...` for both multi and single arch in backend.yml and backend_pr.yml so partial build failures publish the successful tag-suffixes. 3. Darwin llama-cpp grpc-server build fails with `find_package(absl)` not found. Same shape as the ccache/blake3/fmt/hiredis/xxhash/zstd fix already in `Dependencies`: a brew cache hit restores `/opt/homebrew/Cellar/grpc` so `brew install grpc` no-ops, but abseil isn't in our Cellar cache list and never gets installed alongside, leaving grpc's CMake unable to resolve it. Mirror the `brew reinstall ccache` line with `brew reinstall grpc` to re-validate grpc's full transitive dep closure on every cache-hit run. 4. Move the four heaviest CUDA cpp builds back to bigger-runner. v4.2.1 wall-clock: -gpu-nvidia-cuda-12-llama-cpp 5h36m, -gpu-nvidia-cuda-12-turboquant 6h05m, -gpu-nvidia-cuda-13-llama-cpp 5h37m, -gpu-nvidia-cuda-13-turboquant 6h05m. The cuda-12 turboquant and cuda-13 turboquant entries are over GHA's 6h job timeout. Phase 5.3 of the free-tier migration (PR #9730) had explicitly flagged this batch as 'highest-risk' with a per-entry revert path. All other matrix entries (vulkan-llama-cpp ~47m, ROCm hipblas-llama-cpp ~2h, intel sycl-f32 ~1h49m) stay on free-tier ubuntu-latest. Verified locally: all six edited workflow YAMLs parse cleanly. Real verification has to come from the next tag release run. Assisted-by: Claude:claude-opus-4-7 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * ci: extract keepalive anchor + cleanup into .github/scripts/ The two inline shell blocks from the previous commit are long enough to hurt readability of the workflow YAML and benefit from their own files with self-contained docs. Move them to .github/scripts/: anchor-digest-in-cache.sh backend_build.yml's keepalive anchor cleanup-keepalive-tags.sh backend_merge.yml's best-effort cleanup Workflow steps reduce to a single `run:` invocation each, with all the parameter plumbing handled by env vars on the step. backend_merge.yml also gains a sparse `actions/checkout@v6` step (sparse to .github/scripts only) so the cleanup script is available on the runner — backend_build already checks out for the docker build. Net workflow diff: -36 lines across the two files. Script logic and behavior are byte-identical to the inline version. Assisted-by: Claude:claude-opus-4-7 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Co-authored-by: Ettore Di Giacinto <mudler@localai.io>		2026-05-12 17:22:09 +02:00
..
actions	ci: phase 1-3 of GHA free tier migration (path filter, multi-arch split prep, /mnt disk relief) (#9726 )	2026-05-08 23:43:41 +02:00
ci	fix: roll out bluemonday Sanitize more widely (#3794 )	2024-10-12 09:45:47 +02:00
gallery-agent	fix(ci): switch gallery-agent to sigs.k8s.io/yaml (#9397 )	2026-04-17 10:10:42 +02:00
ISSUE_TEMPLATE	docs/examples: enhancements (#1572 )	2024-01-18 19:41:08 +01:00
scripts	ci: close GC race + cascade-skip + darwin grpc gaps from v4.2.1 (#9781 )	2026-05-12 17:22:09 +02:00
workflows	ci: close GC race + cascade-skip + darwin grpc gaps from v4.2.1 (#9781 )	2026-05-12 17:22:09 +02:00
backend-matrix.yml	ci: close GC race + cascade-skip + darwin grpc gaps from v4.2.1 (#9781 )	2026-05-12 17:22:09 +02:00
bump_deps.sh	feat: do not bundle llama-cpp anymore (#5790 )	2025-07-18 13:24:12 +02:00
bump_docs.sh	fix: github bump_docs.sh regex to drop emoji and other text (#2180 )	2024-04-29 03:55:29 +00:00
bump_vllm_wheel.sh	feat(vllm): expose AsyncEngineArgs via generic engine_args YAML map (#9563 )	2026-04-29 00:49:28 +02:00
check_and_update.py	fix(ci): fixup checksum scanning pipeline (#3631 )	2024-09-23 10:56:10 +02:00
checksum_checker.sh	fix(ci): fixup correct path for check_and_update.py (#2777 )	2024-07-11 23:05:43 +02:00
dependabot.yml	feat: Add backend gallery (#5607 )	2025-06-15 14:56:52 +02:00
FUNDING.yml	Create FUNDING.yml (#725 )	2023-07-09 13:39:00 +02:00
labeler.yml	chore(ci): update labels	2025-02-13 09:58:19 +01:00
PULL_REQUEST_TEMPLATE.md	feat(vllm): Allow to set quantization (#1094 )	2023-09-22 15:52:38 +02:00
release.yml	feat(p2p): Federation and AI swarms (#2723 )	2024-07-08 22:04:06 +02:00
stale.yml	feat: add PR template and stale configuration (#316 )	2023-05-20 09:10:20 +02:00