mirror of https://github.com/mudler/LocalAI synced 2026-05-24 09:28:23 +00:00

History

LocalAI [bot] 4e154b59e5 fix(ci): unbreak rerankers (torch bump) and vllm-omni on aarch64 (#9688 ) Two unrelated CI breakages bundled together since both are one-liners: - rerankers: bump torch 2.4.1 -> 2.7.1 on cpu/cublas12. The unpinned transformers resolves to 5.x, whose moe.py registers a custom_op with string-typed `'torch.Tensor'` annotations that torch 2.4.1's infer_schema rejects, blocking the gRPC server from starting and failing all 5 backend tests with "Connection refused" on :50051. Matches the version used by the transformers backend. - vllm-omni: strip fa3-fwd from the upstream requirements/cuda.txt before resolving on aarch64. fa3-fwd 0.0.3 ships only an x86_64 wheel and has no sdist, making the cuda profile unsatisfiable on Jetson/SBSA. fa3-fwd is a soft runtime dep — vllm-omni's attention backends fall back to FA2 then SDPA when it's missing. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Co-authored-by: Ettore Di Giacinto <mudler@localai.io>		2026-05-06 17:07:24 +02:00
..
backend.py	feat: add distributed mode (#9124 )	2026-03-30 00:47:27 +02:00
install.sh	feat: Add backend gallery (#5607 )	2025-06-15 14:56:52 +02:00
Makefile	feat(mlx): add mlx backend (#6049 )	2025-08-22 08:42:29 +02:00
README.md	feat(rerankers): Add new backend, support jina rerankers API (#2121 )	2024-04-25 00:19:02 +02:00
requirements-cpu.txt	fix(ci): unbreak rerankers (torch bump) and vllm-omni on aarch64 (#9688 )	2026-05-06 17:07:24 +02:00
requirements-cublas12.txt	fix(ci): unbreak rerankers (torch bump) and vllm-omni on aarch64 (#9688 )	2026-05-06 17:07:24 +02:00
requirements-cublas13.txt	Revert "chore(deps): bump torch from 2.4.1 to 2.7.1+xpu in /backend/python/rerankers in the pip group across 1 directory" (#8412 )	2026-02-05 14:17:33 +01:00
requirements-hipblas.txt	feat(rocm): bump to 7.x (#9323 )	2026-04-12 08:51:30 +02:00
requirements-intel.txt	feat(qwen-tts): add Qwen-tts backend (#8163 )	2026-01-23 15:18:41 +01:00
requirements-mps.txt	Revert "chore(deps): bump torch from 2.4.1 to 2.7.1+xpu in /backend/python/rerankers in the pip group across 1 directory" (#8412 )	2026-02-05 14:17:33 +01:00
requirements.txt	chore(deps): bump grpcio from 1.78.1 to 1.80.0 in /backend/python/rerankers (#9181 )	2026-03-31 10:10:50 +02:00
run.sh	feat: Add backend gallery (#5607 )	2025-06-15 14:56:52 +02:00
test.py	fix(reranker): support omitting top_n (#7199 )	2025-11-09 18:40:32 +01:00
test.sh	feat: Add backend gallery (#5607 )	2025-06-15 14:56:52 +02:00

README.md

Creating a separate environment for the reranker project

make reranker