LocalAI/pkg
Sertaç Özercan 45d18813bd
fix: gate CUDA directory checks on GPU vendor to prevent false CUDA detection (#8942)
Container images that install CUDA runtime libraries (e.g., cuda-cudart-12-5
via apt) create /usr/local/cuda-12 directories as a side effect. The previous
code checked for these directories before checking whether a GPU was present,
causing CPU-only hosts to select a CUDA backend that crashes because
libcuda.so.1 is absent.

Reorder checks so CUDA directory existence only refines the capability when
an NVIDIA GPU is actually detected, consistent with the arm64 L4T code path.

Signed-off-by: Sertac Ozercan <sozercan@gmail.com>
2026-03-12 07:53:39 +01:00
..
audio feat(audio): set audio content type (#8416) 2026-02-05 19:14:12 +01:00
concurrency chore: update jobresult_test.go (#4124) 2024-11-12 08:52:18 +01:00
downloader chore(size): display size of HF models and allow to specify it from the gallery (#8907) 2026-03-09 17:38:14 +01:00
format feat(api): Add transcribe response format request parameter & adjust STT backends (#8318) 2026-02-01 17:33:17 +01:00
functions feat(functions): add peg-based parsing and allow backends to return tool calls directly (#8838) 2026-03-08 22:21:57 +01:00
grpc feat: Add Free RPC to backend.proto for VRAM cleanup (#8751) 2026-03-03 12:39:06 +01:00
huggingface-api feat(hf-api): return files in nested directories (#7396) 2025-11-30 09:06:54 +01:00
langchain feat(llama.cpp): do not specify backends to autoload and add llama.cpp variants (#2232) 2024-05-04 17:56:12 +02:00
model feat(ui): MCP Apps, mcp streaming and client-side support (#8947) 2026-03-11 07:30:49 +01:00
oci feat(ui): allow to cancel ops (#7264) 2025-11-13 18:41:47 +01:00
reasoning feat(openresponses): Support reasoning blocks (#8133) 2026-01-21 00:11:45 +01:00
signals chore: update cogito and simplify MCP logics (#6413) 2025-10-09 12:36:45 +02:00
sound feat: Realtime API support reboot (#5392) 2025-05-25 22:25:05 +02:00
store chore: fix go.mod module (#2635) 2024-06-23 08:24:36 +00:00
system fix: gate CUDA directory checks on GPU vendor to prevent false CUDA detection (#8942) 2026-03-12 07:53:39 +01:00
utils Add sample_rate support to TTS API via post-processing resampling (#8650) 2026-02-25 16:36:27 +01:00
vram chore(size): display size of HF models and allow to specify it from the gallery (#8907) 2026-03-09 17:38:14 +01:00
xio feat(ui): allow to cancel ops (#7264) 2025-11-13 18:41:47 +01:00
xsync chore: fix go.mod module (#2635) 2024-06-23 08:24:36 +00:00
xsysinfo fix: drop gguf VRAM estimation (now redundant) (#8325) 2026-02-01 17:33:28 +01:00