LocalAI/backend/cpp
Ettore Di Giacinto 739573e41b
feat(flash_attention): set auto for flash_attention in llama.cpp (#6168)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-08-31 17:59:09 +02:00
..
grpc fix: speedup git submodule update with --single-branch (#2847) 2024-07-13 22:32:25 +02:00
llama-cpp feat(flash_attention): set auto for flash_attention in llama.cpp (#6168) 2025-08-31 17:59:09 +02:00