mirror of
https://github.com/mudler/LocalAI
synced 2026-05-24 09:28:23 +00:00
feat(llama.cpp): add flash_attn and no_kv_offload Signed-off-by: Ettore Di Giacinto <mudler@localai.io> |
||
|---|---|---|
| .. | ||
| grpc | ||
| llama | ||
feat(llama.cpp): add flash_attn and no_kv_offload Signed-off-by: Ettore Di Giacinto <mudler@localai.io> |
||
|---|---|---|
| .. | ||
| grpc | ||
| llama | ||