mirror of
https://github.com/mudler/LocalAI
synced 2026-04-21 21:37:21 +00:00
* fixes #1775 and #1774 Add BitsAndBytes Quantization and fixes embedding on CUDA devices * Manage 4bit and 8 bit quantization Manage different BitsAndBytes options with the quantization: parameter in yaml * fix compilation errors on non CUDA environment |
||
|---|---|---|
| .. | ||
| install.sh | ||
| Makefile | ||
| transformers-nvidia.yml | ||
| transformers-rocm.yml | ||
| transformers.yml | ||