mirror of
https://github.com/mudler/LocalAI
synced 2026-05-05 14:28:45 +00:00
* Streaming working * Small fix for regression on CUDA and XPU * use pip version of optimum[openvino] * Update backend/python/transformers/transformers_server.py Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * Token streaming support fix optimum[openvino] package in install.sh * Token Streaming support --------- Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com> |
||
|---|---|---|
| .. | ||
| install.sh | ||
| Makefile | ||
| transformers-nvidia.yml | ||
| transformers-rocm.yml | ||
| transformers.yml | ||