Mirror of https://github.com/mudler/LocalAI, synced 2026-05-24 09:28:23 +00:00
* Use pb.Reply instead of []byte with Reply.GetMessage() in llama grpc to get the proper usage data in reply streaming mode at the last [DONE] frame
* Fix 'hang' on empty message from the start

Seems like that empty message marker trick was unnecessary

---------

Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>

| Name |
|---|
| backend_suite_test.go |
| embeddings.go |
| image.go |
| llm.go |
| llm_test.go |
| options.go |
| rerank.go |
| soundgeneration.go |
| stores.go |
| token_metrics.go |
| tokenize.go |
| transcript.go |
| tts.go |