mirror of
https://github.com/mudler/LocalAI
synced 2026-05-24 09:28:23 +00:00
whisper.cpp can emit bytes that are not valid UTF-8 — typically a multibyte codepoint split across token boundaries. protobuf string fields reject those at marshal time, which would surface as a transcribe failure. Run strings.ToValidUTF8 on the segment text before it leaves the cgo boundary so the bad byte gets replaced with U+FFFD. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Assisted-by: Claude:claude-opus-4-7 [Claude Code] |
||
|---|---|---|
| .. | ||
| acestep-cpp | ||
| llm/llama | ||
| local-store | ||
| opus | ||
| piper | ||
| qwen3-tts-cpp | ||
| sam3-cpp | ||
| sherpa-onnx | ||
| silero-vad | ||
| stablediffusion-ggml | ||
| voxtral | ||
| whisper | ||