mirror of
https://github.com/lobehub/lobehub
synced 2026-04-21 09:37:28 +00:00
* 🔨 chore: update .vscode/settings.json (#13894) * 🐛 fix(builtin-tool-local-system): honor glob scope in local system tool (#13875) Made-with: Cursor * 📝 docs: Update changelog docs and release skills (#13897) - Update changelog documentation format across all historical changelog files - Merge release-changelog-style skill into version-release skill - Update changelog examples with improved formatting and structure Made-with: Cursor --------- Co-authored-by: YuTengjing <ytj2713151713@gmail.com> Co-authored-by: Innei <i@innei.in>
34 lines
1.9 KiB
Text
34 lines
1.9 KiB
Text
---
|
|
title: 'Voice Conversations: Talk Naturally With Your Agents'
|
|
description: LobeHub now supports Text-to-Speech (TTS) and Speech-to-Text (STT), enabling natural voice interactions. Speak with your Agents and hear responses in clear, personalized voices.
|
|
tags:
|
|
- TTS
|
|
- STT
|
|
- Voice Conversations
|
|
- LobeHub
|
|
- Audio Technology
|
|
---
|
|
|
|
# Supporting TTS & STT Voice Conversations
|
|
|
|
LobeHub now supports Text-to-Speech (TTS) and Speech-to-Text (STT), turning typed conversations into natural voice interactions. You can speak with your Agents and hear their responses, making the experience closer to talking with a real person.
|
|
|
|
## Natural voice interaction
|
|
|
|
With TTS, your Agents can read responses aloud in clear, natural-sounding voices. With STT, you can dictate messages instead of typing. Together, they enable hands-free interaction—useful when you're multitasking, on the move, or simply prefer speaking to typing.
|
|
|
|
This is especially helpful for:
|
|
|
|
- Auditory learners who process information better by hearing
|
|
- Users who want to stay productive while commuting or away from a keyboard
|
|
- Anyone who finds voice more accessible or convenient than text
|
|
|
|
## Personalized voice selection
|
|
|
|
Different Agents can have different voices. Choose a voice that matches each Agent's personality or purpose. A professional assistant might use a calm, measured tone. A creative collaborator might sound more expressive.
|
|
|
|
We've curated high-quality voices from OpenAI Audio and Microsoft Edge Speech to serve users across regions and preferences. Select the voice that fits your usage style or scenario.
|
|
|
|
## A complete communication loop
|
|
|
|
Voice support closes the gap between human and AI interaction styles. Speak naturally, hear responses aloud, and maintain context just like you would in a spoken conversation. The rest of LobeHub's features—plugins, multimodal support, context management—work seamlessly alongside voice mode.
|