What Is a Voice Profile?
A voice profile bundles three things into a single reusable configuration:
- Speech-to-Text (STT): transcribes the caller's audio into text
- Text-to-Speech (TTS): converts agent text into natural-sounding audio
- Voice: the specific voice identity (gender, accent, character) used for TTS output
Listing Voice Profiles via SDK
VoiceProfile fields
| Field | Type | Description |
|---|---|---|
| `profile_id` | `str` | Profile ID to pass when creating a Speech Pipe |
| `name` | `str` | Human-readable name, e.g. "Emma (HD)" |
| `gender` | `str` | `male`, `female`, or `neutral` |
| `quality` | `str` | `standard` or `high` |
| `languages` | `list[str]` | Supported language codes, e.g. `["en", "hi"]` |
| `description` | `str \| None` | Short description of the voice character |
| `latency_ms` | `int \| None` | Expected end-to-end latency in milliseconds |
| `greeting_message` | `str \| None` | Default greeting for this profile |
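
To make the field types concrete, here is a minimal sketch of a local stand-in that mirrors the table above, plus a helper that filters profiles by language code. The `VoiceProfile` dataclass below is not the SDK's own class, and `profiles_for_language`, the sample profile IDs, and the sample latency values are all hypothetical, chosen only for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class VoiceProfile:
    """Local stand-in mirroring the documented fields (not the SDK class)."""

    profile_id: str
    name: str
    gender: str            # "male", "female", or "neutral"
    quality: str           # "standard" or "high"
    languages: list[str]   # e.g. ["en", "hi"]
    description: str | None = None
    latency_ms: int | None = None
    greeting_message: str | None = None


def profiles_for_language(profiles: list[VoiceProfile], code: str) -> list[VoiceProfile]:
    """Return the profiles whose `languages` list contains `code`."""
    return [p for p in profiles if code in p.languages]


# Hypothetical sample data; real IDs and values come from the SDK.
SAMPLE_PROFILES = [
    VoiceProfile("vp_emma_hd", "Emma (HD)", "female", "high", ["en", "hi"], latency_ms=900),
    VoiceProfile("vp_sam_std", "Sam", "male", "standard", ["en"], latency_ms=600),
]

hindi_profiles = profiles_for_language(SAMPLE_PROFILES, "hi")
print([p.name for p in hindi_profiles])
```

Because `description`, `latency_ms`, and `greeting_message` may be `None`, code that reads them should check for `None` before using the value.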
Getting a Single Profile
Choosing the Right Profile
By use case
| Use case | Recommended qualities |
|---|---|
| Customer support | High quality, natural voice, low latency |
| Sales outreach | Expressive, high quality, brand-matched |
| Appointment booking | Standard quality — lower latency |
| Multilingual | Match language codes to caller’s region |
By latency
Total call latency has three components: STT transcription + LLM thinking + TTS synthesis.
- Choose `quality == "high"` when voice naturalness matters more than speed
- Choose `quality == "standard"` for latency-sensitive or high-volume deployments
- Check `latency_ms` on the profile object for the estimated end-to-end figure
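
The guidance above can be turned into a small selection helper. This is a sketch under stated assumptions rather than an SDK feature: `Profile` is a trimmed-down local stand-in for the profile object, and `pick_profile`, its parameters, and the sample data are hypothetical. It prefers the requested quality, then the lowest known `latency_ms`, and treats a missing `latency_ms` as unknown rather than fast.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class Profile:
    """Trimmed local stand-in with only the fields used for selection."""

    profile_id: str
    quality: str               # "standard" or "high"
    languages: list[str]
    latency_ms: int | None = None


def pick_profile(
    profiles: list[Profile],
    language: str,
    prefer_quality: str = "standard",
    max_latency_ms: int | None = None,
) -> Profile | None:
    """Pick a profile for `language`, honoring an optional latency budget."""
    candidates = [p for p in profiles if language in p.languages]
    if max_latency_ms is not None:
        # With a budget set, profiles without a latency estimate are excluded.
        candidates = [
            p for p in candidates
            if p.latency_ms is not None and p.latency_ms <= max_latency_ms
        ]
    if not candidates:
        return None
    # Sort key: preferred quality first, known latency before unknown,
    # then the lowest latency estimate.
    return min(
        candidates,
        key=lambda p: (p.quality != prefer_quality, p.latency_ms is None, p.latency_ms or 0),
    )


# Hypothetical sample data for illustration only.
SAMPLES = [
    Profile("vp_emma_hd", "high", ["en", "hi"], latency_ms=900),
    Profile("vp_sam_std", "standard", ["en"], latency_ms=600),
    Profile("vp_alex_std", "standard", ["en", "es"]),  # no latency estimate
]

support = pick_profile(SAMPLES, "en", prefer_quality="high")
booking = pick_profile(SAMPLES, "en", prefer_quality="standard", max_latency_ms=700)
```

Setting `max_latency_ms` makes the budget a hard filter, so a high-quality preference can still fall back to a standard profile when no high-quality profile fits the budget.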
Using a Profile When Creating a Speech Pipe
Pass `profile_id` directly when creating or updating a Speech Pipe.
Switching a Profile on an Existing Speech Pipe
Next Steps
Speech Pipe
Attach a voice profile when creating or updating a Speech Pipe.
SDK Setup
Install the SDK and configure your runner process.