ElevenLabs WS Stream
Real-time text-to-speech synthesis using ElevenLabs models via WebSocket. Uses SLNG unified protocol - same client code works across all TTS providers. Supports ElevenLabs-specific features: SSML, word alignment, voice settings, and configure commands for mid-stream updates.
query Parameters
voice_id
string🔄 UNIFIED: Voice identifier. For ElevenLabs: used directly as ElevenLabs voice ID. Compatible with all TTS providers.
model_id
string🔄 UNIFIED: Model identifier. For ElevenLabs: eleven_turbo_v2_5, eleven_monolingual_v1, etc. Maps to provider-specific models.
Default: eleven_turbo_v2_5region
string · enum🔄 UNIFIED: Compute region for processing
Enum values:us-westus-easteu-westDefault: us-westsample_rate_hz
integer · enum🔄 UNIFIED: Audio sample rate in Hz
Enum values:8000160002400048000Default: 24000codec
string · enum🔄 UNIFIED: Audio codec format
Enum values:pcm16opusDefault: pcm16
Headers
Authorization
string · requiredThe
Authorization
header is used to authenticate with the API using your API key. Value is of the formatBearer YOUR_KEY_HERE
.Upgrade
string · enum · requiredMust be 'websocket' for WebSocket upgrade
Enum values:websocketConnection
string · enum · requiredMust be 'Upgrade' for WebSocket upgrade
Enum values:Upgrade
Responses
Switching Protocols
Deepgram Aura WS Stream
Real-time text-to-speech synthesis using Deepgram Aura models via WebSocket. Features 40+ voices, sub-200ms latency, and optimized LLM integration.
query Parameters
model
stringAura model (aura-asteria-en, aura-luna-en, etc.)
encoding
stringAudio encoding (linear16, mp3, opus, etc.)
sample_rate
integerSample rate in Hz
Headers
Authorization
string · requiredThe
Authorization
header is used to authenticate with the API using your API key. Value is of the formatBearer YOUR_KEY_HERE
.
Responses
Switching Protocols
Cartesia Sonic WS Stream
Real-time text-to-speech synthesis using Cartesia Sonic models via WebSocket. Ultra-low latency with word timestamps and emotion control.
query Parameters
model
stringSonic model (sonic-english, sonic-multilingual)
voice_id
stringCartesia voice ID
language
stringLanguage code for multilingual model
Headers
Authorization
string · requiredThe
Authorization
header is used to authenticate with the API using your API key. Value is of the formatBearer YOUR_KEY_HERE
.
Responses
Switching Protocols