ElevenLabs WS Stream
Real-time text-to-speech synthesis using ElevenLabs models via WebSocket. Uses SLNG unified protocol - same client code works across all TTS providers. Supports ElevenLabs-specific features: SSML, word alignment, voice settings, and configure commands for mid-stream updates.
query Parameters
voice_idstring🔄 UNIFIED: Voice identifier. For ElevenLabs: used directly as ElevenLabs voice ID. Compatible with all TTS providers.
model_idstring🔄 UNIFIED: Model identifier. For ElevenLabs: eleven_turbo_v2_5, eleven_monolingual_v1, etc. Maps to provider-specific models.
Default: eleven_turbo_v2_5regionstring · enum🔄 UNIFIED: Compute region for processing
Enum values:us-westus-easteu-westDefault: us-westsample_rate_hzinteger · enum🔄 UNIFIED: Audio sample rate in Hz
Enum values:8000160002400048000Default: 24000codecstring · enum🔄 UNIFIED: Audio codec format
Enum values:pcm16opusDefault: pcm16
Headers
Authorizationstring · requiredThe
Authorizationheader is used to authenticate with the API using your API key. Value is of the formatBearer YOUR_KEY_HERE.Upgradestring · enum · requiredMust be 'websocket' for WebSocket upgrade
Enum values:websocketConnectionstring · enum · requiredMust be 'Upgrade' for WebSocket upgrade
Enum values:Upgrade
Responses
Switching Protocols
Deepgram Aura WS Stream
Real-time text-to-speech synthesis using Deepgram Aura models via WebSocket. Features 40+ voices, sub-200ms latency, and optimized LLM integration.
query Parameters
modelstringAura model (aura-asteria-en, aura-luna-en, etc.)
encodingstringAudio encoding (linear16, mp3, opus, etc.)
sample_rateintegerSample rate in Hz
Headers
Authorizationstring · requiredThe
Authorizationheader is used to authenticate with the API using your API key. Value is of the formatBearer YOUR_KEY_HERE.
Responses
Switching Protocols