Real-time conversational TTS designed for voice agents with ultra-low latency
deepgram/aura:2-es - websocket
Text-to-Speech API for generating speech from text using SLNG deepgram/aura. Real-time conversational TTS designed for voice agents with ultra-low latency
WebSocket Endpoint
Establishes a WebSocket connection for real-time text-to-speech.
Connection URL: wss://api.slng.ai/v1/tts/slng/deepgram/aura:2-es
Headers
AuthorizationThe Authorization header is used to authenticate with the API using your API key. Value is of the format Bearer YOUR_KEY_HERE.
UpgradeConnectionX-Region-OverrideOptional. Specify a target region for this model. If not provided, the system will automatically select an appropriate region.
deepgram/aura:2-es - websocket › Request Body
Decision Table
| Variant | Matching Criteria |
|---|---|
| type = object · requires: type, model | |
| type = object · requires: type, text | |
| type = object · requires: type | |
| type = object · requires: type | |
| type = object · requires: type |
typemodelModel identifier (e.g., aura:2)
voiceVoice model to use (e.g., aura-2-thalia-en)
Synthesis configuration options
deepgram/aura:2-es - websocket › Responses
Switching Protocols
Decision Table
| Variant | Matching Criteria |
|---|---|
| type = object · requires: type, session_id | |
| type = object · requires: type, data | |
| type = object · requires: type, segment_id | |
| type = object · requires: type, segment_id | |
| type = object · requires: type | |
| type = object · requires: type | |
| type = object · requires: type | |
| type = object · requires: type, code, message |
typesession_idUnique session identifier
deepgram/aura:2-es - http
Text-to-Speech API for generating speech from text using SLNG deepgram/aura. Real-time conversational TTS designed for voice agents with ultra-low latency
Headers
AuthorizationThe Authorization header is used to authenticate with the API using your API key. Value is of the format Bearer YOUR_KEY_HERE.
X-Region-OverrideOptional. Specify a target region for this model. If not provided, the system will automatically select an appropriate region.
deepgram/aura:2-es - http › Request Body
textThe text content to be converted to speech
modelAllowed model names for deepgram/aura:2-es.
languageAllowed language codes for deepgram/aura:2-es.
encodingAudio encoding format
containerAudio container format
sample_rateSample rate in Hz
bit_rateBit rate for compressed formats. For mp3: 32000 or 48000 (default). For opus: 4000-650000. For aac: 4000-192000.
callbackURL to which we'll make the callback request
callback_methodHTTP method by which the callback request will be made
Label your requests for the purpose of identification during usage reporting
Arbitrary key-value pairs attached to the API response for downstream processing
deepgram/aura:2-es - http › Responses
Successful synthesis
Audio file content