Unified STT
Transcribe audio via SLNG’s native WebSocket protocol bridge. The model_variant path parameter specifies the target STT model (e.g., deepgram/nova:3, slng/openai/whisper:large-v3).
Documentation Index
Fetch the complete documentation index at: https://docs.slng.ai/llms.txt
Use this file to discover all available pages before exploring further.
Authorizations
API key issued by SLNG. Pass as Authorization: Bearer <token>.
Path Parameters
Target STT model. Must support WebSocket streaming with final_transcript.
deepgram/nova:2, deepgram/nova:3, deepgram/nova:3-medical, reson8/reson8stt:v1, sarvam/saaras:v3, slng/deepgram/nova:3-en, slng/deepgram/nova:3-es, slng/deepgram/nova:3-hi, slng/deepgram/nova:3-kn, slng/deepgram/nova:3-mr, slng/deepgram/nova:3-multi, slng/deepgram/nova:3-ta, slng/deepgram/nova:3-te, slng/openai/whisper:large-v3, soniox/speech-ai:rt-v4 Query Parameters
Target region override. Auto-selected if not provided. Equivalent to the X-Region-Override header; the header takes precedence when both are set.
ap-southeast-2, eu-north-1, us-east-1 Target world part override. Auto-selected if not provided. Equivalent to the X-World-Part-Override header; the header takes precedence when both are set.
ap, eu, na Body
Unmute STT request. Provide either audio (multipart upload) or url (publicly accessible HTTPS URL).
Audio file (multipart) or base64-encoded audio (JSON).
Publicly accessible audio URL.
"https://docs.slng.ai/audio/hello.wav"
Language code for recognition.
Audio sample rate in Hz.
8000, 16000, 22050, 24000, 32000, 48000 Audio encoding format.
linear16, mp3, opus Enable voice activity detection.
Enable partial transcription results.
Response
Transcription successful.