Skip to main content
For the full, current list, see Speech-to-Text Models.

Which approach should you use?

I need to…Use
Transcribe pre-recorded audio filesHTTP
Transcribe in real time from a microphone or callWebSocket Streaming
Process large volumes of recordings asynchronouslyBatch Transcription

Which model should you use?

I need…Recommended modelWhy
Lowest latency for English voice agentsDeepgram Nova 3 (SLNG-hosted)Deployed on SLNG infrastructure
Hindi transcriptionDeepgram Nova 3 Hindi (SLNG-hosted)Dedicated Hindi model in AP South
Broadest language coverageSonioxWide multi-language support
Medical terminologyDeepgram Nova 3 MedicalSpecialized medical vocabulary
Indian languagesSarvam AI SaarasDomain-aware recognition for Indian languages
European languagesReson8 STTDutch, French, German, Spanish, and more

Quick example

curl https://api.slng.ai/v1/stt/slng/deepgram/nova:3 \
  -H "Authorization: Bearer SLNG_API_KEY" \
  -F "audio=@recording.wav" \
  -F "language=en"
Response:
{
  "text": "Hello, how can I help you today?",
  "confidence": 0.97,
  "duration": 2.5,
  "language": "en"
}
See HTTP for complete examples in cURL, JavaScript, and Python, and WebSocket Streaming for real-time transcription.