Speech-to-Text. Transcribe audio files using the Whisper Large v3 model hosted by SLNG. Supports 99+ languages with automatic language detection and is best suited to general-purpose transcription where accuracy matters. Accepts multipart/form-data uploads with audio files up to 25MB. Supported formats: mp3, mp4, mpeg, mpga, m4a, wav, webm.
Language: Optional ISO-639-1 language code (e.g., 'en', 'es', 'fr', 'de'). If not specified, the model auto-detects the language. Providing the language can improve accuracy and reduce processing time.
WebSocket: Real-time speech-to-text transcription using OpenAI's Whisper Large v3 model via WebSocket. Supports streaming audio input with Voice Activity Detection (VAD), partial transcripts for immediate feedback, and automatic language detection. Suited to live transcription, voice commands, and real-time captioning.
openai/whisper:large-v3 - websocket
Speech-to-Text API for converting audio files to text using SLNG openai/whisper.
Headers
Authorization: The Authorization header is used to authenticate with the API using your API key. The value has the format Bearer YOUR_KEY_HERE.
Upgrade
Connection
openai/whisper:large-v3 - websocket › Responses
101 Switching Protocols
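The headers listed above (Authorization, Upgrade, Connection) are the ones a client sends when it opens the WebSocket, and the Switching Protocols response confirms the upgrade. The following is a minimal sketch of that opening handshake as defined by RFC 6455, not SLNG-specific code. The helper names handshake_headers and expected_accept are hypothetical, and the Sec-WebSocket-Key and Sec-WebSocket-Version headers come from the WebSocket standard rather than from this page. Most WebSocket client libraries perform these steps automatically.

```python
import base64
import hashlib
import os

# Fixed GUID defined by RFC 6455 for computing Sec-WebSocket-Accept.
_WS_GUID = "258EAFA5-E914-47DA-95CA-C5AB0DC85B11"


def handshake_headers(api_key: str) -> dict:
    """Build request headers for the WebSocket opening handshake (hypothetical helper)."""
    # RFC 6455 requires a random 16-byte nonce, base64-encoded.
    nonce = base64.b64encode(os.urandom(16)).decode("ascii")
    return {
        "Authorization": f"Bearer {api_key}",  # format documented above
        "Upgrade": "websocket",
        "Connection": "Upgrade",
        "Sec-WebSocket-Key": nonce,            # from RFC 6455, not this page
        "Sec-WebSocket-Version": "13",         # from RFC 6455, not this page
    }


def expected_accept(sec_websocket_key: str) -> str:
    """Return the Sec-WebSocket-Accept value a compliant server sends back."""
    digest = hashlib.sha1((sec_websocket_key + _WS_GUID).encode("ascii")).digest()
    return base64.b64encode(digest).decode("ascii")
```

A client can compare expected_accept(headers["Sec-WebSocket-Key"]) with the Sec-WebSocket-Accept header on the Switching Protocols response to confirm that the upgrade was accepted for its own request.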
openai/whisper:large-v3 - http
Speech-to-Text API for converting audio files to text using SLNG openai/whisper.
Headers
Authorization: The Authorization header is used to authenticate with the API using your API key. The value has the format Bearer YOUR_KEY_HERE.
openai/whisper:large-v3 - http › Request Body
audio: Audio file to transcribe. Supported formats: mp3, mp4, mpeg, mpga, m4a, wav, webm. Maximum file size: 25MB.
language: Optional. ISO-639-1 language code (e.g., 'en', 'es', 'fr', 'de'). If not specified, the model will auto-detect the language. Providing the language can improve accuracy and reduce processing time.
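The request body constraints above can be checked on the client before uploading. The sketch below validates the file extension, size, and language code, then encodes the audio and language fields as multipart/form-data using only the standard library. The function names check_upload and build_multipart are hypothetical, and several details are assumptions: "25MB" is read as 25 MiB, the format is inferred from the file extension, and the audio part is labeled application/octet-stream. The endpoint URL is not given in this section, so no request is sent.

```python
import os
import uuid
from typing import Optional, Tuple

SUPPORTED_EXTENSIONS = {"mp3", "mp4", "mpeg", "mpga", "m4a", "wav", "webm"}
MAX_BYTES = 25 * 1024 * 1024  # assumption: "25MB" interpreted as 25 MiB


def check_upload(filename: str, size_bytes: int, language: Optional[str] = None) -> None:
    """Raise ValueError if the upload violates the documented constraints."""
    ext = os.path.splitext(filename)[1].lstrip(".").lower()
    if ext not in SUPPORTED_EXTENSIONS:
        raise ValueError(f"unsupported format: {ext or '(none)'}")
    if size_bytes > MAX_BYTES:
        raise ValueError(f"file is {size_bytes} bytes; limit is {MAX_BYTES}")
    # ISO-639-1 codes are two lowercase ASCII letters; this checks shape only,
    # not whether the code is actually assigned.
    if language is not None:
        ok = len(language) == 2 and language.isascii() and language.isalpha() and language.islower()
        if not ok:
            raise ValueError(f"not an ISO-639-1 style code: {language!r}")


def build_multipart(filename: str, audio: bytes,
                    language: Optional[str] = None) -> Tuple[bytes, str]:
    """Return (body, content_type) for a multipart/form-data upload."""
    check_upload(filename, len(audio), language)
    boundary = uuid.uuid4().hex
    header = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="audio"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode("utf-8")
    parts = [header + audio + b"\r\n"]
    if language is not None:  # the language field is optional
        parts.append((
            f"--{boundary}\r\n"
            'Content-Disposition: form-data; name="language"\r\n\r\n'
            f"{language}\r\n"
        ).encode("utf-8"))
    parts.append(f"--{boundary}--\r\n".encode("utf-8"))
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"
```

The returned body and content type can be passed to any HTTP client along with the Authorization header. Omitting language leaves the field out of the body, which corresponds to the auto-detect behavior described above.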
openai/whisper:large-v3 - http › Responses
Successful transcription
text: The transcribed text from the audio file.
language: Detected or specified language code.
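A successful response carries the two fields above. As a rough illustration, the sketch below parses such a response into a small typed record. It assumes the response body is a JSON object with top-level text and language keys, which this section implies but does not state. The Transcription class and parse_transcription function are hypothetical names.

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class Transcription:
    text: str      # the transcribed text from the audio file
    language: str  # detected or specified language code


def parse_transcription(payload: str) -> Transcription:
    """Parse a response body, assuming a JSON object with text and language."""
    data = json.loads(payload)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    missing = [key for key in ("text", "language") if key not in data]
    if missing:
        raise ValueError("response is missing fields: " + ", ".join(missing))
    return Transcription(text=data["text"], language=data["language"])
```

For example, parse_transcription('{"text": "hello world", "language": "en"}') returns a Transcription whose text is "hello world" and whose language is "en".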