Basic File Transcription
Upload an audio file (MP3, WAV, FLAC, OGG, M4A, or WebM) and get back the transcribed text. Here is a sample file you can use:Going Further
You can pass additional form fields to customize the transcription:- Language — If you know the language, pass
language=en(ores,fr, etc.). Not all models auto-detect, so setting this explicitly can improve accuracy. - Diarization — Pass
diarize=trueto identify different speakers in a multi-speaker recording. The response will include aspeakerfield on each word. Available on Deepgram Nova. - Punctuation — Pass
punctuate=trueto add punctuation to the transcript automatically.
Next Steps
- STT WebSocket examples — real-time transcription as users speak
- Speech-to-Text API reference — endpoint-specific parameters