Synthesize speech using Sarvam AI Bulbul with high-quality multilingual TTS for Indian languages and 30+ speaker voices.
API key issued by SLNG. Pass as Authorization: Bearer <token>.
Target world part override. Auto-selected if not provided.
ap Sarvam AI Bulbul TTS request.
Text to synthesize. Supports code-mixed text (English and Indic languages).
1 - 2500Language code in BCP-47 format for text normalization.
bn-IN, en-IN, gu-IN, hi-IN, kn-IN, ml-IN, mr-IN, od-IN, pa-IN, ta-IN, te-IN Speaker voice for the output audio.
shubh, aditya, ritu, priya, neha, rahul, pooja, rohan, simran, kavya, amit, dev, ishita, shreya, ratan, varun, manan, sumit, roopa, kabir, aayan, ashutosh, advait, amelia, sophia, anand, tanya, tarun, sunny, mani, gokul, vijay, shruti, suhani, mohit, kavitha, rehan, soham, rupali Sarvam TTS model identifier.
bulbul:v3 Speech speed (0.5 to 2.0). Default is 1.0.
0.5 <= x <= 2Controls expressiveness (0.01 to 2.0). Default is 0.6.
0.01 <= x <= 2Output sample rate in Hz.
8000, 16000, 22050, 24000, 32000, 44100, 48000 Output audio codec.
mp3, linear16, mulaw, alaw, opus, flac, aac, wav Synthesis successful.
Sarvam AI TTS response format.
Array of base64-encoded audio strings.