POST /v1/tts/sarvam/bulbul:v3

curl --request POST \
  --url https://api.slng.ai/v1/tts/sarvam/bulbul:v3 \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "text": "Hello, how are you today?",
  "target_language_code": "en-IN",
  "speaker": "shubh"
}
'

{
  "audios": [
    "UklGRgAA..."
  ]
}

Authorizations

Authorization
string
header
required

API key issued by SLNG. Pass it in the Authorization header as Bearer <token>.

Headers

X-World-Part-Override
enum<string>

Overrides the target world part. If the header is omitted, the world part is selected automatically.

Available options: ap
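
As an illustration of how the Authorization and X-World-Part-Override headers fit together, here is a minimal Python sketch that builds (but does not send) the request using only the standard library. The function name build_tts_request and its parameters are hypothetical and not part of the SLNG API; the URL and header names are taken from this page.

```python
import json
import urllib.request

API_URL = "https://api.slng.ai/v1/tts/sarvam/bulbul:v3"


def build_tts_request(token, payload, world_part=None):
    """Build a POST request object for the TTS endpoint (hypothetical helper)."""
    headers = {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
    }
    # The override header is optional; omit it to let the service auto-select.
    if world_part is not None:
        headers["X-World-Part-Override"] = world_part
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers=headers,
        method="POST",
    )


request = build_tts_request(
    "<token>",
    {
        "text": "Hello, how are you today?",
        "target_language_code": "en-IN",
        "speaker": "shubh",
    },
    world_part="ap",
)
# To actually send it: urllib.request.urlopen(request)
```

Building the request separately from sending it makes the headers easy to inspect before any network call is made.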

Body

application/json

Sarvam AI Bulbul TTS request.

text
string
required

Text to synthesize. Supports code-mixed text (English and Indic languages).

Required string length: 1 - 2500
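
Input longer than the 2500 limit has to be split across several requests. The sketch below shows one possible way to do that in Python by breaking on whitespace. It assumes the limit is counted in characters, which this page does not state explicitly, and the helper name split_text is hypothetical.

```python
MAX_CHARS = 2500  # upper bound from "Required string length: 1 - 2500"


def split_text(text, limit=MAX_CHARS):
    """Split text into chunks of at most `limit` characters (hypothetical helper).

    Chunks break on whitespace; runs of whitespace collapse to single spaces.
    A single word longer than `limit` is cut into fixed-size pieces.
    """
    chunks = []
    current = ""
    for word in text.split():
        # Hard-split any word that cannot fit in one chunk on its own.
        while len(word) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(word[:limit])
            word = word[limit:]
        candidate = f"{current} {word}" if current else word
        if len(candidate) <= limit:
            current = candidate
        else:
            chunks.append(current)
            current = word
    if current:
        chunks.append(current)
    return chunks
```

Each returned chunk can then be sent as the text field of its own request. Splitting on sentence boundaries instead would likely give more natural prosody, at the cost of a slightly more involved splitter.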
target_language_code
enum<string>
default:en-IN
required

Language code in BCP-47 format for text normalization.

Available options: bn-IN, en-IN, gu-IN, hi-IN, kn-IN, ml-IN, mr-IN, od-IN, pa-IN, ta-IN, te-IN

speaker
enum<string>
default:shubh

Speaker voice for the output audio.

Available options: shubh, aditya, ritu, priya, neha, rahul, pooja, rohan, simran, kavya, amit, dev, ishita, shreya, ratan, varun, manan, sumit, roopa, kabir, aayan, ashutosh, advait, amelia, sophia, anand, tanya, tarun, sunny, mani, gokul, vijay, shruti, suhani, mohit, kavitha, rehan, soham, rupali

model
enum<string>
default:bulbul:v3

Sarvam TTS model identifier.

Available options: bulbul:v3

pace
number
default:1

Speech speed (0.5 to 2.0). Default is 1.0.

Required range: 0.5 <= x <= 2
temperature
number
default:0.6

Controls expressiveness (0.01 to 2.0). Default is 0.6.

Required range: 0.01 <= x <= 2
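
The ranges for pace and temperature can be checked on the client before a request is sent. The following Python sketch is one way to do so; the function name validate_tuning is hypothetical, and the bounds and defaults are the ones listed above.

```python
def validate_tuning(pace=1.0, temperature=0.6):
    """Check pace and temperature against the documented ranges (hypothetical helper)."""
    if not 0.5 <= pace <= 2.0:
        raise ValueError(f"pace must be between 0.5 and 2.0, got {pace}")
    if not 0.01 <= temperature <= 2.0:
        raise ValueError(
            f"temperature must be between 0.01 and 2.0, got {temperature}"
        )
    # Return the values as a dict that can be merged into the request body.
    return {"pace": pace, "temperature": temperature}


payload = {
    "text": "Hello, how are you today?",
    "target_language_code": "en-IN",
    **validate_tuning(pace=1.2, temperature=0.8),
}
```
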
speech_sample_rate
enum<string>
default:24000

Output sample rate in Hz.

Available options: 8000, 16000, 22050, 24000, 32000, 44100, 48000

output_audio_codec
enum<string>

Output audio codec.

Available options: mp3, linear16, mulaw, alaw, opus, flac, aac, wav

Response

Synthesis successful.

Sarvam AI TTS response format.

audios
string[]

Array of base64-encoded audio strings.
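
To make the response format concrete, here is a minimal Python sketch that turns the audios array into raw bytes. The helper name decode_audios is hypothetical. The sample string shown at the top of this page, "UklGRgAA...", is truncated; its visible prefix decodes to bytes beginning with RIFF, which suggests a WAV container for that example, though the actual format presumably depends on output_audio_codec.

```python
import base64


def decode_audios(response_body):
    """Decode every base64 string in the `audios` array into bytes (hypothetical helper)."""
    return [base64.b64decode(item) for item in response_body.get("audios", [])]


# Only the visible prefix of the truncated sample from this page, not a playable file.
sample = {"audios": ["UklGRgAA"]}
clips = decode_audios(sample)
print(clips[0][:4])  # the first four bytes of the decoded prefix
```

With a real response, each element of the returned list can be written to disk in binary mode, for example with open("clip.wav", "wb").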