SLNG changelog - SLNG Documentation

June 16, 2026

FeatureUpdate

API v0.53.0

Gradium STT and TTS

Gradium STT streams real-time multilingual transcription over WebSocket, with Flush and EndOfStream controls for finalizing buffered audio. Gradium TTS ships alongside it for streaming WebSocket synthesis and one-shot HTTP synthesis.

Kugel 2 Turbo TTS

Kugel 2 Turbo is the new low-latency tier in the Kugel TTS family. Stream high-quality synthesis with expressiveness controls over the unified TTS WebSocket protocol, with word timestamps and per-chunk completion events.

Rime Arcana v3 French

Rime Arcana v3 French joins the Arcana v3 lineup. Synthesize French speech via WebSocket or HTTP using speakers like livet_aurelie.

ElevenLabs WebSocket controls

The ElevenLabs TTS WebSocket endpoints, including Flash v2.5, v3, and Multilingual v2, accept new init parameters: auto_mode to cut latency by skipping chunk scheduling, sync_alignment to attach alignment data to each audio chunk, enable_ssml_parsing for inline SSML, inactivity_timeout (1–180 s), and chunk_length_schedule. Set enable_logging: false for zero-retention mode on enterprise plans on both WebSocket and HTTP endpoints.

June 12, 2026

FeatureUpdate

API v0.52.3

Execution Layer documentation

The docs portal is reframed around the Execution Layer, the three-stage path that sits between your orchestrator and the models. Read how STT Routing, Tiered Decisioning, and Output Assembly cut latency and model cost across a voice call. The new How It Works page walks through a single turn end to end.The top bar collapses to Documentation, Execution Layer, API Reference, and Models. Every endpoint now lives under one API Reference, grouped by Unified API, TTS, STT, Voice Agents, Batch, and Bridges, so you can find a model without switching tabs.

Consolidated authentication and BYOK pages

API key creation, rotation, and request authentication are merged into a single Authentication & API Keys page. Bring-your-own-key setup for every supported provider now lives on one BYOK page covering caching behavior and provider key handling.

June 11, 2026

Feature

Pipecat plugin

The pipecat-slng Python package connects a Pipecat pipeline to any STT or TTS model on the SLNG gateway. It ships SlngSTTService and SlngTTSService for low-latency streaming over WebSocket, plus SlngHttpTTSService for non-streaming synthesis. Swap the model string to switch provider or region without changing your pipeline code.

June 10, 2026

Update

API v0.52.2

New Sarvam Bulbul and Soniox voices

Sarvam Bulbul v3 adds the Tamil voice ta-IN-rahul and the multilingual shubh, while gu-IN-rahul is retired. Soniox TTS v1 adds four English voices: Maya, Daniel, Noah, and Nina.

Updated Cartesia Sonic 3 voice metadata

The 745-voice Cartesia Sonic 3 catalog has refreshed metadata. Voice IDs are unchanged, so existing requests keep working.

June 3, 2026

Fix

API v0.52.1

Patch-level updates to the unified gateway specification clean up schemas across the TTS, STT, and bridge surfaces. Existing requests and responses keep working without changes.

June 2, 2026

FeatureUpdate

API v0.52.0

Inworld Max 1.5 TTS

Inworld Max 1.5 is available on the SLNG-hosted TTS API in us-east-1. Synthesize up to 2,000 characters per request across 16 languages, pick from 100+ named voices, and tune output with temperature, language (BCP-47), and word- or character-level timestamps.

Asia South region for Nova 3 English and Rime Arcana v3

Deepgram Nova 3 English and Rime Arcana v3 English can now route to asia-south1. Set X-Region-Override: asia-south1 to keep traffic in-region for users in India.

Region rename for Nova 3 Indian languages

The Hindi, Marathi, Kannada, Tamil, and Telugu variants of Deepgram Nova 3 now expose a single asia-south1 region. The legacy ap-south-1 value is no longer accepted on X-Region-Override; switch any pinned requests to asia-south1.

May 29, 2026

Update

API v0.51.2

Supported LLMs for Voice Agents

The llm field on Voice Agents is now a closed set: bedrock-mantle/nvidia.nemotron-super-3-120b, bedrock-mantle/nvidia.nemotron-nano-3-30b, and groq/openai/gpt-oss-120b. Pick one of these IDs when you create or update an agent so requests route to a live model.

Moonshot Kimi K2 removed from Voice Agents

groq/moonshotai/kimi-k2-instruct-0905 is no longer routable for Voice Agents. Switch existing agents and example payloads to groq/openai/gpt-oss-120b (or another model from the supported list) to keep calls working.

May 28, 2026

FeatureUpdate

API v0.51.1

Segment correlation for Cartesia Sonic 3

Cartesia Sonic 3 audio chunks now carry a flush_id. The counter starts at 0 and increments each time you send a text input with flush: true or a standalone flush message. Use it to map streamed audio frames back to the text segment that produced them.

Pronunciation hints on Cognigy and Jambonz TTS bridges

The Cognigy TTS bridge and Jambonz TTS bridge accept an optional pronunciation object on text messages. Set mode (ipa or ssml), name, and dictionary_id to override how a term is spoken per chunk on providers that support pronunciation dictionaries.

Binary audio frames on the Unmute TTS bridge

The Unmute TTS bridge now sends raw binary WebSocket frames instead of JSON audio_chunk, segment_start, and segment_end envelopes. The encoding and sample rate match the values you pass in the init message, so you can write the bytes straight to your audio pipeline. The clear control message has been removed; error and audio_end payloads now carry their fields under a data envelope (with message and code duplicated at the top level for backward compatibility).

May 26, 2026

Update

API v0.50.1

BYOK support for Kugel TTS

You can now bring your own Kugel API key when calling Kugel 1 Turbo, Kugel 1, and Kugel 2. Pass it on the X-Slng-Provider-Key header so Kugel bills your account directly, while the SLNG gateway’s TTS cache still serves repeat requests. See Bring your own key for the full provider list.

May 23, 2026

Feature

API v0.50.0

HTTP streaming for Sarvam Bulbul v3

Sarvam AI Bulbul Stream v3 is now available as a chunked HTTP endpoint. The response is raw audio bytes in the codec you set with output_audio_codec (there is no JSON envelope and no X-Duration header), so you can start playback as soon as the first chunk arrives. Use it to reach the same 30+ Indian-language voices as bulbul:v3 with lower time-to-first-byte.

May 22, 2026

Feature

API v0.49.1

Streaming transcription for Sarvam Saaras v3

Sarvam AI Saaras v3 is now available over WebSocket for real-time transcription across 23 Indian languages. Configure each session with query parameters on the upgrade URL (language-code, mode, sample_rate, input_audio_codec, high_vad_sensitivity, and vad_signals) to tune voice activity detection and audio handling per stream.

May 21, 2026

FeatureUpdate

API v0.48.1

HTTP transcription for Nova 3 Indic languages

Deepgram Nova 3 now exposes HTTP endpoints for Kannada, Marathi, Tamil, and Telugu. You can transcribe these languages with a single request (binary upload or url field) instead of opening a WebSocket.

Soniox TTS language coverage

Soniox TTS now ships voices for 60+ languages, including Afrikaans, Bengali, Bulgarian, Catalan, Czech, Dutch, Greek, Gujarati, Hebrew, Indonesian, Kannada, Malay, Marathi, Norwegian, Tamil, Telugu, and more. Set the language field to any supported ISO 639-1 code on the Soniox TTS HTTP endpoint to pick a voice catalog.

Runtime variables for voice agents

Voice agents can now declare runtime_variables so the model can capture values during a call and reuse them later in webhook URLs or system tool arguments. The built-in set_runtime_variables tool writes the values, which persist for the lifetime of the call or web session. See Voice agents for the setup pattern.

New agent regions and world parts

The region override catalog now includes the eu-non-eu and me world parts, plus the asia-south1, asia-southeast2, and australia-southeast1 regions. Pin requests to Sydney, Jakarta, or a non-EU European region with X-Region-Override, or stay inside the Middle East with X-World-Part-Override: me.

Voice agents no longer need a separate API key

Voice agent create and duplicate requests no longer accept slng_api_key. Agents now use the API key you authenticate the request with. Drop the field from your payloads. Agent duplication also stops copying the inbound connection and call history; reconnect inbound routing on the copy if you need it.

Deepgram Aura 2 region availability

SLNG-hosted Deepgram Aura 2 English is no longer available in us-east-1. Route English TTS to eu-north-1. Aura 2 Spanish drops the na world part and now serves only eu.

May 20, 2026

FeatureUpdate

API v0.46.0

Pronunciation dictionaries for TTS

You can now create reusable pronunciation dictionaries and attach them to any SLNG TTS request. Define rewrite rules once for brand names, acronyms, or domain terms, then reference the dictionary from HTTP, WebSocket, or Unified TTS calls. Manage dictionaries through the new pronunciation dictionary endpoints.

Full voice catalogs on provider pages

Provider voice pages now list every voice in the catalog instead of capping the table at ten per language. The Cartesia Sonic 3 page shows all 745 voices, Sarvam Bulbul shows 405, Deepgram Aura shows 91, Soniox shows 120, Murf shows 111, and Kugel shows 100. You can search and copy any voice ID directly from the provider page.

May 19, 2026

UpdateFix

API v0.44.2

Batch API reference now matches the gateway

The Batch API reference and Batch API guide are realigned with the live api.batch.slng.ai gateway. Request and response schemas, supported audio formats, and the three submission flows (direct upload, URL input, and presigned S3 upload) reflect what the service actually accepts.

May 16, 2026

FeatureUpdate

API v0.44.1

URL-based audio for HTTP transcription

The Whisper Large v3, Cognigy STT bridge, Jambonz STT bridge, and Unmute STT bridge HTTP endpoints now accept a url field pointing to a publicly accessible audio file. Send a JSON body with url and language instead of a multipart upload when your audio is already hosted somewhere.

Deepgram Nova 3 English region change

SLNG-hosted Deepgram Nova 3 English is no longer available in eu-north-1. Route English transcription to australia-southeast1 or us-east-1 instead.

May 15, 2026

Feature

API v0.43.3

voiceai CLI

The new voiceai CLI runs text-to-speech and speech-to-text from a terminal. Install it with curl, Homebrew, or npm, then pipe audio between SLNG models and other tools without writing an HTTP client.

JavaScript and Python SDKs

The typed JavaScript SDK (voiceai-sdk on npm) supports Node, Bun, and Deno. The Python SDK (voiceai-sdk on PyPI) ships sync and async clients for Python 3.9+. Both wrap the full STT and TTS surface so you can drop the raw fetch and WebSocket plumbing.

Agent skills for coding agents

The slng-ai/skills pack teaches Claude Code and similar coding agents to call SLNG directly. Point your agent at the skills repo and it can pick models, build init messages, and stream audio for you.

LiveKit Agents plugin

The livekit-plugins-slng Python package connects LiveKit Agents to any STT or TTS model on the SLNG gateway with a single configuration switch. You can swap providers or regions without changing your agent code.

Embed a voice agent on the web

A new browser embed guide walks through adding a SLNG voice session to any web page. It uses LiveKit, a React frontend, and a backend proxy that keeps your API key off the client.

May 14, 2026

FeatureUpdate

API v0.43.2

Rime Arcana v3 Spanish

Rime Arcana v3 (Spanish) is available as a new TTS endpoint, with both streaming WebSocket and one-shot HTTP synthesis. Choose from ten Spanish voices (aurelio, celestino, lark, luz, mar, nova, pola, seraphina, sirius, and ursa) and pass model: rime/arcana:3-es on init.

Rime Arcana v3 adds eu-north-1

Arcana v3 English, Hindi, and Spanish are now available in eu-north-1. You can route Arcana v3 synthesis to North Europe for lower latency in that region.

Deepgram Nova 3 adds asia-south1

Deepgram Nova 3 Tamil, Telugu, Marathi, and Kannada are now available in asia-south1, in addition to ap-south-1. You can route South Asian language transcription to the Mumbai GCP region.

Deepgram Nova 3 Spanish region change

Deepgram Nova 3 Spanish is no longer available in ap-southeast-2. Use australia-southeast1 or us-east-1 instead.

Soniox TTS region override removed

The x-region header is no longer accepted on Soniox TTS v1 requests. Soniox TTS runs only in na, so requests are routed there automatically.

May 13, 2026

UpdateFix

API v0.42.2

Expanded Murf Falcon voice catalog

The Murf Falcon voice catalog now lists 133 voices across more than 20 locales, including new entries for Bengali, Tamil, Telugu, Gujarati, Kannada, Punjabi, Japanese, Korean, Portuguese, Dutch, Polish, Greek, Croatian, and Scottish English. You can browse voices by full locale code and copy the voice_id for use in your init message.

Simplified Unmute TTS bridge requests

The Unmute TTS bridge no longer requires a model field on HTTP or WebSocket init messages. The model is now inferred from the {model_variant} path. Send only voice and text for HTTP requests, or voice plus optional config on the WebSocket init.

May 7, 2026

Feature

API v0.42.1

Rime Coda Indonesian TTS

Rime Coda is available as a new TTS model in the asia-southeast2 (Jakarta) region. It synthesizes Bahasa Indonesian with low latency across four voices (pujianti_plesmita, siswoko_sigit, taryadi_dani, and usmany_tatianna) and supports streaming WebSocket and one-shot HTTP synthesis.

Region and world-part query parameters on bridges

The Cognigy, Jambonz, and Unmute HTTP bridges now accept ?region= and ?world-part= query parameters, mirroring the X-Region-Override and X-World-Part-Override headers. Use the query form when your platform cannot set custom headers; if both are present, the header wins. See Integrations overview for details.

May 6, 2026

FeatureUpdate

API v0.41.2

Cartesia Sonic 3, Murf Falcon, Kugel, Soniox, Reson8, and Sarvam in the Unified API

The Unified API now routes to Cartesia Sonic 3, Murf Falcon, KugelAudio Kugel 1/1-Turbo/2, Soniox TTS v1, Soniox Speech AI v4, Reson8 STT v1, and Sarvam Saaras v3. You can swap between these providers using a single request shape: pass identifiers like cartesia/sonic:3, murf/murftts:falcon, kugelaudio/kugel:2, soniox/tts-rt:v1, soniox/speech-ai:rt-v4, or reson8/reson8stt:v1.

Webhook tools support custom HTTP methods and raw payloads

Voice agent webhook tools now accept http_method (POST, PUT, PATCH, or DELETE) and webhook_format (envelope or raw). Set webhook_format to raw to send only the tool arguments when the receiving service cannot parse the SLNG envelope.

Tool execution tracking for voice agents

You can record webhook, template, human-transfer, and built-in tool activity against a call by posting to the tool executions endpoint. Each record carries the outcome, duration, and HTTP status, and submitted executions surface on the call detail response for debugging or analytics.

ElevenLabs Flash v2.5 adds Asia Pacific region

ElevenLabs Flash v2.5 is now available in ap, in addition to eu. You can route synthesis to Asia Pacific endpoints for lower latency in that region.

Nova 3 multi-language adds eu-north-1

Deepgram Nova 3 multi-language is now available in eu-north-1 as a specific region (in addition to the broader eu world part), giving you direct routing to North Europe.

Deepgram Aura 2 English region change

Deepgram Aura 2 English is no longer available in ap-southeast-2. Use eu-north-1 or us-east-1 instead.

Nova 3 Hindi region change

Deepgram Nova 3 Hindi is no longer available in ap-southeast-2. Use asia-south1 instead.

April 30, 2026

Update

API v0.40.3

Soniox TTS v1 general availability

Soniox TTS graduates from preview to v1. Update your client to call tts-rt:v1 and set model to tts-rt-v1. The v1-preview path and tts-rt-v1-preview model identifier are retired.

Deepgram Aura 2 voice selection now required

The model field is now required on every Deepgram Aura 2 English and Spanish request. The previous defaults (aura-2-thalia-en and aura-2-celeste-es) no longer apply, so you must pick a voice explicitly.

April 29, 2026

FeatureUpdate

API v0.40.1

Kugel 2 TTS

Kugel 2 is available as a new TTS model in eu. It offers 87 voices with expressiveness control across 26 languages, including Arabic, Chinese, Hindi, Japanese, Korean, and Vietnamese.

Soniox TTS v1-preview

Soniox TTS v1-preview is available as a new TTS model in na, with both streaming WebSocket and one-shot HTTP synthesis. Browse the voice catalog on the Soniox TTS voices page.

Voice catalog pages for Cartesia Sonic 3 and Murf Falcon

You can now browse Cartesia Sonic 3 and Murf Falcon voices with audio samples directly in the docs. Each entry shows the voice_id to pass in your init message.

Nova 3 multi-language adds EU region

Deepgram Nova 3 multi-language is now available in eu, in addition to ap-southeast-2 and us-east-1. You can route multilingual transcription to European endpoints for lower latency.

Nova 3 Hindi region change

Deepgram Nova 3 Hindi is no longer available in ap-south-1. Use ap-southeast-2 or asia-south1 instead.

April 25, 2026

Update

API v0.38.0

Soniox Speech AI Real-time v4

Soniox Speech AI moves to v4. Use the Speech AI Real-time v4 endpoint for streaming transcription with speaker diarization, automatic language identification, and configurable endpoint detection across 60+ languages. The v3 endpoint has been retired. Point clients at the new path to continue receiving native Soniox token frames.

LiveKit plugin compatibility refresh

The LiveKit Agents plugin now targets livekit-agents>=1.5.1 and Python 3.10+. You can pass model-specific options as keyword arguments. For example, whisper_params for Whisper, target_language_code for Sarvam STT, or modelId and speakingStyle for Rime Arcana. New slng_base_url and http_session arguments let you point at a self-hosted gateway and reuse an aiohttp.ClientSession.

Sarvam Saaras v3 STT not supported in LiveKit plugin

Saaras is HTTP-only on SLNG and has no WebSocket endpoint, so it cannot run through the LiveKit plugin’s realtime path. For Hindi voice agents, use slng/deepgram/nova:3-hi or slng/deepgram/nova:3-multi. See the LiveKit plugin provider notes for details.

April 24, 2026

Update

API v0.37.0

Expanded regions for Murf Falcon TTS

Murf Falcon is now available in ap, eu-non-eu, me, and na, in addition to the existing eu world part. You can now route synthesis closer to users across the Americas, Asia Pacific, and the Middle East.

Asia Pacific region for Soniox Speech AI Real-time v3

Soniox Speech AI Real-time v3 adds the ap world part alongside eu and na. Route transcription to Asia Pacific endpoints for lower latency in that region.

April 23, 2026

FeatureUpdate

API v0.36.0

URL and presigned S3 inputs for Batch STT

You can now submit audio to the Batch STT API without uploading a file on every request. Pass a publicly accessible input_url, or request a presigned S3 URL, upload directly, then create the job with the returned s3_key. Both methods accept an optional metadata object for attaching arbitrary key-value pairs to a job.

Batch API usage guide

A new Batch API guide walks through the three input methods (file upload, URL input, and presigned S3 upload) with request flows and sample payloads.

Deepgram Aura 2 English in eu-north-1

Deepgram Aura 2 English TTS is now available in eu-north-1, in addition to ap-southeast-2 and us-east-1.

Whisper Large v3 Compressed removed

The Whisper Large v3 Compressed STT model has been retired from the catalog. Use Whisper Large v3 for multilingual transcription going forward.

April 22, 2026

FeatureUpdate

API v0.32.0

Runtime variables for voice agents

Voice agents can now capture values during a call and reuse them in webhook URLs and system tool arguments. Define a runtime_variables array on your agent, and the model sets values through the built-in set_runtime_variables tool. See the agent configuration examples for setup details.

Webhook HTTP method and payload format

Webhook tools on voice agents now accept http_method (POST, PUT, PATCH, or DELETE) and webhook_format (envelope or raw). Use raw to send only the tool arguments as the request body, skipping the SLNG metadata envelope. Both fields are documented in the Voice Agents API reference.

Expanded regions for Rime Arcana v2 and Cartesia Sonic 3

Rime Arcana v2 TTS is now available in eu-north-1 and us-east-1, in addition to ap-southeast-2. Cartesia Sonic 3 TTS is now available in all three world parts: ap, eu, and na.

New regions for Deepgram Nova 3 English and Hindi

Nova 3 English is now available in ap-south-1 and us-east-1. Nova 3 Hindi adds asia-south1 alongside existing regions.

Utterance end events on Unmute STT Bridge

The Unmute STT Bridge now emits utterance_end events when the upstream model signals the end of a spoken utterance. This gives you an explicit boundary marker for segmenting transcription output.

Native token stream for Soniox Speech AI

Soniox Speech AI Real-time v3 now returns native Soniox token frames instead of normalized transcripts. You receive interim and final tokens directly, including <end> and <fin> endpoint markers when endpoint detection is enabled.

April 20, 2026

Feature

API v0.31.3

Tool personalization for voice agents

You can now use {{variable}} placeholders in runtime tool fields: webhook URLs, system webhook argument values, human transfer phone numbers, and built-in timezones. Values resolve when the tool executes, not at session start, so a missing tool variable does not block the call. Supported surfaces, validation rules, and examples are documented on the Configuration & Tools page.

Tool execution tracking on agent calls

A new tool executions endpoint lets you record webhook, template, human transfer, and built-in tool activity against a call. Execution records (including outcome, duration, and HTTP status) also appear in the call detail response.

April 18, 2026

FeatureUpdate

API v0.31.2

Cartesia Sonic 3 TTS

Cartesia Sonic 3 is available as a new TTS provider. It supports low-latency streaming synthesis over WebSocket with context-aware generation controls.

Reson8 STT

Reson8 STT v1 is available as a new STT provider. It supports real-time transcription over WebSocket with word-level timestamps, confidence scores, and partial results in nine languages including Dutch, French, German, and Spanish.

Deepgram Nova 3 Indic language endpoints

Four new SLNG-hosted Deepgram Nova 3 language variants are available in ap-south-1 (Mumbai): Kannada, Marathi, Tamil, and Telugu. Each has a dedicated WebSocket endpoint for that language.

Soniox Speech AI version correction

The Soniox STT endpoint is now correctly labeled Speech AI Real-time v3. URLs and navigation have been updated accordingly.

April 15, 2026

Feature

API v0.22.1

Batch speech-to-text API

You can now transcribe audio files asynchronously with the new Batch STT API. Upload a file, poll the job status, and download the transcript when ready. Supported formats include wav, mp3, flac, aac, ogg, m4a, mp4, amr, and mpeg. Powered by Speechmatics.

April 11, 2026

FeatureUpdate

API v0.22.0

Murf Falcon TTS

Murf Falcon is available as a new TTS provider. It supports multilingual speech synthesis over WebSocket with multiple encodings and sample rates.

Unified API documentation

The new Unified API section explains how to use one endpoint pattern for every STT and TTS model. Swap providers by changing only the URL path. Your auth, request format, and code stay the same. Includes guides on parameter coverage and supported models.

Integrations hub

A new Integrations page lists third-party platforms you can connect to SLNG. LiveKit, Cognigy, and Jambonz each have dedicated setup paths.

Whisper Large v3 endpoint consolidated

The separate Whisper Large v3 Compressed endpoint has been removed. Use the standard Whisper Large v3 endpoint, which now handles compressed audio directly.

April 9, 2026

FeatureUpdate

API v0.20.0

Language selection for Nova 3 STT

SLNG-hosted Deepgram Nova 3 STT endpoints accept a language parameter in the WebSocket init config. Supported locales by variant:

English: en, en-au, en-us, en-nz, en-gb, en-in
Spanish: es, es-us, es-419, es-ar, es-mx, es-es
Hindi: hi, en
Multi-language: multi

The Hindi variant also accepts en, so you can transcribe English audio without switching endpoints. See the Speech-to-Text models page for the full parameter list.

More sample rates for Rime Arcana TTS

Rime Arcana now supports 8, 16, 22.05, 24 (default), 32, 44.1, and 48 kHz. You can match your audio pipeline directly without resampling.

Simplified endpointing parameter

The endpointing parameter on Deepgram STT endpoints now accepts only an integer (milliseconds of silence before finalizing speech). Set it to 0 to disable. Default remains 10.

April 8, 2026

Feature

API v0.19.0

Graceful WebSocket session close

Send { "type": "close" } on any WebSocket connection to shut down cleanly. The server finishes processing remaining audio, then closes. This replaces the previous cancel behavior and works across TTS, STT, and bridges.

Keepalive for STT streams

Send { "type": "keepalive" } on STT WebSocket connections to prevent idle timeouts during pauses. Useful for voice agent sessions where the user goes silent but the connection should stay open.

Endpointing controls for Deepgram Nova STT

Two new parameters on Deepgram Nova STT models for tuning speech segmentation:

endpointing: milliseconds of silence before finalizing speech. Set to false to disable. Default: 10.
utterance_end_ms: milliseconds of silence between words before an UtteranceEnd event. Range: 200–5000 ms, default: 1000 ms.

India region for Nova 3 Hindi

Deepgram Nova 3 Hindi is now available in ap-south-1 (Mumbai), alongside ap-southeast-2 (Sydney). Use the X-Region-Override header to route to the closest region. See models by region.

​Gradium STT and TTS

​Kugel 2 Turbo TTS

​Rime Arcana v3 French

​ElevenLabs WebSocket controls

​Execution Layer documentation

​Streamlined navigation and a single API Reference

​Consolidated authentication and BYOK pages

​Pipecat plugin

​New Sarvam Bulbul and Soniox voices

​Updated Cartesia Sonic 3 voice metadata

​Gateway spec refinements

​Inworld Max 1.5 TTS

​Asia South region for Nova 3 English and Rime Arcana v3

​Region rename for Nova 3 Indian languages

​Supported LLMs for Voice Agents

​Moonshot Kimi K2 removed from Voice Agents

​Segment correlation for Cartesia Sonic 3

​Pronunciation hints on Cognigy and Jambonz TTS bridges

​Binary audio frames on the Unmute TTS bridge

​BYOK support for Kugel TTS

​HTTP streaming for Sarvam Bulbul v3

​Streaming transcription for Sarvam Saaras v3

​HTTP transcription for Nova 3 Indic languages

​Soniox TTS language coverage

​Runtime variables for voice agents

​New agent regions and world parts

​Voice agents no longer need a separate API key

​Deepgram Aura 2 region availability

​Pronunciation dictionaries for TTS

​Full voice catalogs on provider pages

​Batch API reference now matches the gateway

​URL-based audio for HTTP transcription

​Deepgram Nova 3 English region change

​voiceai CLI

​JavaScript and Python SDKs

​Agent skills for coding agents

​LiveKit Agents plugin

​Embed a voice agent on the web

​Rime Arcana v3 Spanish

​Rime Arcana v3 adds eu-north-1

​Deepgram Nova 3 adds asia-south1

​Deepgram Nova 3 Spanish region change

​Soniox TTS region override removed

​Expanded Murf Falcon voice catalog

​Simplified Unmute TTS bridge requests

​Rime Coda Indonesian TTS

​Region and world-part query parameters on bridges

​Cartesia Sonic 3, Murf Falcon, Kugel, Soniox, Reson8, and Sarvam in the Unified API

​Webhook tools support custom HTTP methods and raw payloads

​Tool execution tracking for voice agents

​ElevenLabs Flash v2.5 adds Asia Pacific region

​Nova 3 multi-language adds eu-north-1

​Deepgram Aura 2 English region change

​Nova 3 Hindi region change

​Soniox TTS v1 general availability

​Deepgram Aura 2 voice selection now required

​Kugel 2 TTS

​Soniox TTS v1-preview

​Voice catalog pages for Cartesia Sonic 3 and Murf Falcon

​Nova 3 multi-language adds EU region

​Nova 3 Hindi region change

​Soniox Speech AI Real-time v4

​LiveKit plugin compatibility refresh

​Sarvam Saaras v3 STT not supported in LiveKit plugin

​Expanded regions for Murf Falcon TTS

​Asia Pacific region for Soniox Speech AI Real-time v3

​URL and presigned S3 inputs for Batch STT

​Batch API usage guide

​Deepgram Aura 2 English in eu-north-1

​Whisper Large v3 Compressed removed

​Runtime variables for voice agents

​Webhook HTTP method and payload format

​Expanded regions for Rime Arcana v2 and Cartesia Sonic 3

​New regions for Deepgram Nova 3 English and Hindi

​Utterance end events on Unmute STT Bridge

​Native token stream for Soniox Speech AI

​Tool personalization for voice agents

​Tool execution tracking on agent calls

​Cartesia Sonic 3 TTS

​Reson8 STT

Gradium STT and TTS

Kugel 2 Turbo TTS

Rime Arcana v3 French

ElevenLabs WebSocket controls

Execution Layer documentation

Streamlined navigation and a single API Reference

Consolidated authentication and BYOK pages

Pipecat plugin

New Sarvam Bulbul and Soniox voices

Updated Cartesia Sonic 3 voice metadata

Gateway spec refinements

Inworld Max 1.5 TTS

Asia South region for Nova 3 English and Rime Arcana v3

Region rename for Nova 3 Indian languages

Supported LLMs for Voice Agents

Moonshot Kimi K2 removed from Voice Agents

Segment correlation for Cartesia Sonic 3

Pronunciation hints on Cognigy and Jambonz TTS bridges

Binary audio frames on the Unmute TTS bridge

BYOK support for Kugel TTS

HTTP streaming for Sarvam Bulbul v3

Streaming transcription for Sarvam Saaras v3

HTTP transcription for Nova 3 Indic languages

Soniox TTS language coverage

Runtime variables for voice agents

New agent regions and world parts

Voice agents no longer need a separate API key

Deepgram Aura 2 region availability

Pronunciation dictionaries for TTS

Full voice catalogs on provider pages

Batch API reference now matches the gateway

URL-based audio for HTTP transcription

Deepgram Nova 3 English region change

voiceai CLI

JavaScript and Python SDKs

Agent skills for coding agents

LiveKit Agents plugin

Embed a voice agent on the web

Rime Arcana v3 Spanish

Rime Arcana v3 adds eu-north-1

Deepgram Nova 3 adds asia-south1

Deepgram Nova 3 Spanish region change

Soniox TTS region override removed

Expanded Murf Falcon voice catalog

Simplified Unmute TTS bridge requests

Rime Coda Indonesian TTS

Region and world-part query parameters on bridges

Cartesia Sonic 3, Murf Falcon, Kugel, Soniox, Reson8, and Sarvam in the Unified API

Webhook tools support custom HTTP methods and raw payloads

Tool execution tracking for voice agents

ElevenLabs Flash v2.5 adds Asia Pacific region

Nova 3 multi-language adds eu-north-1

Deepgram Aura 2 English region change

Nova 3 Hindi region change

Soniox TTS v1 general availability

Deepgram Aura 2 voice selection now required

Kugel 2 TTS

Soniox TTS v1-preview

Voice catalog pages for Cartesia Sonic 3 and Murf Falcon

Nova 3 multi-language adds EU region

Nova 3 Hindi region change

Soniox Speech AI Real-time v4

LiveKit plugin compatibility refresh

Sarvam Saaras v3 STT not supported in LiveKit plugin

Expanded regions for Murf Falcon TTS

Asia Pacific region for Soniox Speech AI Real-time v3

URL and presigned S3 inputs for Batch STT

Batch API usage guide

Deepgram Aura 2 English in eu-north-1

Whisper Large v3 Compressed removed

Runtime variables for voice agents

Webhook HTTP method and payload format

Expanded regions for Rime Arcana v2 and Cartesia Sonic 3

New regions for Deepgram Nova 3 English and Hindi

Utterance end events on Unmute STT Bridge

Native token stream for Soniox Speech AI

Tool personalization for voice agents

Tool execution tracking on agent calls

Cartesia Sonic 3 TTS

Reson8 STT