# SLNG changelog

> Release notes for the SLNG speech and language API. New TTS and STT models, voice agent features, breaking changes, and bug fixes.

<Update label="June 16, 2026" description="API v0.53.0" tags={["Feature", "Update"]}>
  ## Gradium STT and TTS

  [Gradium STT](/api-reference/stt/gradium-stt/gradium-stt-default-ws) streams real-time multilingual transcription over WebSocket, with `Flush` and `EndOfStream` controls for finalizing buffered audio. [Gradium TTS](/api-reference/tts/gradium-tts/gradium-tts-default-ws) ships alongside it for streaming WebSocket synthesis and one-shot [HTTP synthesis](/api-reference/tts/gradium-tts/gradium-tts-default-http).

  ## Kugel 2 Turbo TTS

  [Kugel 2 Turbo](/api-reference/tts/kugel-2-turbo/kugel-2-turbo-ws) is the new low-latency tier in the Kugel TTS family. Stream high-quality synthesis with expressiveness controls over the unified TTS WebSocket protocol, with word timestamps and per-chunk completion events.

  ## Rime Arcana v3 French

  [Rime Arcana v3 French](/api-reference/tts/rime-arcana-v3/arcana-v3-french-ws) joins the Arcana v3 lineup. Synthesize French speech via WebSocket or [HTTP](/api-reference/tts/rime-arcana-v3/arcana-v3-french-http) using speakers like `livet_aurelie`.

  ## ElevenLabs WebSocket controls

  The ElevenLabs TTS WebSocket endpoints, including [Flash v2.5](/api-reference/tts/eleven-flash-v2-5/elevenlabs-flash-v2-5-ws), [v3](/api-reference/tts/eleven-v3/elevenlabs-v3-ws), and [Multilingual v2](/api-reference/tts/multilingual-v2/elevenlabs-multilingual-v2-ws), accept new init parameters: `auto_mode` to cut latency by skipping chunk scheduling, `sync_alignment` to attach alignment data to each audio chunk, `enable_ssml_parsing` for inline SSML, `inactivity_timeout` (1–180 s), and `chunk_length_schedule`. Set `enable_logging: false` for zero-retention mode on enterprise plans on both WebSocket and HTTP endpoints.
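
  As a rough sketch, you could validate these options on the client before sending them. The helper name is hypothetical, and where the options sit inside the init message is an assumption; check the endpoint reference for the exact shape.

```python
from typing import Any

# Init parameters named in the release note above.
ELEVENLABS_INIT_OPTIONS = frozenset({
    "auto_mode",
    "sync_alignment",
    "enable_ssml_parsing",
    "inactivity_timeout",
    "chunk_length_schedule",
    "enable_logging",
})


def elevenlabs_init_options(**options: Any) -> dict[str, Any]:
    """Hypothetical helper: validate options before adding them to an init message."""
    unknown = sorted(set(options) - ELEVENLABS_INIT_OPTIONS)
    if unknown:
        raise ValueError(f"unknown init options: {unknown}")
    timeout = options.get("inactivity_timeout")
    # The note above gives a 1 to 180 second range for inactivity_timeout.
    if timeout is not None and not 1 <= timeout <= 180:
        raise ValueError("inactivity_timeout must be between 1 and 180 seconds")
    return dict(options)


low_latency = elevenlabs_init_options(auto_mode=True, inactivity_timeout=60)
```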
</Update>

<Update label="June 12, 2026" description="API v0.52.3" tags={["Feature", "Update"]}>
  ## Execution Layer documentation

  The docs portal is reframed around the [Execution Layer](/execution-layer), the three-stage path that sits between your orchestrator and the models. Read how [STT Routing](/execution-layer/stt-routing), [Tiered Decisioning](/execution-layer/tiered-decisioning), and [Output Assembly](/execution-layer/output-assembly) cut latency and model cost across a voice call. The new [How It Works](/execution-layer/how-it-works) page walks through a single turn end to end.

  ## Streamlined navigation and a single API Reference

  The top bar collapses to **Documentation**, **Execution Layer**, **API Reference**, and **Models**. Every endpoint now lives under one [API Reference](/api-reference/overview), grouped by Unified API, TTS, STT, Voice Agents, Batch, and Bridges, so you can find a model without switching tabs.

  ## Consolidated authentication and BYOK pages

  API key creation, rotation, and request authentication are merged into a single [Authentication & API Keys](/authentication) page. Bring-your-own-key setup for every supported provider now lives on one [BYOK](/execution-layer/byok) page covering caching behavior and provider key handling.
</Update>

<Update label="June 11, 2026" tags={["Feature"]}>
  ## Pipecat plugin

  The [`pipecat-slng`](/agents/pipecat-plugin) Python package connects a [Pipecat](https://github.com/pipecat-ai/pipecat) pipeline to any STT or TTS model on the SLNG gateway. It ships `SlngSTTService` and `SlngTTSService` for low-latency streaming over WebSocket, plus `SlngHttpTTSService` for non-streaming synthesis. Swap the `model` string to switch provider or region without changing your pipeline code.
</Update>

<Update label="June 10, 2026" description="API v0.52.2" tags={["Update"]}>
  ## New Sarvam Bulbul and Soniox voices

  [Sarvam Bulbul v3](/api-reference/tts/sarvam-ai-bulbul-v3/bulbul-v3-ws) adds the Tamil voice `ta-IN-rahul` and the multilingual `shubh`, while `gu-IN-rahul` is retired. [Soniox TTS v1](/api-reference/tts/soniox-tts-v1/soniox-tts-v1-ws) adds four English voices: `Maya`, `Daniel`, `Noah`, and `Nina`.

  ## Updated Cartesia Sonic 3 voice metadata

  The 745-voice [Cartesia Sonic 3](/api-reference/tts/cartesia-sonic-3/cartesia-sonic-3-ws) catalog has refreshed metadata. Voice IDs are unchanged, so existing requests keep working.
</Update>

<Update label="June 3, 2026" description="API v0.52.1" tags={["Fix"]}>
  ## Gateway spec refinements

  Patch-level updates to the unified gateway specification clean up schemas across the TTS, STT, and bridge surfaces. Existing requests and responses keep working without changes.
</Update>

<Update label="June 2, 2026" description="API v0.52.0" tags={["Feature", "Update"]}>
  ## Inworld Max 1.5 TTS

  [Inworld Max 1.5](/api-reference/tts/inworld-max-1-5/inworld-max-1-5-ws) is available on the SLNG-hosted TTS API in `us-east-1`. Synthesize up to 2,000 characters per request across 16 languages, pick from 100+ named voices, and tune output with `temperature`, `language` (BCP-47), and word- or character-level timestamps.

  ## Asia South region for Nova 3 English and Rime Arcana v3

  [Deepgram Nova 3 English](/api-reference/stt/deepgram-nova-3/nova-3-english-ws) and [Rime Arcana v3 English](/api-reference/tts/rime-arcana-v3/arcana-v3-english-ws) can now route to `asia-south1`. Set `X-Region-Override: asia-south1` to keep traffic in-region for users in India.

  ## Region rename for Nova 3 Indian languages

  The Hindi, Marathi, Kannada, Tamil, and Telugu variants of [Deepgram Nova 3](/api-reference/stt/deepgram-nova-3/nova-3-ws) now expose a single `asia-south1` region. The legacy `ap-south-1` value is no longer accepted on `X-Region-Override`; switch any pinned requests to `asia-south1`.
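
  If you pin regions in client code, a small shim can upgrade the legacy value before it reaches the header. This is a minimal sketch with a hypothetical helper name, and it only covers the rename described above for the Nova 3 Indian-language variants.

```python
# Legacy value named in the note above, mapped to its replacement.
RENAMED_REGIONS = {"ap-south-1": "asia-south1"}


def region_override_headers(region: str) -> dict[str, str]:
    """Hypothetical helper: build the override header, upgrading the legacy value."""
    return {"X-Region-Override": RENAMED_REGIONS.get(region, region)}
```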
</Update>

<Update label="May 29, 2026" description="API v0.51.2" tags={["Update"]}>
  ## Supported LLMs for Voice Agents

  The `llm` field on [Voice Agents](/voice-agents) is now a closed set: `bedrock-mantle/nvidia.nemotron-super-3-120b`, `bedrock-mantle/nvidia.nemotron-nano-3-30b`, and `groq/openai/gpt-oss-120b`. Pick one of these IDs when you create or update an agent so requests route to a live model.

  ## Moonshot Kimi K2 removed from Voice Agents

  `groq/moonshotai/kimi-k2-instruct-0905` is no longer routable for Voice Agents. Switch existing agents and example payloads to `groq/openai/gpt-oss-120b` (or another model from the supported list) to keep calls working.
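
  One way to catch stale model IDs before they reach the API is a client-side check against the supported list. The helper below is a hypothetical sketch; the IDs come from the notes above, and the automatic swap to `groq/openai/gpt-oss-120b` is a choice you may prefer to make by hand.

```python
# Supported `llm` values listed in the note above.
SUPPORTED_LLMS = frozenset({
    "bedrock-mantle/nvidia.nemotron-super-3-120b",
    "bedrock-mantle/nvidia.nemotron-nano-3-30b",
    "groq/openai/gpt-oss-120b",
})

# Removed ID mapped to the replacement suggested above.
REPLACEMENTS = {"groq/moonshotai/kimi-k2-instruct-0905": "groq/openai/gpt-oss-120b"}


def resolve_llm(llm_id: str) -> str:
    """Hypothetical helper: swap the removed ID, then reject anything unsupported."""
    llm_id = REPLACEMENTS.get(llm_id, llm_id)
    if llm_id not in SUPPORTED_LLMS:
        raise ValueError(f"unsupported llm: {llm_id}")
    return llm_id
```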
</Update>

<Update label="May 28, 2026" description="API v0.51.1" tags={["Feature", "Update"]}>
  ## Segment correlation for Cartesia Sonic 3

  [Cartesia Sonic 3](/api-reference/tts/cartesia-sonic-3/cartesia-sonic-3-ws) audio chunks now carry a `flush_id`. The counter starts at `0` and increments each time you send a `text` input with `flush: true` or a standalone flush message. Use it to map streamed audio frames back to the text segment that produced them.
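
  A client can mirror the counter locally to look up the text behind each chunk. The class below is a hypothetical sketch. It assumes that text sent with `flush: true` belongs to the segment being closed, which you should confirm against the endpoint reference.

```python
class FlushTracker:
    """Hypothetical client-side mirror of the server's `flush_id` counter."""

    def __init__(self) -> None:
        self.current_id = 0  # the counter starts at 0
        self.segments: dict[int, list[str]] = {}

    def send_text(self, text: str, flush: bool = False) -> None:
        # Assumption: text sent with flush=True closes the current segment.
        self.segments.setdefault(self.current_id, []).append(text)
        if flush:
            self.current_id += 1

    def send_flush(self) -> None:
        # A standalone flush message also advances the counter.
        self.current_id += 1

    def text_for(self, flush_id: int) -> str:
        """Return the text that produced audio chunks tagged with `flush_id`."""
        return "".join(self.segments.get(flush_id, []))


tracker = FlushTracker()
tracker.send_text("Hello, ")
tracker.send_text("world.", flush=True)
tracker.send_text("Second segment.")
tracker.send_flush()
```

  Call `send_text` and `send_flush` alongside the real WebSocket sends, then pass the `flush_id` from each audio chunk to `text_for`.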

  ## Pronunciation hints on Cognigy and Jambonz TTS bridges

  The [Cognigy TTS bridge](/api-reference/bridges/cognigy-tts-bridge/cognigy-tts-bridge-ws) and [Jambonz TTS bridge](/api-reference/bridges/jambonz-tts-bridge/jambonz-tts-bridge-ws) accept an optional `pronunciation` object on `text` messages. Set `mode` (`ipa` or `ssml`), `name`, and `dictionary_id` to override how a term is spoken per chunk on providers that support [pronunciation dictionaries](/pronunciation-dictionaries).

  ## Binary audio frames on the Unmute TTS bridge

  The [Unmute TTS bridge](/api-reference/unified-api/unmute-tts-bridge/unmute-tts-bridge-ws) now sends raw binary WebSocket frames instead of JSON `audio_chunk`, `segment_start`, and `segment_end` envelopes. The encoding and sample rate match the values you pass in the init message, so you can write the bytes straight to your audio pipeline. The `clear` control message has been removed; error and `audio_end` payloads now carry their fields under a `data` envelope (with `message` and `code` duplicated at the top level for backward compatibility).
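
  A receive loop therefore has to branch on frame type. The sketch below uses hypothetical helper names and assumes that control messages still arrive as JSON text frames; the `type` and `code` values in the usage are illustrative only.

```python
import json
from typing import Any, Optional, Union


def handle_frame(frame: Union[bytes, str], audio: bytearray) -> Optional[dict[str, Any]]:
    """Append binary frames to the audio buffer; decode text frames as JSON events."""
    if isinstance(frame, (bytes, bytearray)):
        audio.extend(frame)
        return None
    return json.loads(frame)


def error_details(event: dict[str, Any]) -> tuple[Any, Any]:
    """Prefer fields under `data`, falling back to the duplicated top-level copies."""
    data = event.get("data") or {}
    message = data.get("message", event.get("message"))
    code = data.get("code", event.get("code"))
    return message, code


buffer = bytearray()
handle_frame(b"\x00\x01", buffer)  # raw audio goes straight into the buffer
event = handle_frame('{"type": "error", "data": {"message": "bad voice", "code": "x"}}', buffer)
```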
</Update>

<Update label="May 26, 2026" description="API v0.50.1" tags={["Update"]}>
  ## BYOK support for Kugel TTS

  You can now bring your own Kugel API key when calling [Kugel 1 Turbo](/api-reference/tts/kugel-1-turbo/kugel-1-turbo-ws), [Kugel 1](/api-reference/tts/kugel-1/kugel-1-ws), and [Kugel 2](/api-reference/tts/kugel-2/kugel-2-ws). Pass it on the `X-Slng-Provider-Key` header so Kugel bills your account directly, while the SLNG gateway's TTS cache still serves repeat requests. See [Bring your own key](/execution-layer/byok) for the full provider list.
</Update>

<Update label="May 23, 2026" description="API v0.50.0" tags={["Feature"]}>
  ## HTTP streaming for Sarvam Bulbul v3

  [Sarvam AI Bulbul Stream v3](/api-reference/tts/sarvam-ai-bulbul-stream-v3/bulbul-stream-v3-http) is now available as a chunked HTTP endpoint. The response is raw audio bytes in the codec you set with `output_audio_codec` (there is no JSON envelope and no `X-Duration` header), so you can start playback as soon as the first chunk arrives. Use it to reach the same 30+ Indian-language voices as `bulbul:v3` with lower time-to-first-byte.
</Update>

<Update label="May 22, 2026" description="API v0.49.1" tags={["Feature"]}>
  ## Streaming transcription for Sarvam Saaras v3

  Sarvam AI [Saaras v3](/api-reference/stt/sarvam-ai-saaras/saaras-v3-ws) is now available over WebSocket for real-time transcription across 23 Indian languages. Configure each session with query parameters on the upgrade URL (`language-code`, `mode`, `sample_rate`, `input_audio_codec`, `high_vad_sensitivity`, and `vad_signals`) to tune voice activity detection and audio handling per stream.
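
  Because the settings travel in the query string, you build them into the URL before opening the socket. In this sketch the helper name, the placeholder host, the sample values, and the lowercase `true`/`false` encoding for booleans are all assumptions.

```python
from urllib.parse import urlencode


def with_session_params(base_url: str, params: dict[str, object]) -> str:
    """Append session settings to the WebSocket upgrade URL as a query string."""
    encoded = urlencode(
        {
            # Assumption: booleans are sent as lowercase "true" / "false".
            key: str(value).lower() if isinstance(value, bool) else value
            for key, value in params.items()
        }
    )
    return f"{base_url}?{encoded}"


url = with_session_params(
    "wss://example.invalid/saaras-v3",  # placeholder, not the real endpoint
    {"language-code": "hi-IN", "sample_rate": 16000, "vad_signals": True},
)
```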
</Update>

<Update label="May 21, 2026" description="API v0.48.1" tags={["Feature", "Update"]}>
  ## HTTP transcription for Nova 3 Indic languages

  Deepgram Nova 3 now exposes HTTP endpoints for [Kannada](/api-reference/stt/deepgram-nova-3/nova-3-kannada-http), [Marathi](/api-reference/stt/deepgram-nova-3/nova-3-marathi-http), [Tamil](/api-reference/stt/deepgram-nova-3/nova-3-tamil-http), and [Telugu](/api-reference/stt/deepgram-nova-3/nova-3-telugu-http). You can transcribe these languages with a single request (binary upload or `url` field) instead of opening a WebSocket.

  ## Soniox TTS language coverage

  [Soniox TTS](/voices/soniox) now ships voices for 60+ languages, including Afrikaans, Bengali, Bulgarian, Catalan, Czech, Dutch, Greek, Gujarati, Hebrew, Indonesian, Kannada, Malay, Marathi, Norwegian, Tamil, Telugu, and more. Set the `language` field to any supported ISO 639-1 code on the [Soniox TTS HTTP endpoint](/api-reference/tts/soniox-tts-v1/soniox-tts-v1-http) to pick a voice catalog.

  ## Runtime variables for voice agents

  Voice agents can now declare `runtime_variables` so the model can capture values during a call and reuse them later in webhook URLs or system tool arguments. The built-in `set_runtime_variables` tool writes the values, which persist for the lifetime of the call or web session. See [Voice agents](/voice-agents) for the setup pattern.

  ## New agent regions and world parts

  The [region override](/region-override) catalog now includes the `eu-non-eu` and `me` world parts, plus the `asia-south1`, `asia-southeast2`, and `australia-southeast1` regions. Pin requests to Sydney, Jakarta, or a non-EU European region with `X-Region-Override`, or stay inside the Middle East with `X-World-Part-Override: me`.

  ## Voice agents no longer need a separate API key

  Voice agent create and duplicate requests no longer accept `slng_api_key`. Agents now use the API key you authenticate the request with. Drop the field from your payloads. Agent duplication also stops copying the inbound connection and call history; reconnect inbound routing on the copy if you need it.

  ## Deepgram Aura 2 region availability

  SLNG-hosted [Deepgram Aura 2 English](/api-reference/tts/deepgram-aura-2/aura-2-english-http) is no longer available in `us-east-1`. Route English TTS to `eu-north-1`. [Aura 2 Spanish](/api-reference/tts/deepgram-aura-2/aura-2-spanish-http) drops the `na` world part and now serves only `eu`.
</Update>

<Update label="May 20, 2026" description="API v0.46.0" tags={["Feature", "Update"]}>
  ## Pronunciation dictionaries for TTS

  You can now create reusable [pronunciation dictionaries](/pronunciation-dictionaries) and attach them to any SLNG TTS request. Define rewrite rules once for brand names, acronyms, or domain terms, then reference the dictionary from HTTP, WebSocket, or Unified TTS calls. Manage dictionaries through the new [pronunciation dictionary endpoints](/api-reference/tts/pronunciation-dictionaries/create-pronunciation-dictionary-http).

  ## Full voice catalogs on provider pages

  Provider voice pages now list every voice in the catalog instead of capping the table at ten per language. The [Cartesia Sonic 3](/voices/cartesia-sonic-3) page shows all 745 voices, [Sarvam Bulbul](/voices/sarvam-bulbul) shows 405, [Deepgram Aura](/voices/deepgram-aura) shows 91, [Soniox](/voices/soniox) shows 120, [Murf](/voices/murf) shows 111, and [Kugel](/voices/kugel) shows 100. You can search and copy any voice ID directly from the provider page.
</Update>

<Update label="May 19, 2026" description="API v0.44.2" tags={["Update", "Fix"]}>
  ## Batch API reference now matches the gateway

  The [Batch API reference](/api-reference/speechmatics/create-batch-job) and [Batch API guide](/batch-guide) are realigned with the live `api.batch.slng.ai` gateway. Request and response schemas, supported audio formats, and the three submission flows (direct upload, URL input, and presigned S3 upload) reflect what the service actually accepts.
</Update>

<Update label="May 16, 2026" description="API v0.44.1" tags={["Feature", "Update"]}>
  ## URL-based audio for HTTP transcription

  The [Whisper Large v3](/api-reference/stt/whisper-large-v3/whisper-large-v3-http), [Cognigy STT bridge](/api-reference/bridges/cognigy-stt-bridge/cognigy-stt-bridge-http), [Jambonz STT bridge](/api-reference/bridges/jambonz-stt-bridge/jambonz-stt-bridge-http), and [Unmute STT bridge](/api-reference/unified-api/unmute-stt-bridge/unmute-stt-bridge-http) HTTP endpoints now accept a `url` field pointing to a publicly accessible audio file. Send a JSON body with `url` and `language` instead of a multipart upload when your audio is already hosted somewhere.
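
  The request then becomes a plain JSON POST. The sketch below only builds the request object and does not send it; the helper name, the placeholder endpoint, and the `Bearer` authorization scheme are assumptions, so follow the [Authentication & API Keys](/authentication) page for the real header.

```python
import json
from urllib.request import Request


def build_url_transcription_request(
    endpoint: str, api_key: str, audio_url: str, language: str
) -> Request:
    """Hypothetical helper: JSON body with `url` and `language` instead of multipart."""
    body = json.dumps({"url": audio_url, "language": language}).encode("utf-8")
    return Request(
        endpoint,
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
        },
    )


request = build_url_transcription_request(
    "https://example.invalid/stt",  # placeholder endpoint
    "test-key",
    "https://example.invalid/audio.wav",
    "en",
)
```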

  ## Deepgram Nova 3 English region change

  SLNG-hosted [Deepgram Nova 3 English](/api-reference/stt/deepgram-nova-3/nova-3-english-http) is no longer available in `eu-north-1`. Route English transcription to `australia-southeast1` or `us-east-1` instead.
</Update>

<Update label="May 15, 2026" description="API v0.43.3" tags={["Feature"]}>
  ## voiceai CLI

  The new [`voiceai` CLI](/sdks/cli) runs text-to-speech and speech-to-text from a terminal. Install it with `curl`, Homebrew, or `npm`, then pipe audio between SLNG models and other tools without writing an HTTP client.

  ## JavaScript and Python SDKs

  The typed [JavaScript SDK](/sdks/javascript) (`voiceai-sdk` on npm) supports Node, Bun, and Deno. The [Python SDK](/sdks/python) (`voiceai-sdk` on PyPI) ships sync and async clients for Python 3.9+. Both wrap the full STT and TTS surface so you can drop the raw `fetch` and WebSocket plumbing.

  ## Agent skills for coding agents

  The [`slng-ai/skills`](/sdks/skills) pack teaches Claude Code and similar coding agents to call SLNG directly. Point your agent at the skills repo and it can pick models, build init messages, and stream audio for you.

  ## LiveKit Agents plugin

  The [`livekit-plugins-slng`](/agents/livekit-plugin) Python package connects [LiveKit Agents](https://docs.livekit.io/agents/) to any STT or TTS model on the SLNG gateway with a single configuration switch. You can swap providers or regions without changing your agent code.

  ## Embed a voice agent on the web

  A new [browser embed guide](/agents/embed-web) walks through adding a SLNG voice session to any web page. It uses LiveKit, a React frontend, and a backend proxy that keeps your API key off the client.
</Update>

<Update label="May 14, 2026" description="API v0.43.2" tags={["Feature", "Update"]}>
  ## Rime Arcana v3 Spanish

  [Rime Arcana v3 (Spanish)](/api-reference/tts/rime-arcana-v3/arcana-v3-spanish-ws) is available as a new TTS endpoint, with both streaming WebSocket and one-shot HTTP synthesis. Choose from ten Spanish voices (`aurelio`, `celestino`, `lark`, `luz`, `mar`, `nova`, `pola`, `seraphina`, `sirius`, and `ursa`) and pass `model: rime/arcana:3-es` on init.

  ## Rime Arcana v3 adds eu-north-1

  [Arcana v3 English](/api-reference/tts/rime-arcana-v3/arcana-v3-english-ws), [Hindi](/api-reference/tts/rime-arcana-v3/arcana-v3-hindi-ws), and Spanish are now available in `eu-north-1`. You can route Arcana v3 synthesis to North Europe for lower latency in that region.

  ## Deepgram Nova 3 adds asia-south1

  [Deepgram Nova 3](/api-reference/stt/deepgram-nova-3/nova-3-multi-language-ws) Tamil, Telugu, Marathi, and Kannada are now available in `asia-south1`, in addition to `ap-south-1`. You can route South Asian language transcription to the Mumbai GCP region.

  ## Deepgram Nova 3 Spanish region change

  Deepgram Nova 3 Spanish is no longer available in `ap-southeast-2`. Use `australia-southeast1` or `us-east-1` instead.

  ## Soniox TTS region override removed

  The `x-region` header is no longer accepted on [Soniox TTS v1](/api-reference/tts/soniox-tts-v1/soniox-tts-v1-ws) requests. Soniox TTS runs only in `na`, so requests are routed there automatically.
</Update>

<Update label="May 13, 2026" description="API v0.42.2" tags={["Update", "Fix"]}>
  ## Expanded Murf Falcon voice catalog

  The [Murf Falcon](/voices/murf) voice catalog now lists 133 voices across more than 20 locales, including new entries for Bengali, Tamil, Telugu, Gujarati, Kannada, Punjabi, Japanese, Korean, Portuguese, Dutch, Polish, Greek, Croatian, and Scottish English. You can browse voices by full locale code and copy the `voice_id` for use in your init message.

  ## Simplified Unmute TTS bridge requests

  The [Unmute TTS bridge](/api-reference/unified-api/unmute-tts-bridge/unmute-tts-bridge-http) no longer requires a `model` field on HTTP or WebSocket init messages. The model is now inferred from the `{model_variant}` path. Send only `voice` and `text` for HTTP requests, or `voice` plus optional `config` on the WebSocket init.
</Update>

<Update label="May 7, 2026" description="API v0.42.1" tags={["Feature"]}>
  ## Rime Coda Indonesian TTS

  [Rime Coda](/api-reference/tts/rime-coda/coda-indonesian-ws) is available as a new TTS model in the `asia-southeast2` (Jakarta) region. It synthesizes Bahasa Indonesia with low latency across four voices (`pujianti_plesmita`, `siswoko_sigit`, `taryadi_dani`, and `usmany_tatianna`) and supports streaming WebSocket and one-shot HTTP synthesis.

  ## Region and world-part query parameters on bridges

  The Cognigy, Jambonz, and Unmute HTTP bridges now accept `?region=` and `?world-part=` query parameters, mirroring the `X-Region-Override` and `X-World-Part-Override` headers. Use the query form when your platform cannot set custom headers; if both are present, the header wins. See [Integrations overview](/integrations/overview#region-and-world-part-overrides) for details.
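
  The precedence rule can be pictured as a simple lookup. This is an illustrative model of the behavior described above, not the gateway's code, and it assumes exact-case header keys for brevity.

```python
from typing import Mapping, Optional


def effective_region(headers: Mapping[str, str], query: Mapping[str, str]) -> Optional[str]:
    """Return the region override, letting the header win over the query parameter."""
    return headers.get("X-Region-Override") or query.get("region")
```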
</Update>

<Update label="May 6, 2026" description="API v0.41.2" tags={["Feature", "Update"]}>
  ## Cartesia Sonic 3, Murf Falcon, Kugel, Soniox, Reson8, and Sarvam in the Unified API

  The [Unified API](/execution-layer/unified-api-models) now routes to Cartesia Sonic 3, Murf Falcon, KugelAudio Kugel 1/1-Turbo/2, Soniox TTS v1, Soniox Speech AI v4, Reson8 STT v1, and Sarvam Saaras v3. You can swap between these providers using a single request shape: pass identifiers like `cartesia/sonic:3`, `murf/murftts:falcon`, `kugelaudio/kugel:2`, `soniox/tts-rt:v1`, `soniox/speech-ai:rt-v4`, or `reson8/reson8stt:v1`.

  ## Webhook tools support custom HTTP methods and raw payloads

  [Voice agent webhook tools](/examples/agents-config#system-triggered-webhooks) now accept `http_method` (`POST`, `PUT`, `PATCH`, or `DELETE`) and `webhook_format` (`envelope` or `raw`). Set `webhook_format` to `raw` to send only the tool arguments when the receiving service cannot parse the SLNG envelope.
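
  If you generate tool configs in code, you can check both fields against the accepted values first. The helper is a hypothetical sketch; only the two field names and their values come from the note above, and the rest of the tool config is omitted.

```python
HTTP_METHODS = frozenset({"POST", "PUT", "PATCH", "DELETE"})
WEBHOOK_FORMATS = frozenset({"envelope", "raw"})


def webhook_tool_options(http_method: str, webhook_format: str) -> dict[str, str]:
    """Hypothetical helper: validate the two fields before merging them into a tool config."""
    if http_method not in HTTP_METHODS:
        raise ValueError(f"unsupported http_method: {http_method}")
    if webhook_format not in WEBHOOK_FORMATS:
        raise ValueError(f"unsupported webhook_format: {webhook_format}")
    return {"http_method": http_method, "webhook_format": webhook_format}
```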

  ## Tool execution tracking for voice agents

  You can record webhook, template, human-transfer, and built-in tool activity against a call by posting to the [tool executions endpoint](/api-reference/calls/submit-tool-execution). Each record carries the outcome, duration, and HTTP status, and submitted executions surface on the [call detail response](/api-reference/calls/get-call) for debugging or analytics.

  ## ElevenLabs Flash v2.5 adds Asia Pacific region

  [ElevenLabs Flash v2.5](/api-reference/tts/eleven-flash-v2-5/elevenlabs-flash-v2-5-ws) is now available in `ap`, in addition to `eu`. You can route synthesis to Asia Pacific endpoints for lower latency in that region.

  ## Nova 3 multi-language adds eu-north-1

  [Deepgram Nova 3 multi-language](/api-reference/stt/deepgram-nova-3/nova-3-multi-language-ws) is now available in `eu-north-1` as a specific region (in addition to the broader `eu` world part), giving you direct routing to North Europe.

  ## Deepgram Aura 2 English region change

  [Deepgram Aura 2 English](/api-reference/tts/deepgram-aura-2/aura-2-english-ws) is no longer available in `ap-southeast-2`. Use `eu-north-1` or `us-east-1` instead.

  ## Nova 3 Hindi region change

  [Deepgram Nova 3 Hindi](/api-reference/stt/deepgram-nova-3/nova-3-hindi-ws) is no longer available in `ap-southeast-2`. Use `asia-south1` instead.
</Update>

<Update label="April 30, 2026" description="API v0.40.3" tags={["Update"]}>
  ## Soniox TTS v1 general availability

  [Soniox TTS](/api-reference/tts/soniox-tts-v1/soniox-tts-v1-ws) graduates from preview to v1. Update your client to call `tts-rt:v1` and set `model` to `tts-rt-v1`. The `v1-preview` path and `tts-rt-v1-preview` model identifier are retired.

  ## Deepgram Aura 2 voice selection now required

  The `model` field is now required on every [Deepgram Aura 2 English](/api-reference/tts/deepgram-aura-2/aura-2-english-ws) and [Spanish](/api-reference/tts/deepgram-aura-2/aura-2-spanish-ws) request. The previous defaults (`aura-2-thalia-en` and `aura-2-celeste-es`) no longer apply, so you must pick a voice explicitly.
</Update>

<Update label="April 29, 2026" description="API v0.40.1" tags={["Feature", "Update"]}>
  ## Kugel 2 TTS

  [Kugel 2](/api-reference/tts/kugel-2/kugel-2-ws) is available as a new TTS model in `eu`. It offers 87 voices with expressiveness control across 26 languages, including Arabic, Chinese, Hindi, Japanese, Korean, and Vietnamese.

  ## Soniox TTS v1-preview

  [Soniox TTS v1-preview](/api-reference/tts/soniox-tts-v1/soniox-tts-v1-ws) is available as a new TTS model in `na`, with both streaming WebSocket and one-shot HTTP synthesis. Browse the voice catalog on the [Soniox TTS voices](/voices/soniox) page.

  ## Voice catalog pages for Cartesia Sonic 3 and Murf Falcon

  You can now browse [Cartesia Sonic 3](/voices/cartesia-sonic-3) and [Murf Falcon](/voices/murf) voices with audio samples directly in the docs. Each entry shows the `voice_id` to pass in your init message.

  ## Nova 3 multi-language adds EU region

  [Deepgram Nova 3 multi-language](/api-reference/stt/deepgram-nova-3/nova-3-multi-language-ws) is now available in `eu`, in addition to `ap-southeast-2` and `us-east-1`. You can route multilingual transcription to European endpoints for lower latency.

  ## Nova 3 Hindi region change

  [Deepgram Nova 3 Hindi](/api-reference/stt/deepgram-nova-3/nova-3-hindi-ws) is no longer available in `ap-south-1`. Use `ap-southeast-2` or `asia-south1` instead.
</Update>

<Update label="April 25, 2026" description="API v0.38.0" tags={["Update"]}>
  ## Soniox Speech AI Real-time v4

  Soniox Speech AI moves to v4. Use the [Speech AI Real-time v4](/api-reference/stt/soniox-speech-ai-real-time-v4/speech-ai-real-time-v4-ws) endpoint for streaming transcription with speaker diarization, automatic language identification, and configurable endpoint detection across 60+ languages. The v3 endpoint has been retired. Point clients at the new path to continue receiving native Soniox token frames.

  ## LiveKit plugin compatibility refresh

  The [LiveKit Agents plugin](/agents/livekit-plugin) now targets `livekit-agents>=1.5.1` and Python 3.10+. You can pass model-specific options as keyword arguments, for example `whisper_params` for Whisper, `target_language_code` for Sarvam STT, or `modelId` and `speakingStyle` for Rime Arcana. New `slng_base_url` and `http_session` arguments let you point at a self-hosted gateway and reuse an `aiohttp.ClientSession`.

  ## Sarvam Saaras v3 STT not supported in LiveKit plugin

  Saaras is HTTP-only on SLNG and has no WebSocket endpoint, so it cannot run through the LiveKit plugin's realtime path. For Hindi voice agents, use `slng/deepgram/nova:3-hi` or `slng/deepgram/nova:3-multi`. See the [LiveKit plugin provider notes](/agents/livekit-plugin#provider-notes) for details.
</Update>

<Update label="April 24, 2026" description="API v0.37.0" tags={["Update"]}>
  ## Expanded regions for Murf Falcon TTS

  [Murf Falcon](/api-reference/tts/murf-falcon/murf-falcon-ws) is now available in `ap`, `eu-non-eu`, `me`, and `na`, in addition to the existing `eu` world part. You can route synthesis closer to users across the Americas, Asia Pacific, and the Middle East.

  ## Asia Pacific region for Soniox Speech AI Real-time v3

  [Soniox Speech AI Real-time v3](/api-reference/stt/soniox-speech-ai-real-time-v4/speech-ai-real-time-v4-ws) adds the `ap` world part alongside `eu` and `na`. Route transcription to Asia Pacific endpoints for lower latency in that region.
</Update>

<Update label="April 23, 2026" description="API v0.36.0" tags={["Feature", "Update"]}>
  ## URL and presigned S3 inputs for Batch STT

  You can now submit audio to the [Batch STT API](/batch-guide) without uploading a file on every request. Pass a publicly accessible `input_url`, or request a presigned S3 URL, upload directly, then create the job with the returned `s3_key`. Both methods accept an optional `metadata` object for attaching arbitrary key-value pairs to a job.
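
  The two new inputs are alternatives, so a job request should carry one or the other. The sketch below builds only the source part of the payload; the helper name is hypothetical, and any other fields the job endpoint needs are left out.

```python
from typing import Any, Mapping, Optional


def batch_job_source(
    input_url: Optional[str] = None,
    s3_key: Optional[str] = None,
    metadata: Optional[Mapping[str, Any]] = None,
) -> dict[str, Any]:
    """Hypothetical helper: pick exactly one audio source and attach optional metadata."""
    if (input_url is None) == (s3_key is None):
        raise ValueError("pass exactly one of input_url or s3_key")
    payload: dict[str, Any] = (
        {"input_url": input_url} if input_url is not None else {"s3_key": s3_key}
    )
    if metadata is not None:
        payload["metadata"] = dict(metadata)
    return payload
```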

  ## Batch API usage guide

  A new [Batch API guide](/batch-guide) walks through the three input methods (file upload, URL input, and presigned S3 upload) with request flows and sample payloads.

  ## Deepgram Aura 2 English in eu-north-1

  [Deepgram Aura 2 English TTS](/api-reference/tts/deepgram-aura-2/aura-2-english-ws) is now available in `eu-north-1`, in addition to `ap-southeast-2` and `us-east-1`.

  ## Whisper Large v3 Compressed removed

  The Whisper Large v3 Compressed STT model has been retired from the catalog. Use [Whisper Large v3](/api-reference/stt/whisper-large-v3/whisper-large-v3-ws) for multilingual transcription going forward.
</Update>

<Update label="April 22, 2026" description="API v0.32.0" tags={["Feature", "Update"]}>
  ## Runtime variables for voice agents

  Voice agents can now capture values during a call and reuse them in webhook URLs and system tool arguments. Define a `runtime_variables` array on your agent, and the model sets values through the built-in `set_runtime_variables` tool. See the [agent configuration examples](/examples/agents-config#runtime-variables) for setup details.

  ## Webhook HTTP method and payload format

  Webhook tools on voice agents now accept `http_method` (`POST`, `PUT`, `PATCH`, or `DELETE`) and `webhook_format` (`envelope` or `raw`). Use `raw` to send only the tool arguments as the request body, skipping the SLNG metadata envelope. Both fields are documented in the [Voice Agents API reference](/api-reference/agents/create-agent).

  ## Expanded regions for Rime Arcana v2 and Cartesia Sonic 3

  [Rime Arcana v2 TTS](/api-reference/tts/rime-arcana-v2/arcana-v2-english-ws) is now available in `eu-north-1` and `us-east-1`, in addition to `ap-southeast-2`. [Cartesia Sonic 3 TTS](/api-reference/tts/cartesia-sonic-3/cartesia-sonic-3-ws) is now available in all three world parts: `ap`, `eu`, and `na`.

  ## New regions for Deepgram Nova 3 English and Hindi

  [Nova 3 English](/api-reference/stt/deepgram-nova-3/nova-3-english-ws) is now available in `ap-south-1` and `us-east-1`. [Nova 3 Hindi](/api-reference/stt/deepgram-nova-3/nova-3-hindi-ws) adds `asia-south1` alongside existing regions.

  ## Utterance end events on Unmute STT Bridge

  The [Unmute STT Bridge](/api-reference/unified-api/unmute-stt-bridge/unmute-stt-bridge-ws) now emits `utterance_end` events when the upstream model signals the end of a spoken utterance. This gives you an explicit boundary marker for segmenting transcription output.

  ## Native token stream for Soniox Speech AI

  [Soniox Speech AI Real-time v3](/api-reference/stt/soniox-speech-ai-real-time-v4/speech-ai-real-time-v4-ws) now returns native Soniox token frames instead of normalized transcripts. You receive interim and final tokens directly, including `<end>` and `<fin>` endpoint markers when endpoint detection is enabled.
</Update>

<Update label="April 20, 2026" description="API v0.31.3" tags={["Feature"]}>
  ## Tool personalization for voice agents

  You can now use `{{variable}}` placeholders in runtime tool fields: webhook URLs, system webhook argument values, human transfer phone numbers, and built-in timezones. Values resolve when the tool executes, not at session start, so a missing tool variable does not block the call. Supported surfaces, validation rules, and examples are documented on the [Configuration & Tools](/examples/agents-config#tool-personalization) page.
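
  To make execution-time resolution concrete, here is an illustrative resolver. It is not the platform's implementation: the function name, the tolerance for spaces inside the braces, and the choice to leave unknown placeholders untouched are all assumptions.

```python
import re
from typing import Any, Mapping

PLACEHOLDER = re.compile(r"\{\{\s*(\w+)\s*\}\}")


def resolve_placeholders(
    template: str, variables: Mapping[str, Any]
) -> tuple[str, list[str]]:
    """Fill `{{name}}` placeholders at execution time and report any that are missing."""
    missing: list[str] = []

    def substitute(match: re.Match) -> str:
        name = match.group(1)
        if name in variables:
            return str(variables[name])
        missing.append(name)
        return match.group(0)  # leave the placeholder untouched

    return PLACEHOLDER.sub(substitute, template), missing
```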

  ## Tool execution tracking on agent calls

  A new [tool executions endpoint](/api-reference/calls/submit-tool-execution) lets you record webhook, template, human transfer, and built-in tool activity against a call. Execution records (including outcome, duration, and HTTP status) also appear in the [call detail response](/api-reference/calls/get-call).
</Update>

<Update label="April 18, 2026" description="API v0.31.2" tags={["Feature", "Update"]}>
  ## Cartesia Sonic 3 TTS

  [Cartesia Sonic 3](/api-reference/tts/cartesia-sonic-3/cartesia-sonic-3-ws) is available as a new TTS provider. It supports low-latency streaming synthesis over WebSocket with context-aware generation controls.

  ## Reson8 STT

  [Reson8 STT v1](/api-reference/stt/reson8-stt-v1/reson8-stt-v1-ws) is available as a new STT provider. It supports real-time transcription over WebSocket with word-level timestamps, confidence scores, and partial results in nine languages including Dutch, French, German, and Spanish.

  ## Deepgram Nova 3 Indic language endpoints

  Four new SLNG-hosted Deepgram Nova 3 language variants are available in `ap-south-1` (Mumbai): [Kannada](/api-reference/stt/deepgram-nova-3/nova-3-kannada-ws), [Marathi](/api-reference/stt/deepgram-nova-3/nova-3-marathi-ws), [Tamil](/api-reference/stt/deepgram-nova-3/nova-3-tamil-ws), and [Telugu](/api-reference/stt/deepgram-nova-3/nova-3-telugu-ws). Each has a dedicated WebSocket endpoint for that language.

  ## Soniox Speech AI version correction

  The Soniox STT endpoint is now correctly labeled [Speech AI Real-time v3](/api-reference/stt/soniox-speech-ai-real-time-v4/speech-ai-real-time-v4-ws). URLs and navigation have been updated accordingly.
</Update>

<Update label="April 15, 2026" description="API v0.22.1" tags={["Feature"]}>
  ## Batch speech-to-text API

  You can now transcribe audio files asynchronously with the new [Batch STT API](/api-reference/batch). Upload a file, poll the job status, and download the transcript when ready. Supported formats include wav, mp3, flac, aac, ogg, m4a, mp4, amr, and mpeg. The service is powered by Speechmatics.
</Update>

<Update label="April 11, 2026" description="API v0.22.0" tags={["Feature", "Update"]}>
  ## Murf Falcon TTS

  [Murf Falcon](/api-reference/tts/murf-falcon/murf-falcon-ws) is available as a new TTS provider. It supports multilingual speech synthesis over WebSocket with multiple encodings and sample rates.

  ## Unified API documentation

  The new [Unified API](/execution-layer/unified-api) section explains how to use one endpoint pattern for every STT and TTS model. Swap providers by changing only the URL path. Your auth, request format, and code stay the same. The section includes guides on [parameter coverage](/execution-layer/unified-api-parameters) and [supported models](/execution-layer/unified-api-models).

  ## Integrations hub

  A new [Integrations](/integrations/overview) page lists third-party platforms you can connect to SLNG. LiveKit, Cognigy, and Jambonz each have dedicated setup paths.

  ## Whisper Large v3 endpoint consolidated

  The separate Whisper Large v3 Compressed endpoint has been removed. Use the standard [Whisper Large v3](/api-reference/stt/whisper-large-v3/whisper-large-v3-ws) endpoint, which now handles compressed audio directly.
</Update>

<Update label="April 9, 2026" description="API v0.20.0" tags={["Feature", "Update"]}>
  ## Language selection for Nova 3 STT

  SLNG-hosted Deepgram Nova 3 STT endpoints accept a `language` parameter in the WebSocket `init` config. Supported locales by variant:

  * **English**: `en`, `en-au`, `en-us`, `en-nz`, `en-gb`, `en-in`
  * **Spanish**: `es`, `es-us`, `es-419`, `es-ar`, `es-mx`, `es-es`
  * **Hindi**: `hi`, `en`
  * **Multi-language**: `multi`

  The Hindi variant also accepts `en`, so you can transcribe English audio without switching endpoints. See the [Speech-to-Text models page](/models/stt) for the full parameter list.
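
  The locale list above can double as a client-side lookup table. In this sketch the variant keys and the helper name are hypothetical labels; only the locale codes come from the list.

```python
# Locale codes per variant, copied from the list above.
NOVA3_LANGUAGES = {
    "english": {"en", "en-au", "en-us", "en-nz", "en-gb", "en-in"},
    "spanish": {"es", "es-us", "es-419", "es-ar", "es-mx", "es-es"},
    "hindi": {"hi", "en"},
    "multi-language": {"multi"},
}


def check_language(variant: str, language: str) -> str:
    """Hypothetical helper: confirm a locale is listed for the chosen variant."""
    supported = NOVA3_LANGUAGES[variant]
    if language not in supported:
        raise ValueError(f"{language!r} is not listed for the {variant} variant")
    return language
```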

  ## More sample rates for Rime Arcana TTS

  [Rime Arcana](/voices/rime-arcana) now supports 8, 16, 22.05, 24 (default), 32, 44.1, and 48 kHz. You can match your audio pipeline directly without resampling.

  ## Simplified endpointing parameter

  The `endpointing` parameter on Deepgram STT endpoints now accepts only an integer (milliseconds of silence before finalizing speech). Set it to `0` to disable. Default remains `10`.
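
  If existing configs still pass `false`, a small shim can convert them before the request goes out. The helper name is hypothetical, and rejecting negative values is an assumption based on the parameter being a duration in milliseconds.

```python
from typing import Union


def normalize_endpointing(value: Union[int, bool]) -> int:
    """Convert a legacy `false` to `0`; reject anything that is not a non-negative integer."""
    if value is False:
        return 0  # `0` is the new way to disable endpointing
    if isinstance(value, bool) or not isinstance(value, int) or value < 0:
        raise ValueError("endpointing must be a non-negative integer (milliseconds)")
    return value
```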
</Update>

<Update label="April 8, 2026" description="API v0.19.0" tags={["Feature"]}>
  ## Graceful WebSocket session close

  Send `{ "type": "close" }` on any [WebSocket](/websockets) connection to shut down cleanly. The server finishes processing remaining audio, then closes. This replaces the previous `cancel` behavior and works across TTS, STT, and [bridges](/execution-layer/unified-api).

  ## Keepalive for STT streams

  Send `{ "type": "keepalive" }` on STT WebSocket connections to prevent idle timeouts during pauses. This is useful for voice agent sessions where the user goes silent but the connection should stay open.
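
  Both control messages are small JSON text frames. The sketch below picks which one to send next on an STT stream; the helper name and the five-second keepalive threshold are assumptions, so tune the interval to the idle timeout you observe.

```python
import json
from typing import Optional

CLOSE_MESSAGE = json.dumps({"type": "close"})
KEEPALIVE_MESSAGE = json.dumps({"type": "keepalive"})


def next_control_message(
    idle_seconds: float, finished: bool, keepalive_after: float = 5.0
) -> Optional[str]:
    """Return the control message to send now, or None if nothing is due."""
    if finished:
        return CLOSE_MESSAGE  # graceful shutdown once the session is done
    if idle_seconds >= keepalive_after:
        return KEEPALIVE_MESSAGE  # keep the STT stream open during silence
    return None
```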

  ## Endpointing controls for Deepgram Nova STT

  [Deepgram Nova STT models](/models/stt) gain two new parameters for tuning speech segmentation:

  * **`endpointing`**: milliseconds of silence before finalizing speech. Set to `false` to disable. Default: `10`.
  * **`utterance_end_ms`**: milliseconds of silence between words before an `UtteranceEnd` event. Range: 200–5000 ms, default: 1000 ms.
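
  A client-side range check keeps out-of-range values from reaching the server. The helper below is a hypothetical sketch that uses the range and default listed above.

```python
def validate_utterance_end_ms(value: int = 1000) -> int:
    """Hypothetical helper: enforce the documented 200 to 5000 ms range (default 1000)."""
    if not 200 <= value <= 5000:
        raise ValueError("utterance_end_ms must be between 200 and 5000")
    return value
```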

  ## India region for Nova 3 Hindi

  Deepgram Nova 3 Hindi is now available in `ap-south-1` (Mumbai), alongside `ap-southeast-2` (Sydney). Use the `X-Region-Override` header to route to the closest region. See [models by region](/models/by-region).
</Update>