Speech AI Real-time v3

Messages

{
  "type": "init",
  "config": {
    "language": "en",
    "sample_rate": 16000,
    "audio_format": "pcm_s16le",
    "enable_partials": true,
    "enable_speaker_diarization": true,
    "language_hints": [
      "en"
    ]
  }
}

{
  "type": "partial_transcript",
  "transcript": "Hello world",
  "confidence": 0.92,
  "tokens": [
    {
      "text": "Hello",
      "start_ms": 0,
      "end_ms": 500,
      "confidence": 0.95,
      "is_final": false,
      "speaker": "0"
    },
    {
      "text": " world",
      "start_ms": 500,
      "end_ms": 1000,
      "confidence": 0.9,
      "is_final": false,
      "speaker": "0"
    }
  ]
}
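
The token list in the partial_transcript example can be consumed roughly as follows. This is a minimal sketch that relies only on the fields shown in the sample message; the helper names join_tokens and speaker_segments are hypothetical and not part of any SDK.

```python
import json
from itertools import groupby

# Sample copied from the partial_transcript example above.
RAW = """
{
  "type": "partial_transcript",
  "transcript": "Hello world",
  "confidence": 0.92,
  "tokens": [
    {"text": "Hello", "start_ms": 0, "end_ms": 500,
     "confidence": 0.95, "is_final": false, "speaker": "0"},
    {"text": " world", "start_ms": 500, "end_ms": 1000,
     "confidence": 0.9, "is_final": false, "speaker": "0"}
  ]
}
"""


def join_tokens(message: dict) -> str:
    """Concatenate token texts; in the sample, tokens carry their own leading spaces."""
    return "".join(token["text"] for token in message["tokens"])


def speaker_segments(message: dict) -> list:
    """Group consecutive tokens by speaker into (speaker, text, start_ms, end_ms)."""
    segments = []
    for speaker, group in groupby(message["tokens"], key=lambda t: t["speaker"]):
        tokens = list(group)
        text = "".join(t["text"] for t in tokens).strip()
        segments.append((speaker, text, tokens[0]["start_ms"], tokens[-1]["end_ms"]))
    return segments


message = json.loads(RAW)
print(join_tokens(message))
print(speaker_segments(message))
```

Grouping on consecutive speaker values keeps turn order intact when diarization is enabled and several speakers alternate.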

WSS /stt/soniox/speech-ai:rt-v3

bearer

type:http

API key issued by SLNG. Pass as Authorization: Bearer <token> in the WebSocket upgrade request headers.

method

type:string

GET

headers

type:object

X-World-Part-Override

type:enum

Overrides the target world part. If omitted, the world part is selected automatically.

Available options: eu, na
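
The headers for the WebSocket upgrade request might be assembled as sketched below. The Authorization and X-World-Part-Override header names and the eu/na values come from this page; the function name build_upgrade_headers is hypothetical, and how the resulting dict is handed to a WebSocket client depends on the library in use.

```python
from typing import Dict, Optional

# World parts listed on this page.
ALLOWED_WORLD_PARTS = {"eu", "na"}


def build_upgrade_headers(api_key: str, world_part: Optional[str] = None) -> Dict[str, str]:
    """Return headers for the WebSocket upgrade (GET) request."""
    if not api_key:
        raise ValueError("api_key must be a non-empty string")
    headers = {"Authorization": f"Bearer {api_key}"}
    if world_part is not None:
        if world_part not in ALLOWED_WORLD_PARTS:
            raise ValueError(f"world_part must be one of {sorted(ALLOWED_WORLD_PARTS)}")
        # Omitting this header leaves the selection to the server.
        headers["X-World-Part-Override"] = world_part
    return headers


print(build_upgrade_headers("YOUR_API_KEY", world_part="eu"))
```

Validating the world part locally surfaces a typo before the connection attempt rather than as a failed handshake.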

Init Request (Soniox)

type:object

Initialize a Soniox STT session with provider-specific recognition configuration.
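
As an illustration, the init message from the example under Messages could be built like this. The field names mirror that example; the function name build_init_message and its default values are assumptions made for the sketch, and the full set of accepted config fields is not listed on this page.

```python
import json
from typing import List, Optional


def build_init_message(
    language: str = "en",
    sample_rate: int = 16000,
    audio_format: str = "pcm_s16le",
    enable_partials: bool = True,
    enable_speaker_diarization: bool = True,
    language_hints: Optional[List[str]] = None,
) -> dict:
    """Build an init message with the config fields shown in the example."""
    return {
        "type": "init",
        "config": {
            "language": language,
            "sample_rate": sample_rate,
            "audio_format": audio_format,
            "enable_partials": enable_partials,
            "enable_speaker_diarization": enable_speaker_diarization,
            # Default the hints to the session language, as in the example.
            "language_hints": language_hints if language_hints is not None else [language],
        },
    }


init_message = build_init_message()
# Serialize to a JSON text frame before sending it over the socket.
print(json.dumps(init_message, indent=2))
```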

Audio Message

type:object

Stream an audio frame to be transcribed.
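
With the pcm_s16le format at 16000 Hz from the init example, each sample takes 2 bytes, so one second of mono audio is 32000 bytes. The sketch below splits a PCM buffer into fixed-duration frames; the 100 ms frame length and the mono assumption are illustrative choices, not values stated on this page, and the envelope of the audio message itself is not covered here.

```python
from typing import Iterator

BYTES_PER_SAMPLE = 2  # pcm_s16le: signed 16-bit little-endian samples


def frame_size_bytes(sample_rate: int, frame_ms: int, channels: int = 1) -> int:
    """Number of bytes in one frame of the given duration."""
    return sample_rate * BYTES_PER_SAMPLE * channels * frame_ms // 1000


def chunk_pcm(pcm: bytes, frame_bytes: int) -> Iterator[bytes]:
    """Yield consecutive frames; the last frame may be shorter."""
    if frame_bytes <= 0 or frame_bytes % BYTES_PER_SAMPLE:
        raise ValueError("frame_bytes must be a positive multiple of the sample size")
    for offset in range(0, len(pcm), frame_bytes):
        yield pcm[offset:offset + frame_bytes]


size = frame_size_bytes(sample_rate=16000, frame_ms=100)
silence = bytes(8000)  # 250 ms of silence at 16 kHz mono
print(size, [len(frame) for frame in chunk_pcm(silence, size)])
```

Keeping the frame size a multiple of the sample size avoids splitting a 16-bit sample across two frames.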

Finalize Message

type:object

Force-finalize buffered audio tokens without closing the connection.

Close Message

type:object

Signal end of audio stream and close the connection.

Keepalive Message

type:object

Keep the WebSocket connection alive during silence.

Ready Response

type:object

Indicates the session is ready to receive audio.

Tokens Response (Soniox)

type:object

Transcription result from Soniox with token-level detail.

Final Transcript

type:object

Final transcription result with optional metadata.

Error Response

type:object

Indicates an error occurred during recognition.
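
A client typically routes incoming messages on their type field. In the sketch below only the "partial_transcript" type string is taken from this page; the "ready", "final_transcript", and "error" strings, the presence of a transcript field on final results, and the function name handle_message are all assumptions to be checked against the actual schemas.

```python
import json

# Only PARTIAL_TYPE appears on this page; the other three are assumed names.
PARTIAL_TYPE = "partial_transcript"
READY_TYPE = "ready"
FINAL_TYPE = "final_transcript"
ERROR_TYPE = "error"


def handle_message(raw: str, state: dict) -> str:
    """Update transcript state from one server message and return its type."""
    message = json.loads(raw)
    kind = message.get("type", "")
    if kind == READY_TYPE:
        state["ready"] = True
    elif kind == PARTIAL_TYPE:
        # Partials replace each other until a final result arrives.
        state["partial"] = message.get("transcript", "")
    elif kind == FINAL_TYPE:
        state["finals"].append(message.get("transcript", ""))
        state["partial"] = ""
    elif kind == ERROR_TYPE:
        raise RuntimeError(f"recognition error: {message}")
    return kind


state = {"ready": False, "partial": "", "finals": []}
handle_message('{"type": "partial_transcript", "transcript": "Hello"}', state)
print(state)
```

Unknown types are returned to the caller unchanged, so provider-specific responses can be handled separately without breaking the loop.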
