Whisper Large v3 - SLNG Documentation

Messages

{
  "type": "ready",
  "session_id": "a1b2c3d4-e5f6-7890-abcd-ef1234567890",
  "context_id": "stt-a1b2c3d4-e5f6-7890-abcd-ef1234567890",
  "model": "whisper-large-v3",
  "commands": [
    "flush",
    "stop",
    "clear",
    "context"
  ],
  "features": {
    "vad": true,
    "partial_transcripts": true,
    "final_transcripts": true,
    "language_detection": true,
    "speaker_diarization": true
  }
}

{
  "type": "partial_transcript",
  "context_id": "stt-a1b2c3d4-e5f6-7890-abcd-ef1234567890",
  "session_id": "a1b2c3d4-e5f6-7890-abcd-ef1234567890",
  "transcript": "The quick brown fox jumps over the",
  "language": "en",
  "confidence": 1,
  "is_final": false,
  "audio_duration": 2.56
}

{
  "type": "final_transcript",
  "context_id": "stt-a1b2c3d4-e5f6-7890-abcd-ef1234567890",
  "session_id": "a1b2c3d4-e5f6-7890-abcd-ef1234567890",
  "transcript": "The quick brown fox jumps over the lazy dog.",
  "language": "en",
  "confidence": 1,
  "is_final": true,
  "audio_duration": 3.84,
  "segments": [
    {
      "text": " The quick brown fox jumps over the lazy dog.",
      "start": 0.066,
      "end": 3.71,
      "avg_logprob": -0.08
    }
  ]
}
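The three examples above share a `type` field, so a client can route each frame on it. The following is a minimal sketch of that idea, not SLNG's client code: the `TranscriptBuffer` class and its method names are hypothetical, and it assumes each partial transcript replaces the previous one rather than extending it, which the examples suggest but do not state.

```python
import json


class TranscriptBuffer:
    """Tracks the latest interim text and the list of finalized utterances.

    Hypothetical helper: only the "type" and "transcript" fields come from
    the message examples; everything else is an illustration.
    """

    def __init__(self):
        self.finals = []   # finalized transcripts, in arrival order
        self.partial = ""  # most recent interim transcript

    def feed(self, raw):
        """Consume one JSON text frame and return its message type."""
        msg = json.loads(raw)
        kind = msg.get("type")
        if kind == "partial_transcript":
            # Assumption: a new partial supersedes the previous partial.
            self.partial = msg["transcript"]
        elif kind == "final_transcript":
            self.finals.append(msg["transcript"])
            self.partial = ""  # the interim text is replaced by the final
        return kind

    def text(self):
        """Finalized text plus any pending interim text, space-joined."""
        parts = self.finals + ([self.partial] if self.partial else [])
        return " ".join(parts)
```

Feeding every text frame received on the socket into `feed` keeps `text()` current for display, while `finals` holds only committed results.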

WSS /stt/slng/openai/whisper:large-v3

bearer

type:http

API key issued by SLNG. Pass it as Authorization: Bearer <token> in the headers of the WebSocket upgrade request.

method

type:string

GET

headers

type:object

X-World-Part-Override

type:enum

Target world part override. Auto-selected if not provided. Available world parts: eu.

Available options: eu
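As a rough sketch of how the two documented headers might be assembled before the upgrade request: the `build_upgrade_headers` function and the `KNOWN_WORLD_PARTS` constant are hypothetical names, and the only values taken from this page are the header names, the Bearer scheme, and the single listed world part `eu`.

```python
# World parts listed on this page; hypothetical constant for validation.
KNOWN_WORLD_PARTS = ("eu",)


def build_upgrade_headers(api_key, world_part=None):
    """Return the headers for the WebSocket upgrade (GET) request.

    api_key    -- API key issued by SLNG
    world_part -- optional override; omitted means auto-selection
    """
    if not api_key:
        raise ValueError("api_key must be a non-empty string")
    headers = {"Authorization": f"Bearer {api_key}"}
    if world_part is not None:
        if world_part not in KNOWN_WORLD_PARTS:
            raise ValueError(f"unknown world part: {world_part!r}")
        headers["X-World-Part-Override"] = world_part
    return headers
```

The resulting dictionary can be handed to whichever WebSocket client is in use, provided it accepts custom headers on the handshake. Leaving `world_part` as `None` omits the override header so that the world part is auto-selected.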

Init Request

type:object

Initialize an SLNG-hosted Whisper Large v3 STT session.

Audio Message

type:object

Stream an audio frame to be transcribed.

Finalize Message

type:object

Force-finalize buffered audio tokens without closing the connection.

Close Message

type:object

Signal end of audio stream and close the connection.

Ready Response

type:object

Indicates the Whisper session is ready to receive audio.
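Because the ready message advertises `commands` and `features`, a client can check them before relying on a capability. The sketch below assumes the field layout shown in the ready example; `missing_capabilities` is a hypothetical helper name.

```python
def missing_capabilities(ready, commands=(), features=()):
    """Return requested commands/features the ready message does not advertise.

    ready    -- the parsed ready message (a dict)
    commands -- command names the client intends to send
    features -- feature flags the client depends on
    """
    if ready.get("type") != "ready":
        raise ValueError("expected a message with type 'ready'")
    offered_commands = set(ready.get("commands", []))
    # A feature counts as offered only when its flag is true.
    offered_features = {
        name for name, enabled in ready.get("features", {}).items() if enabled
    }
    missing = [c for c in commands if c not in offered_commands]
    missing += [f for f in features if f not in offered_features]
    return missing
```

An empty result means everything requested is advertised; otherwise the client can degrade gracefully or abort before sending audio.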

Partial Transcript

type:object

Interim Whisper transcription result, updated as more audio is processed.

Final Transcript

type:object

Final Whisper transcription result with segment-level detail.
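The segment-level detail can be reduced to a few per-segment figures. This sketch assumes the `segments` layout from the final transcript example and assumes that `avg_logprob` is a mean natural-log token probability, as in Whisper's own output, so that `exp(avg_logprob)` gives a rough per-token confidence. `summarize_segments` is a hypothetical helper name.

```python
import math


def summarize_segments(final_msg):
    """Return trimmed text, duration, and approximate confidence per segment."""
    rows = []
    for seg in final_msg.get("segments", []):
        rows.append({
            # Segment text carries a leading space in the example; trim it.
            "text": seg["text"].strip(),
            "duration": round(seg["end"] - seg["start"], 3),
            # Assumption: avg_logprob is a mean natural-log probability.
            "avg_token_prob": round(math.exp(seg["avg_logprob"]), 3),
        })
    return rows
```

For the example segment, this yields a duration of 3.644 seconds and an approximate token probability of 0.923.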

Error Response

type:object

Indicates an error occurred during recognition.
