{
"type": "init",
"model": "rime/arcana:es",
"speaker": "seraphina",
"config": {
"sample_rate": 22050,
"encoding": "linear16"
}
}{
"type": "text",
"text": "Hola desde Rime Arcana Español. Probando la síntesis de texto a voz."
}{
"type": "flush"
}{
"type": "clear"
}{
"type": "cancel"
}{
"type": "ready",
"session_id": "sess_tts_abc123"
}{
"type": "audio_chunk",
"data": "UklGRiQAAABXQVZFZm10IBAAAAABAAEA...",
"sequence": 1
}{
"type": "segment_start",
"segment_id": "seg_001"
}{
"type": "segment_end",
"segment_id": "seg_001"
}{
"type": "flushed"
}{
"type": "cleared"
}{
"type": "audio_end",
"duration": 3.5
}{
"type": "error",
"code": "provider_error",
"message": "Provider returned an unexpected error"
}Text-to-Speech API for generating Spanish speech from text using Rime Arcana TTS model. Establishes a WebSocket connection for real-time text-to-speech.
{
"type": "init",
"model": "rime/arcana:es",
"speaker": "seraphina",
"config": {
"sample_rate": 22050,
"encoding": "linear16"
}
}{
"type": "text",
"text": "Hola desde Rime Arcana Español. Probando la síntesis de texto a voz."
}{
"type": "flush"
}{
"type": "clear"
}{
"type": "cancel"
}{
"type": "ready",
"session_id": "sess_tts_abc123"
}{
"type": "audio_chunk",
"data": "UklGRiQAAABXQVZFZm10IBAAAAABAAEA...",
"sequence": 1
}{
"type": "segment_start",
"segment_id": "seg_001"
}{
"type": "segment_end",
"segment_id": "seg_001"
}{
"type": "flushed"
}{
"type": "cleared"
}{
"type": "audio_end",
"duration": 3.5
}{
"type": "error",
"code": "provider_error",
"message": "Provider returned an unexpected error"
}API key issued by SLNG. Pass as Authorization: Bearer <token> in the WebSocket upgrade request headers.
GET
Must contain all of these types
Initialize a Spanish Rime Arcana TTS session.
Send Spanish text to synthesize.
Force any buffered text/audio to be finalized and delivered.
Clear any queued text/audio from the current session.
Cancel the current generation and stop any further audio.
Indicates the session is ready to receive messages.
Chunk of base64-encoded audio data.
Signals the start of a synthesized segment.
Signals the end of a synthesized segment.
Acknowledges that buffered output was flushed.
Acknowledges that queued output was cleared.
Signals the end of audio generation.
Indicates an error occurred during synthesis.
Was this page helpful?