Request lifecycle of a single turn
When a caller speaks:
- Audio arrives from the caller via your orchestrator.
- STT Routing selects the transcription model based on language, accent, noise profile, and cost constraints.
- The transcript goes to your orchestrator’s LLM or to SLNG’s Tiered Decisioning.
- Tiered Decisioning determines whether the turn needs full inference, needs only local inference, or can be resolved without calling the LLM.
- The response (LLM, cached, or deterministic) is sent to TTS.
- Output Assembly checks cache, assembles from segments where possible, and generates only what is new.
- Audio returns to the caller.
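
The steps above can be sketched as a single function that handles one turn. This is a minimal illustration, not SLNG's actual API: every name here (`route_stt`, `decide_tier`, `assemble_output`, `handle_turn`, the model identifiers, the word-count threshold, and the canned reply) is hypothetical, and the STT, LLM, and TTS calls are replaced with stubs so the flow runs on its own.

```python
from dataclasses import dataclass
from enum import Enum


class Tier(Enum):
    DETERMINISTIC = "deterministic"  # resolved without calling the LLM
    LOCAL = "local"                  # local inference
    FULL = "full"                    # full LLM inference


@dataclass
class TurnContext:
    language: str = "en"
    noisy: bool = False


@dataclass
class TurnResult:
    stt_model: str
    tier: Tier
    response: str
    audio_out: bytes
    segments_generated: int


# Hypothetical canned replies for turns that need no LLM call.
CANNED_REPLIES = {"hello": "Hello. How can I help you today."}


def route_stt(ctx: TurnContext) -> str:
    """STT Routing: pick a transcription model (model names are made up)."""
    if ctx.noisy:
        return "stt-noise-robust"
    if ctx.language != "en":
        return "stt-multilingual"
    return "stt-fast"


def transcribe(audio: bytes, model: str) -> str:
    """Stub STT: treats the audio bytes as UTF-8 text."""
    return audio.decode("utf-8")


def decide_tier(transcript: str) -> Tier:
    """Tiered Decisioning: an assumed, deliberately simple rule set."""
    text = transcript.strip().lower()
    if text in CANNED_REPLIES:
        return Tier.DETERMINISTIC
    if len(text.split()) <= 6:  # arbitrary threshold for illustration
        return Tier.LOCAL
    return Tier.FULL


def respond(transcript: str, tier: Tier) -> str:
    """Produce the response text; the model calls are stubbed out."""
    if tier is Tier.DETERMINISTIC:
        return CANNED_REPLIES[transcript.strip().lower()]
    if tier is Tier.LOCAL:
        return f"Local model reply to: {transcript}"
    return f"Full model reply to: {transcript}"


def synthesize(segment: str) -> bytes:
    """Stub TTS: encodes the text instead of generating speech."""
    return segment.encode("utf-8")


def assemble_output(response: str, cache: dict[str, bytes]) -> tuple[bytes, int]:
    """Output Assembly: reuse cached segments, generate only new ones."""
    segments = [s.strip() for s in response.split(".") if s.strip()]
    chunks = []
    generated = 0
    for segment in segments:
        if segment not in cache:
            cache[segment] = synthesize(segment)
            generated += 1
        chunks.append(cache[segment])
    return b"".join(chunks), generated


def handle_turn(audio_in: bytes, ctx: TurnContext, cache: dict[str, bytes]) -> TurnResult:
    """Run one turn: route STT, transcribe, pick a tier, respond, assemble audio."""
    model = route_stt(ctx)
    transcript = transcribe(audio_in, model)
    tier = decide_tier(transcript)
    response = respond(transcript, tier)
    audio_out, generated = assemble_output(response, cache)
    return TurnResult(model, tier, response, audio_out, generated)
```

Calling `handle_turn(b"hello", TurnContext(), cache)` twice with the same `cache` dictionary shows the intent of Output Assembly: the first call synthesizes every segment, while the second finds all segments in the cache and generates nothing new. A real deployment would route on richer signals (accent, cost constraints) and cache actual audio rather than encoded text.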