Call Settings - OSVI AI Docs

Call settings control how an agent behaves in the audio stream — when it stops talking, how long it waits before responding, what it does when the line goes quiet. Defaults work well for most agents; reach for these when an agent talks over people, pauses awkwardly, or responds too eagerly. All settings live on the agent’s Call Settings tab.

Interruptions and turn-taking

Setting	What it does
Allow interruptions	Whether the caller can cut the agent off mid-sentence. On by default — turning it off makes the agent finish every sentence, which feels robotic but suits compliance scripts that must be read in full.
Min interruption duration	How long the caller must speak before it counts as an interruption (0.1–2.0 s). Raise this if background noise or short “mm-hmm”s keep cutting the agent off.
Minimum interruption words	Alternative threshold in words — the caller must say at least this many words to interrupt.
Min / max endpointing delay	How long the agent waits after the caller stops speaking before replying. The minimum (default ~0.4 s) prevents the agent jumping in on a breath pause; the maximum (default ~2 s) caps how long it can hold back. Raise the minimum for callers who speak with pauses (numbers, addresses); lower it for snappier back-and-forth.

Silence handling

When the caller goes quiet, the agent checks in rather than sitting in dead air:

Voice activity detection (VAD) — on by default, with a sensitivity slider. Higher sensitivity detects quieter speech but picks up more background noise.
Silence message — what the agent says after a sustained silence. Use a static message (e.g. “Hey, are you still there?”) or have one generated from the conversation context so it fits naturally.
If the caller still doesn’t respond after the check-in, the agent ends the call (the conversation record shows the silence timeout as the end reason).

Voice and audio

Noise cancellation — on by default; filters background noise from the caller’s side before the agent hears it.
Background audio — play ambient sound behind the agent’s voice to make the call feel like it comes from a real place. Options: none, city ambience, forest ambience, office ambience, crowded room — with a volume slider. Subtle office ambience makes long pauses feel less artificial.

Response timing

Presets (fast / medium / slow / custom) that trade snappiness against the risk of talking over the caller. Fast suits short transactional calls; slow suits conversations where callers think out loud.

Supporting LLM

For agents where every millisecond of response delay matters, you can enable a second, smaller LLM that generates a brief acknowledgement (“Sure, let me check that…”) while the main model composes the full answer. You choose its provider and model, set its temperature and token cap, and give it a filler instruction describing the kinds of acknowledgements it should produce. The result is an agent that never leaves a long thinking gap.

Change one setting at a time and re-test in the web-call playground. Interruption and endpointing settings interact — adjusting both at once makes it hard to tell which change helped.

​Interruptions and turn-taking

​Silence handling

​Voice and audio

​Response timing

​Supporting LLM

Interruptions and turn-taking

Silence handling

Voice and audio

Response timing

Supporting LLM