This study investigates how voice AI agents decide when you've finished speaking.
You'll have a short conversation with an AI across three different conditions -
each using a different strategy for detecting the end of your turn.
Select a condition to begin
A
VAD Baseline
The AI responds after a fixed period of silence. No language understanding is used; the system simply waits for the mic to go quiet.
Fixed 1.0s thresholdSilence-only
B
VAD + Syntactic Scorer
Silence detection is combined with a lightweight language model that scores whether your utterance forms a complete thought before firing.
EOS scorerCompleteness gate
C
Full Ensemble
Combines intent classification, a dynamic silence threshold, an EOS score, and an LLM turn-completion scorer into a weighted ensemble decision.
Intent-awareDynamic thresholdEnsemble
Each session takes ~5 minutes. Fill out the survey as you go. Record your experience after each condition before moving to the next.
Responds after a fixed 1.0s silence window. No language understanding - silence alone determines the turn boundary.
Fixed 1.0s thresholdSilence-only
—
stable
The AI will ask you
I'm here to help you think through a decision. Tell me about something you've been going back and forth on - walk me through both sides as you see them.