[AI]■ STORY TIMELINE

OPEN-SOURCE MODEL LISTENS, DECIDES TO SPEAK EVERY 0.4 SECONDS

A new open-source voice model called Audio Interaction processes audio continuously without waiting for recordings to end, making real-time decisions about when to respond. The model handles translation, transcription, and chat while detecting ambient sounds like coughing.

1 SOURCEFIRST SEEN JUN 6, 10:50 AM► READ THE ARTICLE

The Decoder+0m

New open-source voice model listens nonstop and decides every 0.4 seconds whether to speak or stay silent

Unlike GPT-4o or Qwen3.5-Omni, Audio Interaction doesn't wait for a recording to end: it translates, transcribes, chats,…

◄ BACK TO ARTICLE