OPEN-SOURCE MODEL LISTENS CONTINUOUSLY, DECIDES EVERY 0.4 SECONDS WHETHER TO SPEAK
A new open-source voice model called Audio Interaction processes audio continuously instead of waiting for a recording to end, deciding in real time when to respond. The model handles translation, transcription, and chat, and it also detects ambient sounds such as coughing.
The Decoder (+0m)
Unlike GPT-4o or Qwen3.5-Omni, Audio Interaction doesn't wait for a recording to end: it translates, transcribes, chats,…