Thinking Machines Lab is pushing the boundaries of human-AI collaboration with its research preview of interaction models. This new approach embeds interactivity directly into the AI, rather than relying on external systems, aiming to make working with AI as fluid as collaborating with another person. The models are designed to process audio, video, and text continuously, enabling real-time thinking, responding, and acting.
Related startups
The core idea is to address what the lab calls the 'collaboration bottleneck.' Current AI systems, often optimized for autonomous tasks, struggle with human-in-the-loop workflows. Users can't always specify needs upfront, and interfaces often push humans out, despite their value in clarifying and providing feedback. The goal is to enable AI interfaces that meet humans where they are, facilitating natural interaction through speaking, listening, seeing, and interjecting.
The Collaboration Bottleneck
Existing AI models operate on a single thread, waiting for user input to complete before processing new information. This turn-based system creates a narrow communication channel, limiting the nuances of human knowledge, intent, and judgment that can be conveyed. It's akin to resolving a critical disagreement over email instead of an in-person conversation.