Thinking Machines Lab is pushing the boundaries of human-AI collaboration with a research preview of its interaction models. The new approach embeds interactivity directly into the AI rather than relying on external systems, aiming to make working with AI as fluid as collaborating with another person. The models are designed to process audio, video, and text continuously, enabling them to think, respond, and act in real time.
The core idea is to address what the lab calls the "collaboration bottleneck." Current AI systems, often optimized for autonomous tasks, struggle with human-in-the-loop workflows: users can't always specify their needs upfront, and interfaces often push humans out of the loop despite the value they add in clarifying intent and providing feedback. The goal is to build AI interfaces that meet humans where they are, supporting natural interaction through speaking, listening, seeing, and interjecting.