AI's Next Leap: Interaction Models

Thinking Machines Lab introduces 'interaction models' for AI, enabling real-time, multimodal collaboration that mirrors human conversation.

6 min read
Abstract visualization of interconnected nodes representing AI interaction model data streams.
Conceptual illustration of Thinking Machines' interaction models.

Thinking Machines Lab is pushing the boundaries of human-AI collaboration with its research preview of interaction models. This new approach embeds interactivity directly into the AI, rather than relying on external systems, aiming to make working with AI as fluid as collaborating with another person. The models are designed to process audio, video, and text continuously, enabling real-time thinking, responding, and acting.

Visual TL;DR. Collaboration Bottleneck addressed by Thinking Machines Lab. Thinking Machines Lab develops Interaction Models. Interaction Models enables Real-time Multimodal. Real-time Multimodal leads to Fluid Human-AI. Interaction Models facilitates Meet Humans Where. Collaboration Bottleneck solves Improved AI Workflows.

  1. Collaboration Bottleneck: current AI struggles with human-in-the-loop workflows
  2. Thinking Machines Lab: introduces new AI interaction models research preview
  3. Interaction Models: AI embeds interactivity directly, not external systems
  4. Real-time Multimodal: processes audio, video, text continuously for thinking
  5. Fluid Human-AI: enables working with AI as natural as people
  6. Meet Humans Where: facilitates natural interaction through speaking, listening, seeing
  7. Improved AI Workflows: overcomes limitations of single-thread AI processing
Visual TL;DR
Visual TL;DR — startuphub.ai Collaboration Bottleneck addressed by Thinking Machines Lab. Thinking Machines Lab develops Interaction Models. Interaction Models enables Real-time Multimodal. Real-time Multimodal leads to Fluid Human-AI addressed by develops enables leads to Collaboration Bottleneck Thinking Machines Lab Interaction Models Real-time Multimodal Fluid Human-AI From startuphub.ai · The publishers behind this format
Visual TL;DR — startuphub.ai Collaboration Bottleneck addressed by Thinking Machines Lab. Thinking Machines Lab develops Interaction Models. Interaction Models enables Real-time Multimodal. Real-time Multimodal leads to Fluid Human-AI addressed by develops enables leads to CollaborationBottleneck Thinking MachinesLab InteractionModels Real-timeMultimodal Fluid Human-AI From startuphub.ai · The publishers behind this format
Visual TL;DR — startuphub.ai Collaboration Bottleneck addressed by Thinking Machines Lab. Thinking Machines Lab develops Interaction Models. Interaction Models enables Real-time Multimodal. Real-time Multimodal leads to Fluid Human-AI addressed by develops enables leads to Collaboration Bottleneck current AI struggles withhuman-in-the-loop workflows Thinking Machines Lab introduces new AI interaction modelsresearch preview Interaction Models AI embeds interactivity directly, notexternal systems Real-time Multimodal processes audio, video, text continuouslyfor thinking Fluid Human-AI enables working with AI as natural aspeople From startuphub.ai · The publishers behind this format
Visual TL;DR — startuphub.ai Collaboration Bottleneck addressed by Thinking Machines Lab. Thinking Machines Lab develops Interaction Models. Interaction Models enables Real-time Multimodal. Real-time Multimodal leads to Fluid Human-AI addressed by develops enables leads to CollaborationBottleneck current AIstruggles withhuman-in-the-loop… Thinking MachinesLab introduces new AIinteraction modelsresearch preview InteractionModels AI embedsinteractivitydirectly, not… Real-timeMultimodal processes audio,video, textcontinuously for… Fluid Human-AI enables workingwith AI as naturalas people From startuphub.ai · The publishers behind this format
Visual TL;DR — startuphub.ai Collaboration Bottleneck addressed by Thinking Machines Lab. Thinking Machines Lab develops Interaction Models. Interaction Models enables Real-time Multimodal. Real-time Multimodal leads to Fluid Human-AI. Interaction Models facilitates Meet Humans Where. Collaboration Bottleneck solves Improved AI Workflows addressed by develops enables leads to facilitates solves Collaboration Bottleneck current AI struggles withhuman-in-the-loop workflows Thinking Machines Lab introduces new AI interaction modelsresearch preview Interaction Models AI embeds interactivity directly, notexternal systems Real-time Multimodal processes audio, video, text continuouslyfor thinking Fluid Human-AI enables working with AI as natural aspeople Meet Humans Where facilitates natural interaction throughspeaking, listening, seeing Improved AI Workflows overcomes limitations of single-thread AIprocessing From startuphub.ai · The publishers behind this format
Visual TL;DR — startuphub.ai Collaboration Bottleneck addressed by Thinking Machines Lab. Thinking Machines Lab develops Interaction Models. Interaction Models enables Real-time Multimodal. Real-time Multimodal leads to Fluid Human-AI. Interaction Models facilitates Meet Humans Where. Collaboration Bottleneck solves Improved AI Workflows addressed by develops enables leads to facilitates solves CollaborationBottleneck current AIstruggles withhuman-in-the-loop… Thinking MachinesLab introduces new AIinteraction modelsresearch preview InteractionModels AI embedsinteractivitydirectly, not… Real-timeMultimodal processes audio,video, textcontinuously for… Fluid Human-AI enables workingwith AI as naturalas people Meet Humans Where facilitates naturalinteraction throughspeaking,… Improved AIWorkflows overcomeslimitations ofsingle-thread AI… From startuphub.ai · The publishers behind this format

The core idea is to address what the lab calls the 'collaboration bottleneck.' Current AI systems, often optimized for autonomous tasks, struggle with human-in-the-loop workflows. Users can't always specify needs upfront, and interfaces often push humans out, despite their value in clarifying and providing feedback. The goal is to enable AI interfaces that meet humans where they are, facilitating natural interaction through speaking, listening, seeing, and interjecting.

Related startups

The Collaboration Bottleneck

Existing AI models operate on a single thread, waiting for user input to complete before processing new information. This turn-based system creates a narrow communication channel, limiting the nuances of human knowledge, intent, and judgment that can be conveyed. It's akin to resolving a critical disagreement over email instead of an in-person conversation.

Thinking Machines argues that interactivity must scale with intelligence and be an intrinsic part of the AI model itself. This contrasts with current methods that stitch together external components to simulate real-time capabilities. The lab's research suggests that models trained from scratch with a multi-stream, micro-turn design achieve state-of-the-art performance in both intelligence and responsiveness.

Capabilities

Building interactivity into the model unlocks several new capabilities:

  • Seamless dialog management: The model inherently understands user cues like thinking, yielding, or self-correction without needing a separate component.

  • Verbal and visual interjections: The AI can interrupt or interject contextually, mirroring natural human conversation flow.

  • Simultaneous speech: Both the user and the AI can speak concurrently, useful for applications like live translation.

  • Time-awareness: The model has a direct understanding of elapsed time.

  • Simultaneous tool calls, search, and generative UI: While conversing, the AI can concurrently perform searches, call tools, or generate user interfaces, seamlessly integrating results back into the dialogue.

  • These continuous interactions foster an experience that feels more like a true collaboration than a series of prompts and responses. It's a significant shift from the segmented, turn-based interactions that currently define much of our engagement with AI.

    © 2026 StartupHub.ai. All rights reserved. Do not enter, scrape, copy, reproduce, or republish this article in whole or in part. Use as input to AI training, fine-tuning, retrieval-augmented generation, or any machine-learning system is prohibited without written license. Substantially-similar derivative works will be pursued to the fullest extent of applicable copyright, database, and computer-misuse laws. See our terms.