Humanoids Learn Self-Other Distinction

Humanoid robots now learn self-other distinction and build predictive self-models from sensory data, enabling better collaboration and task performance in human-robot environments.

5 min read
A humanoid robot interacting in a shared environment with humans and other robots.
Visualizing the self-other distinction capability of the humanoid robot.

Humanoid robots increasingly operate alongside humans, yet a critical gap remains: their inability to distinguish themselves from others. This lack of self-other distinction hinders effective collaboration and safe navigation in shared spaces.

Visual TL;DR. Robots lack self-other distinction problem Proprioceptive-visual correspondence. Proprioceptive-visual correspondence method Bypasses identity labels. Proprioceptive-visual correspondence builds Predictive self-model. Predictive self-model enables Learned self-model instrumental. Learned self-model instrumental leads to Better collaboration. Learned self-model instrumental supports Robust multi-agent interaction.

  1. Robots lack self-other distinction: hinders collaboration and safe navigation in shared spaces
  2. Proprioceptive-visual correspondence: robot learns to differentiate itself from others using sensory data
  3. Bypasses identity labels: no need for explicit labels or complex kinematic models
  4. Predictive self-model: maps joint configurations to 3D body occupancy
  5. Learned self-model instrumental: enables downstream tasks in multi-agent scenarios
  6. Better collaboration: improved task performance in human-robot environments
  7. Robust multi-agent interaction: fundamental for effective human-robot collaboration
Visual TL;DR
Visual TL;DR — startuphub.ai Robots lack self-other distinction problem Proprioceptive-visual correspondence. Proprioceptive-visual correspondence builds Predictive self-model. Predictive self-model enables Learned self-model instrumental. Learned self-model instrumental leads to Better collaboration problem builds enables leads to Robots lack self-other distinction Proprioceptive-visual correspondence Predictive self-model Learned self-model instrumental Better collaboration From startuphub.ai · The publishers behind this format
Visual TL;DR — startuphub.ai Robots lack self-other distinction problem Proprioceptive-visual correspondence. Proprioceptive-visual correspondence builds Predictive self-model. Predictive self-model enables Learned self-model instrumental. Learned self-model instrumental leads to Better collaboration problem builds enables leads to Robots lackself-other… Proprioceptive-visualcorrespondence Predictiveself-model Learnedself-model… Bettercollaboration From startuphub.ai · The publishers behind this format
Visual TL;DR — startuphub.ai Robots lack self-other distinction problem Proprioceptive-visual correspondence. Proprioceptive-visual correspondence builds Predictive self-model. Predictive self-model enables Learned self-model instrumental. Learned self-model instrumental leads to Better collaboration problem builds enables leads to Robots lack self-other distinction hinders collaboration and safe navigationin shared spaces Proprioceptive-visual correspondence robot learns to differentiate itself fromothers using sensory data Predictive self-model maps joint configurations to 3D bodyoccupancy Learned self-model instrumental enables downstream tasks in multi-agentscenarios Better collaboration improved task performance in human-robotenvironments From startuphub.ai · The publishers behind this format
Visual TL;DR — startuphub.ai Robots lack self-other distinction problem Proprioceptive-visual correspondence. Proprioceptive-visual correspondence builds Predictive self-model. Predictive self-model enables Learned self-model instrumental. Learned self-model instrumental leads to Better collaboration problem builds enables leads to Robots lackself-other… hinderscollaboration andsafe navigation in… Proprioceptive-visualcorrespondence robot learns todifferentiateitself from others… Predictiveself-model maps jointconfigurations to3D body occupancy Learnedself-model… enables downstreamtasks inmulti-agent… Bettercollaboration improved taskperformance inhuman-robot… From startuphub.ai · The publishers behind this format
Visual TL;DR — startuphub.ai Robots lack self-other distinction problem Proprioceptive-visual correspondence. Proprioceptive-visual correspondence method Bypasses identity labels. Proprioceptive-visual correspondence builds Predictive self-model. Predictive self-model enables Learned self-model instrumental. Learned self-model instrumental leads to Better collaboration. Learned self-model instrumental supports Robust multi-agent interaction problem method builds enables leads to supports Robots lack self-other distinction hinders collaboration and safe navigationin shared spaces Proprioceptive-visual correspondence robot learns to differentiate itself fromothers using sensory data Bypasses identity labels no need for explicit labels or complexkinematic models Predictive self-model maps joint configurations to 3D bodyoccupancy Learned self-model instrumental enables downstream tasks in multi-agentscenarios Better collaboration improved task performance in human-robotenvironments Robust multi-agent interaction fundamental for effective human-robotcollaboration From startuphub.ai · The publishers behind this format
Visual TL;DR — startuphub.ai Robots lack self-other distinction problem Proprioceptive-visual correspondence. Proprioceptive-visual correspondence method Bypasses identity labels. Proprioceptive-visual correspondence builds Predictive self-model. Predictive self-model enables Learned self-model instrumental. Learned self-model instrumental leads to Better collaboration. Learned self-model instrumental supports Robust multi-agent interaction problem method builds enables leads to supports Robots lackself-other… hinderscollaboration andsafe navigation in… Proprioceptive-visualcorrespondence robot learns todifferentiateitself from others… Bypasses identitylabels no need forexplicit labels orcomplex kinematic… Predictiveself-model maps jointconfigurations to3D body occupancy Learnedself-model… enables downstreamtasks inmulti-agent… Bettercollaboration improved taskperformance inhuman-robot… Robustmulti-agent… fundamental foreffectivehuman-robot… From startuphub.ai · The publishers behind this format

Bootstrapping Self-Representation from Sensory Data

Researchers have demonstrated a novel approach where a humanoid robot learns to differentiate itself from others solely through proprioceptive-visual correspondence. This breakthrough bypasses the need for explicit identity labels or complex kinematic models, a significant hurdle in current robotics. The system establishes a predictive self-model that maps joint configurations to its three-dimensional body occupancy, effectively learning how its own body shape changes with movement.

Related startups

Enabling Robust Multi-Agent Interaction

Once this foundational self-other distinction is established, the learned self-model proves instrumental in various downstream tasks. In scenarios involving multiple agents, including humans and morphologically identical robots, the system reliably identifies itself. This capability directly supports critical functions such as target reaching, collision-aware motion planning, and human-to-robot motion retargeting. The ability to form a 3D self-model is a crucial step towards true humanoid robot self-awareness.

© 2026 StartupHub.ai. All rights reserved. Do not enter, scrape, copy, reproduce, or republish this article in whole or in part. Use as input to AI training, fine-tuning, retrieval-augmented generation, or any machine-learning system is prohibited without written license. Substantially-similar derivative works will be pursued to the fullest extent of applicable copyright, database, and computer-misuse laws. See our terms.