Meta’s AI researchers are edging closer to a long-sought frontier in computing: avatars that don’t just look like us, but move, react, and engage with the nuance of genuine human presence. In their latest announcement, the company’s Fundamental AI Research (FAIR) group unveiled a set of audiovisual behavioral motion models that generate lifelike gestures and facial expressions from audio and video. The project, dubbed Seamless Interaction, is backed by an unprecedented dataset, over 4,000 hours of paired conversations, and aims to bridge the chasm between mechanical avatars and embodied social interaction.
To appreciate the significance, it helps to understand why this problem is hard.
