"We've hit this kind of like GPT-3.5 moment for video. Let's make sure the world is kind of aware of what's possible now." This bold declaration by Bill Peebles, head of the OpenAI Sora team, encapsulates the groundbreaking nature of their latest generative video model. Peebles, alongside engineering lead Thomas Dimson and product lead Rohan Sahai, recently sat down with Konstantine Buhler and Sonya Huang of Sequoia Capital on the "Training Data" podcast. The conversation delved deep into Sora 2’s technical innovations, its philosophical underpinnings, and the profound implications for creativity and our understanding of artificial intelligence.
The team behind Sora 2 is not merely building a tool; they are crafting a new paradigm for content creation, aiming to compress filmmaking processes from months to mere days. Bill Peebles, the visionary behind the diffusion transformer that powers Sora and many other video generation models, highlighted his traditional research path from undergrad to Berkeley, culminating in his pivotal work on Sora at OpenAI. Thomas Dimson, with a background steeped in building early machine learning and recommender systems at Instagram, and later a "Minecraft in the browser" startup, brings a wealth of product and social platform experience. Rohan Sahai, who transitioned from working on ChatGPT to lead Sora's product team, rounds out a group whose diverse expertise is clearly shaping Sora's ambitious trajectory.
