"To me, AGI will not be complete without spatial intelligence. And I want to solve that problem." Dr. Fei-Fei Li, often hailed as the godmother of AI, articulated this audacious vision during a fireside chat at AI Startup School in San Francisco on June 16, 2025. Joined by Diana Hu, General Partner at Y Combinator, Li delved into her foundational work with ImageNet and its pivotal role in igniting the deep learning revolution, before charting the course for AI's demanding future.
Li recounted the early days of computer vision, a time when data was scarce and algorithms faltered. Her unwavering belief in data-driven methods, even when neural networks were out of favor, led to the creation of ImageNet in 2009. This massive, labeled dataset for visual recognition fundamentally shifted the paradigm, providing the backbone for modern computer vision. The breakthrough moment arrived in 2012, when AlexNet, trained on ImageNet with the help of powerful GPUs, dramatically outperformed earlier approaches on the ImageNet benchmark. "It was an old algorithm," Li noted, referring to convolutional neural networks, "but it was the first time that two GPUs were put together... for the computing of deep learning." This convergence of data, compute, and algorithms validated Li's vision, proving that both the quantity and the quality of data were crucial.
