"Models are what they eat." This concise truth encapsulates the profound shift in artificial intelligence highlighted by Ari Morcos, CEO and co-founder of Datology, on the Latent Space podcast. He argues that the prevailing focus on intricate model architectures and brute-force compute scaling has overlooked the most impactful lever for AI progress: sophisticated data curation.
Morcos, interviewed by Alessio Fanelli and Swyx, shared his personal journey from a neuroscience background, deeply immersed in understanding neural dynamics and inductive biases, to a stark realization he dubs "the bitter lesson." After years spent on papers attempting to understand why certain model representations were desirable, he found a consistent, confronting insight emerging around 2020: "all that really matters is the data."
This epiphany revealed that as data scales, the carefully engineered inductive biases in model architectures become less critical, even "mildly harmful" past a threshold of roughly one million data points. The traditional computer science approach, which treated datasets as a given to be optimized against, was fundamentally flawed. The era of self-supervised learning, enabling a million-fold increase in data quantity from ImageNet to trillions of tokens, ushered in a new regime where models are consistently "underfitting" the available data.
