For years, the promise of comprehensive AI/ML evaluation has been a recurring industry prophecy, often met with a knowing nod and little tangible action. John Dickerson, CEO of Mozilla AI, delivered a compelling presentation at the AI Engineer World's Fair, dissecting this decade-long deferral and positing that 2025 is finally the year "evals" become indispensable, driven by a confluence of macroeconomic shifts and technological advancements.
Dickerson, drawing on his experience as co-founder and Chief Scientist at Arthur AI before joining Mozilla AI, explained that AI/ML monitoring and evaluation have always been two sides of the same coin. However, this need was rarely top-of-mind for the C-suite until two pivotal events converged. Prior to November 30, 2022, the date ChatGPT was publicly released, traditional machine learning models often "spit out some numbers that are ingested and lost in a larger system," resulting in a "tenuous connection to downstream KPIs." Despite "lots of lip service around AI/ML ROI from the C-Suite," genuine investment in evaluation remained elusive, largely confined to the CIO's purview.
