The hype around AI capabilities often feels limitless, but a sobering new analysis of the notoriously difficult FrontierMath benchmark points to hard limits for today’s most advanced models. According to research published by Greg Burnham, even when given an effectively unlimited number of attempts, the current generation of AI, including models like GPT-5, appears unable to solve more than 70% of these advanced math problems.
For anyone using AI today, the most practical number is the single-shot success rate: the share of problems a model solves in one attempt. On that front, the best result so far is GPT-5 solving a mere 29% of FrontierMath problems on a given run. That’s what you can realistically expect from a single query.
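To make the gap between the single-run figure and the many-attempt ceiling concrete, here is a minimal sketch of how the two numbers can relate. It assumes each problem has a fixed chance of being solved on any one attempt and that attempts are independent, which is a simplification and not necessarily the method used in the analysis. The per-problem probabilities in `toy_probs` are made up for illustration (they are not FrontierMath data) and were chosen only so the toy averages land on the article's 29% and 70%; `pass_at_k` is a hypothetical helper name.

```python
def pass_at_k(probs, k):
    """Expected fraction of problems solved at least once in k attempts.

    Assumes every attempt on a problem succeeds independently with that
    problem's single-attempt probability (a simplifying assumption).
    """
    return sum(1 - (1 - p) ** k for p in probs) / len(probs)


# Hypothetical single-attempt success probabilities for ten toy problems.
# These are invented for illustration and are NOT benchmark measurements.
toy_probs = [0.9, 0.8, 0.5, 0.3, 0.2, 0.1, 0.1, 0.0, 0.0, 0.0]

# Single-shot rate: the average chance of solving a problem in one try.
single_shot = pass_at_k(toy_probs, 1)

# Ceiling: problems with zero chance stay unsolved however often we retry,
# so the limit is the share of problems with a nonzero probability.
ceiling = sum(p > 0 for p in toy_probs) / len(toy_probs)

print(f"single-shot rate: {single_shot:.2f}")
print(f"ceiling with unlimited attempts: {ceiling:.2f}")
for k in (1, 4, 16, 64, 256):
    print(f"pass@{k}: {pass_at_k(toy_probs, k):.3f}")
```

In this toy setup, more attempts lift the solve rate toward the ceiling but never past it, because retries only help on problems the model has some chance of solving in the first place.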
