Google is making a bold play for the future of artificial intelligence, announcing its seventh-generation Tensor Processing Units, dubbed Ironwood, alongside new Arm-based Axion virtual machines. Unveiled on November 6, 2025, these custom-designed chips are Google's latest salvo in the escalating compute arms race, specifically targeting the burgeoning "age of inference" where the focus shifts from merely training massive AI models to efficiently serving them at scale.
The company's VP/GM of AI & Infrastructure, Amin Vahdat, and VP & GM of Compute and AI Infrastructure, Mark Lohmeyer, highlighted a critical industry pivot. As models like Gemini and Claude become ubiquitous, the challenge isn't just building them, but powering the "useful, responsive interactions" that define modern AI applications. This new era, characterized by constantly evolving model architectures, the rise of complex agentic workflows, and near-exponential demand for compute, demands a fresh approach to infrastructure. Google believes its vertically integrated, custom silicon strategy is the answer.
