Google Unveils Ironwood TPU for AI Inference
InfoQ, Friday, May 2nd, 2025
Google has unveiled its seventh-generation Tensor Processing Unit (TPU), Ironwood, at Google Cloud Next 25. Ironwood is Google's most performant and scalable custom AI accelerator to date and the first TPU designed specifically for inference workloads.
Google emphasizes that Ironwood is designed to power what they call the "age of inference," marking a shift from responsive AI models to proactive models that generate insights and interpretations. The company states that AI agents will use Ironwood to retrieve and generate data, delivering insights and answers.