Back Issues This Week → Current Issue → Popular →

All issuesVolume 325, Issue 4IT Vendor NewsNVIDIA

How The Economics Of Inference Can Maximize AI Value

NVIDIA News, Friday, April 25th, 2025

Understanding the cost of AI in production can help users achieve high-quality performance and profitability.

As AI models evolve and adoption grows, enterprises must perform a delicate balancing act to achieve maximum value.

That's because inference - the process of running data through a model to get an output - offers a different computational challenge than training a model.

Pretraining a model - the process of ingesting data, breaking it down into tokens and finding patterns - is essentially a one-time cost. But in inference, every prompt to a model generates tokens, each of which incur a cost.

more →  ·  More from NVIDIA →