Volume 317, Issue 4 · IT Vendor News · NVIDIA

NVIDIA Blackwell Sets New Standard for Generative AI in MLPerf Inference Debut

NVIDIA News, Wednesday, August 28th, 2024

First submission using the NVIDIA Blackwell GPU delivers up to 4x more performance on Llama 2 70B, and NVIDIA Hopper architecture powers large gains across industry AI benchmarks.

As enterprises race to adopt generative AI and bring new services to market, the demands on data center infrastructure have never been greater. Training large language models is one challenge, but delivering LLM-powered real-time services is another.

In the latest round of MLPerf industry benchmarks, Inference v4.1, NVIDIA platforms delivered leading performance across all data center tests. The first-ever submission of the upcoming NVIDIA Blackwell platform revealed up to 4x more performance than the NVIDIA H100 Tensor Core GPU on MLPerf's biggest LLM workload, Llama 2 70B, thanks to its use of a second-generation Transformer Engine and FP4 Tensor Cores.
