NVIDIA: Running AI Workloads On Rack-Scale Supercomputers
HPCwire, Tuesday, April 7th, 2026
The NVIDIA GB200 NVL72 and NVIDIA GB300 NVL72 systems, featuring NVIDIA Blackwell architecture, are rack-scale supercomputers. Each is designed as a single unit that packages 18 tightly coupled compute trays, a massive GPU fabric, and high-bandwidth networking.
For AI architects and HPC platform operators, the challenge isn't just racking and stacking hardware; it's turning infrastructure into safe, performant, and easy-to-use resources for end users. The mismatch between rack-scale hardware topology and scheduler abstractions is where most of the operational complexity lives. Left unaddressed, that mismatch leaves schedulers operating on a flat pool of GPUs and nodes, overlooking the system's hierarchical, topology-sensitive design.
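To make the flat-pool problem concrete, here is a minimal Python sketch that contrasts a topology-blind placement with a rack-aware one. It is an illustration under stated assumptions, not the behavior of any particular scheduler: the function names (`flat_pick`, `rack_aware_pick`) are hypothetical, each rack is assumed to be one tightly coupled GPU domain, and the 4-GPUs-per-tray figure is simply 72 GPUs divided across the 18 compute trays.

```python
from collections import defaultdict

TRAYS_PER_RACK = 18   # compute trays per NVL72 rack (from the article)
GPUS_PER_TRAY = 4     # assumption: 72 GPUs / 18 trays

def build_inventory(num_racks):
    """Return one (rack_id, tray_id) tuple per compute tray."""
    return [(r, t) for r in range(num_racks) for t in range(TRAYS_PER_RACK)]

def flat_pick(free_trays, trays_needed):
    """Flat-pool view: take the first free trays, ignoring rack boundaries."""
    if len(free_trays) < trays_needed:
        return None
    return free_trays[:trays_needed]

def rack_aware_pick(free_trays, trays_needed):
    """Topology-aware view: place the job only if a single rack can hold it."""
    by_rack = defaultdict(list)
    for rack, tray in free_trays:
        by_rack[rack].append((rack, tray))
    # Keep racks with enough free trays, then choose the tightest fit.
    candidates = [trays for trays in by_rack.values() if len(trays) >= trays_needed]
    if not candidates:
        return None
    return min(candidates, key=len)[:trays_needed]

def racks_spanned(placement):
    """Count how many distinct racks a placement touches."""
    return len({rack for rack, _ in placement})

# Hypothetical state: rack 0 has only 4 free trays left, rack 1 is idle.
free = [t for t in build_inventory(2) if not (t[0] == 0 and t[1] < 14)]
job_trays = 8  # 8 trays * 4 GPUs = 32 GPUs

flat = flat_pick(free, job_trays)
aware = rack_aware_pick(free, job_trays)
print("flat placement spans racks:", racks_spanned(flat))
print("rack-aware placement spans racks:", racks_spanned(aware))
```

In this toy scenario the flat picker splits the job across both racks, while the rack-aware picker keeps it inside one. A production scheduler would weigh more than this (fragmentation, queue fairness, partial-rack jobs), so treat the best-fit rule here as one possible policy rather than a recommendation.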