AWS Announces Amazon EC2 Capacity Blocks For ML Workloads
AWS News, Tuesday, October 31,2023
First-of-its-kind consumption model enables customers to reserve high-performance Amazon EC2 UltraClusters of NVIDIA GPUs to accelerate their generative AI development
AWS announced the general availability of Amazon Elastic Compute Cloud (EC2) Capacity Blocks for ML, an industry-first consumption model that enables any customer to access highly sought-after GPU compute capacity to run their short duration machine learning (ML) workloads.
With EC2 Capacity Blocks, customers can reserve hundreds of NVIDIA GPUs colocated in Amazon EC2 UltraClusters designed for high-performance ML workloads. Customers can use EC2 Capacity Blocks with P5 instances, powered by the latest NVIDIA H100 Tensor Core GPUs, by specifying their cluster size, future start date, and duration.