Shining Brighter Together: Google's Gemma Optimized to Run on NVIDIA GPUs
NVIDIA News, Wednesday, February 21st, 2024
New open language models from Google accelerated by TensorRT-LLM across NVIDIA AI platforms, including local RTX AI PCs.
NVIDIA, in collaboration with Google, launched optimizations across all NVIDIA AI platforms for Gemma, Google's state-of-the-art new lightweight 2 billion- and 7 billion-parameter open language models. The models can be run anywhere, reducing costs and speeding innovative work for domain-specific use cases.
Teams from the two companies worked closely to accelerate the performance of Gemma, which is built from the same research and technology used to create the Gemini models, with NVIDIA TensorRT-LLM, an open-source library for optimizing large language model inference. The acceleration applies when Gemma runs on NVIDIA GPUs in the data center, in the cloud, and locally on workstations with NVIDIA RTX GPUs or PCs with GeForce RTX GPUs.