
NVIDIA's NVFP4 enables 4-bit LLM training without the accuracy trade-off

NVIDIA, Monday, November 10th, 2025

Researchers at NVIDIA have developed a new approach for training large language models (LLMs) in a 4-bit format while preserving their stability and accuracy.

The new technique, called NVFP4, makes it possible to train quantized models that match the performance of larger 8-bit models while using half the memory and a fraction of the compute.

The success of NVFP4 shows a path toward cutting the cost of AI by running leaner models that match the performance of larger ones. It could also pave the way for a future in which the cost of training LLMs falls far enough to make training custom models more accessible.
