A low-precision numerical format optimized by NVIDIA that uses fewer bits per number than standard formats, enabling efficient inference on NVIDIA GPUs while maintaining reasonable accuracy.