Confidence-based decoding in diffusion models is provably efficient and adapts automatically to data complexity, offering a theoretical foundation for why this practical strategy works well.
This paper proves that confidence-based decoding, the strategy of selecting which tokens to generate next in a diffusion language model according to the model's prediction confidence, comes with rigorous efficiency guarantees.
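To make the strategy concrete, below is a minimal sketch of one common form of confidence-based decoding for a masked diffusion language model: at each step, predict every still-masked position, then commit the positions whose top prediction is most confident. The `toy_model`, the `MASK` sentinel, and the fixed `tokens_per_step` schedule are illustrative stand-ins and are not taken from the paper.

```python
import numpy as np

MASK = -1  # sentinel id for a still-masked position (illustrative choice)


def toy_model(tokens, vocab_size, rng):
    """Stand-in for a diffusion LM: returns a probability distribution
    over the vocabulary at every position (random here, for illustration)."""
    logits = rng.standard_normal((len(tokens), vocab_size))
    probs = np.exp(logits - logits.max(axis=-1, keepdims=True))
    return probs / probs.sum(axis=-1, keepdims=True)


def confidence_decode(seq_len, vocab_size, tokens_per_step=4, seed=0):
    """Greedy confidence-based decoding sketch: at each step, predict all
    masked positions and commit the ones with the highest max-probability."""
    rng = np.random.default_rng(seed)
    tokens = np.full(seq_len, MASK, dtype=np.int64)
    while (tokens == MASK).any():
        probs = toy_model(tokens, vocab_size, rng)
        confidence = probs.max(axis=-1)         # confidence of the best token per position
        confidence[tokens != MASK] = -np.inf    # never re-decode committed positions
        # Commit the most confident masked positions this step.
        k = min(tokens_per_step, int((tokens == MASK).sum()))
        chosen = np.argsort(confidence)[-k:]
        tokens[chosen] = probs[chosen].argmax(axis=-1)
    return tokens


print(confidence_decode(seq_len=16, vocab_size=100))
```

The key design choice this sketch illustrates is that the number of tokens finalized per step is driven by how confident the model is, which is exactly the quantity the paper's analysis ties to data complexity.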