Differentiable Zero-One Loss via Hypersimplex Projections

Camilo Gomez, Pengyang Wang, Liansheng Tang|February 26, 2026arXiv

Key Takeaway

You can now directly optimize for classification accuracy during training instead of using proxy losses, improving performance especially when trai...

Summary

This paper solves a long-standing problem in machine learning: how to optimize the zero-one loss (the metric that actually measures classification accuracy) using gradient descent. The authors create a smooth mathematical approximation that lets you backpropagate through this loss, which helps models train better on large batches of data.

training

Key Terms

zero-one-loss differentiable-approximation gradient-based-optimization hypersimplex