Memory access, not computation speed, limits performance in probabilistic AI systems; hardware designers therefore need to co-optimize data delivery and randomness generation rather than treating them separately.
This paper examines how memory systems become the performance bottleneck in AI systems that need probabilistic computation for safety and robustness. It proposes treating deterministic data access as a special case of stochastic sampling, creating a unified framework to analyze memory efficiency.
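The unification can be illustrated with a minimal sketch (an illustration of the idea, not code from the paper): if every memory access is modeled as drawing an address from a categorical distribution, then a deterministic read is simply the degenerate case where all probability mass sits on a single address. The `sample_address` function and the example distributions below are hypothetical names introduced here for illustration.

```python
import random

def sample_address(dist):
    """Draw an address from a categorical distribution {address: probability}."""
    addrs = list(dist)
    weights = [dist[a] for a in addrs]
    return random.choices(addrs, weights=weights, k=1)[0]

# Stochastic access: the sampler may touch any of several addresses,
# so the memory system must be ready to deliver data from all of them.
stochastic = {0x10: 0.5, 0x20: 0.3, 0x30: 0.2}

# Deterministic access as the degenerate special case:
# all probability mass on one address, so no randomness is consumed
# and the access pattern is fully predictable.
deterministic = {0x40: 1.0}

assert sample_address(deterministic) == 0x40  # always the same address
assert sample_address(stochastic) in stochastic
```

Under this view, prefetching and bandwidth provisioning for a deterministic workload and entropy delivery for a sampling workload become two points on one analytical spectrum, which is what makes a single memory-efficiency framework possible.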