Online learning in RNNs doesn't require sophisticated credit assignment algorithms—proper gradient normalization with immediate derivatives is sufficient and dramatically more memory-efficient.
Recurrent networks can learn online using simple immediate derivatives instead of expensive backpropagation-through-time. The key insight: the hidden state naturally carries temporal information forward, so all that is needed is proper gradient normalization and avoiding stale memory traces. This approach matches or beats more complex credit-assignment algorithms while using roughly 1000x less memory.
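A minimal sketch of the idea: each step computes gradients through the current timestep only (the previous hidden state is treated as a constant, so no history needs to be stored), normalizes the gradient, and applies it immediately. The vanilla tanh RNN, squared-error loss, and hyperparameters below are illustrative assumptions, not the original authors' setup.

```python
import numpy as np

def rnn_step(W_h, W_x, h_prev, x):
    """One recurrent step: h_t = tanh(W_h @ h_{t-1} + W_x @ x_t)."""
    return np.tanh(W_h @ h_prev + W_x @ x)

def immediate_grads(W_h, W_x, h_prev, x, target):
    """Gradients through the current step only: h_prev is treated as a
    constant, so nothing is backpropagated through time."""
    h = rnn_step(W_h, W_x, h_prev, x)
    err = h - target                 # dL/dh for squared-error loss
    d_pre = err * (1.0 - h ** 2)     # chain through tanh'
    return np.outer(d_pre, h_prev), np.outer(d_pre, x), h

def normalized_update(W, g, lr=0.1, eps=1e-8):
    """Scale the immediate gradient to unit norm before applying it."""
    return W - lr * g / (np.linalg.norm(g) + eps)

rng = np.random.default_rng(0)
n_h, n_x = 8, 4
W_h = rng.normal(scale=0.3, size=(n_h, n_h))
W_x = rng.normal(scale=0.3, size=(n_h, n_x))

# Online loop: a fresh gradient each step, O(1) memory in sequence length.
h = np.zeros(n_h)
target = np.full(n_h, 0.5)           # toy fixed target, for illustration
losses = []
for _ in range(200):
    x = rng.normal(size=n_x)
    gW_h, gW_x, h_new = immediate_grads(W_h, W_x, h, x, target)
    losses.append(float(np.mean((h_new - target) ** 2)))
    W_h = normalized_update(W_h, gW_h)
    W_x = normalized_update(W_x, gW_x)
    h = h_new                        # the state itself still flows forward
```

Note the contrast with BPTT: memory is constant because only the current step's activations are ever held, while the forward-propagated hidden state is what carries temporal context between updates.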