Universal Hypernetworks for Arbitrary Models

Xuanfeng Zhou|April 2, 2026arXiv

Key Takeaway

A single fixed hypernetwork can generate weights for diverse architectures and tasks by using architecture/task descriptors as input, eliminating the need to retrain generators when switching between different model types.

Summary

This paper introduces Universal Hypernetworks (UHN), a single neural network that can generate weights for many different model architectures and tasks. Instead of building separate weight generators for each model type, UHN uses descriptors (text descriptions of architecture and task) to produce weights for any compatible model, working across vision, graphs, text, and math tasks.

architecture training efficiency

Key Terms

hypernetwork weight-generation descriptor-based-generation multi-task-learning