The computational resources and time required to run a model on new inputs, typically measured in memory usage and processing time.