A measure of uncertainty in an agent's decision-making at a given state—how much of the decision space lacks statistical support from training data.