Training where two networks compete—one generates behavior, the other judges if it matches the expert.