Dividing images or video frames into meaningful regions and assigning each region a label that identifies what it represents.
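As a rough illustration of the two steps named above (splitting into regions, then labeling them), here is a minimal sketch on a tiny grayscale grid. It is a toy stand-in, not how production segmentation models work: the function name `segment`, the brightness threshold of 128, and the class names "foreground" and "background" are all assumptions chosen for the example.

```python
from collections import deque


def segment(image, threshold=128):
    """Group 4-connected pixels that fall on the same side of the threshold.

    Returns (labels, classes): labels is a grid of region ids starting at 1,
    and classes maps each region id to "foreground" or "background".
    Both class names and the threshold are illustrative assumptions.
    """
    height, width = len(image), len(image[0])
    labels = [[0] * width for _ in range(height)]  # 0 means "not yet assigned"
    classes = {}
    next_id = 0

    for r in range(height):
        for c in range(width):
            if labels[r][c]:
                continue  # pixel already belongs to a region
            next_id += 1
            bright = image[r][c] >= threshold
            classes[next_id] = "foreground" if bright else "background"
            labels[r][c] = next_id

            # Breadth-first flood fill over neighbours of the same kind.
            queue = deque([(r, c)])
            while queue:
                y, x = queue.popleft()
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (
                        0 <= ny < height
                        and 0 <= nx < width
                        and not labels[ny][nx]
                        and (image[ny][nx] >= threshold) == bright
                    ):
                        labels[ny][nx] = next_id
                        queue.append((ny, nx))

    return labels, classes


# Hypothetical 4x4 image: a bright block top-right and one bright pixel bottom-left.
image = [
    [10, 10, 200, 200],
    [10, 10, 200, 200],
    [10, 10, 10, 10],
    [220, 10, 10, 10],
]
labels, classes = segment(image)
for row in labels:
    print(row)
print(classes)
```

Each distinct id in `labels` marks one region, and `classes` records what that region is taken to represent. For video, the same idea would be applied per frame, usually with some extra step to keep region identities consistent over time.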