Creating candidate regions or concepts from input (e.g., converting text queries into visual targets).
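A minimal sketch of the "candidate regions" case may help make this concrete. It assumes the input is an image of known size and that candidates are fixed-size sliding windows, which is only one simple strategy and not necessarily the one meant here. The function name `sliding_window_candidates` and its parameters are hypothetical, chosen for illustration. Turning a text query into a visual target would normally need a learned model and is not shown.

```python
from typing import List, Tuple

# A box is (x_min, y_min, x_max, y_max) in pixel coordinates.
Box = Tuple[int, int, int, int]


def sliding_window_candidates(
    width: int, height: int, window: int, stride: int
) -> List[Box]:
    """Enumerate square candidate regions that fit fully inside the image.

    Hypothetical helper: illustrates candidate generation only, with no
    scoring or filtering of the resulting regions.
    """
    if window <= 0 or stride <= 0:
        raise ValueError("window and stride must be positive")
    boxes: List[Box] = []
    # Step the window's top-left corner across the image, row by row.
    for y in range(0, height - window + 1, stride):
        for x in range(0, width - window + 1, stride):
            boxes.append((x, y, x + window, y + window))
    return boxes


if __name__ == "__main__":
    candidates = sliding_window_candidates(width=8, height=6, window=4, stride=2)
    for box in candidates:
        print(box)
```

In a fuller pipeline, a later stage would typically score or filter these candidates against the query. That stage is outside the scope of this sketch.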