Automated planning can solve the joint problem of designing distributed data pipelines and scheduling them on real infrastructure, enabling users to specify workflows declaratively rather than imperatively.
This paper introduces WORKSWORLD, a planning domain for automatically designing and scheduling data pipelines across distributed computer systems. Instead of manually specifying how data flows between processing components, users describe their data sources, available tools, and desired outputs, and an AI planner derives the optimal workflow and resource allocation from that description.
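To make the declarative idea concrete, the following is a minimal sketch and not the WORKSWORLD domain itself. It assumes that tools can be modeled as simple input/output transformations over named data items, and it uses a plain breadth-first forward search in place of a real planner. The `Tool` type, the `plan` function, and all tool and data names are hypothetical, introduced only for illustration. Scheduling and resource allocation are deliberately left out.

```python
from collections import deque
from typing import NamedTuple


class Tool(NamedTuple):
    """Hypothetical tool description: consumes and produces named data items."""
    name: str
    inputs: frozenset
    outputs: frozenset


def plan(sources, tools, goal):
    """Breadth-first forward search over sets of available data items.

    Returns a shortest list of tool names that derives every item in
    `goal` from `sources`, or None if no such sequence exists.
    """
    start = frozenset(sources)
    goal = frozenset(goal)
    queue = deque([(start, [])])
    seen = {start}
    while queue:
        have, steps = queue.popleft()
        if goal <= have:
            return steps
        for tool in tools:
            # A tool applies if its inputs are available and it adds something new.
            if tool.inputs <= have and not tool.outputs <= have:
                nxt = have | tool.outputs
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append((nxt, steps + [tool.name]))
    return None


# Declarative specification (all names are illustrative assumptions).
tools = [
    Tool("parse_logs", frozenset({"raw_logs"}), frozenset({"events"})),
    Tool("aggregate", frozenset({"events"}), frozenset({"daily_counts"})),
    Tool("render_report", frozenset({"daily_counts"}), frozenset({"report"})),
    Tool("archive", frozenset({"raw_logs"}), frozenset({"backup"})),
]

result = plan({"raw_logs"}, tools, {"report"})
print(result)
```

The user states only what exists (`raw_logs`), what the tools do, and what is wanted (`report`); the order of steps comes out of the search, and the irrelevant `archive` tool is left out of the resulting plan. A full planning domain would additionally have to model where each step runs and what resources it consumes.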