The process of automatically reading and extracting structured information like text, tables, and layout from documents.