Current AI models struggle with real-world table-text reasoning; SPARTA exposes this gap with automatically-generated, complex multi-hop questions ...
SPARTA is a benchmark for testing AI models on complex questions that require reasoning across both text and tables together.