Adding simple edge detection to flowchart images helps VLMs understand topology better. It is a practical, training-free technique that improves industrial document processing by 11-17 percentage points without requiring annotated data.
EdgeFlow improves how Vision Language Models (VLMs) convert flowcharts into machine-readable formats by adding edge detection as a visual guide. The method needs no training data or fine-tuning, and it improves results on real-world industrial flowcharts by helping the model better understand the structure of the diagram and the connections between its elements.
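
To make the idea of an edge-detection "visual guide" concrete, here is a minimal sketch of the preprocessing step. It assumes a Sobel operator with a fixed threshold; the summary above does not say which edge detector EdgeFlow uses or how the edge map is passed to the model, so the function name `sobel_edges`, the `threshold` parameter, and the toy image are all illustrative assumptions rather than the authors' implementation.

```python
from typing import List

# Grayscale image as rows of pixel intensities in the range 0-255.
Image = List[List[int]]


def sobel_edges(img: Image, threshold: int = 128) -> Image:
    """Return a binary edge map (255 = edge, 0 = no edge).

    Assumption: a plain Sobel operator with an L1 gradient magnitude.
    Border pixels are left at 0 because the 3x3 kernel does not fit there.
    """
    h, w = len(img), len(img[0])
    edges = [[0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            # Horizontal gradient: right column minus left column.
            gx = (img[y - 1][x + 1] + 2 * img[y][x + 1] + img[y + 1][x + 1]
                  - img[y - 1][x - 1] - 2 * img[y][x - 1] - img[y + 1][x - 1])
            # Vertical gradient: bottom row minus top row.
            gy = (img[y + 1][x - 1] + 2 * img[y + 1][x] + img[y + 1][x + 1]
                  - img[y - 1][x - 1] - 2 * img[y - 1][x] - img[y - 1][x + 1])
            if abs(gx) + abs(gy) >= threshold:
                edges[y][x] = 255
    return edges


# Toy "flowchart": a white 5x5 image with one black vertical connector line.
demo: Image = [[255, 255, 0, 255, 255] for _ in range(5)]
edges = sobel_edges(demo)

for row in edges:
    print("".join("#" if v else "." for v in row))
```

In this sketch the detector marks the pixels on either side of the connector line, where intensity changes sharply. In a pipeline along the lines described above, such an edge map would accompany (or be overlaid on) the original flowchart image in the prompt to the VLM; which of the two the method uses is not stated here.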