Dynamic graph-based feature connections outperform fixed spatial neighborhoods for reconstructing deformable surgical scenes, especially when dealing with occlusions and low-texture surfaces.
EndoVGGT improves 3D reconstruction of soft tissues during surgery by using a graph neural network module that dynamically connects similar tissue regions across the image, even when instruments block the view or surfaces are shiny. This approach recovers the true shape of deformable tissues better than previous methods and works on new surgical videos it hasn't seen before.