SIMIFY: Generative Real-to-Sim Enables Multi-Object Spatial and Physical Reasoning
arXiv, March 2026
TL;DR: A training-free, test-time framework that reconstructs simulation-ready assets from a single RGB-D image using 3D generative and vision-language models, then launches thousands of parallel physics rollouts with evolutionary search to optimize object arrangements for language-specified tasks. Achieves 67% success on real-robot hardware, surpassing baselines using off-the-shelf foundation models.
