Collections of ICLR 2026 paper: "OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models"
Zekun Qi
qizekun
AI & ML interests
Embodied Intelligence, Large Langugae Model, 3D Computer Vision
Recent Activity
upvoted a collection 3 days ago
SenseNova-U1 authored a paper 4 days ago
ImageWAM: Do World Action Models Really Need Video Generation, or Just Image Editing?