Precise Debugging Benchmark: Is Your Model Debugging or Regenerating? Paper • 2604.17338 • Published 15 days ago • 4
Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning Paper • 2507.16746 • Published Jul 22, 2025 • 35
Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions Paper • 2412.08737 • Published Dec 11, 2024 • 54