Chenming Zhu

ChaimZhu

https://zcmax.github.io/

AI & ML interests

Multimodal Large Language Models, 3D Perception and Understanding, Embodied AI

Recent Activity

upvoted a paper 10 days ago

G^2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning

Paper • 2511.21688 • Published 10 days ago • 8

updated a model 12 days ago

InternRobotics/InternVLA-N1

Robotics • 8B • Updated 12 days ago • 742 • 36

upvoted a paper 2 months ago

SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer

Paper • 2509.24695 • Published Sep 29 • 45

updated a model 3 months ago

InternRobotics/InternVLA-N1-Preview

Robotics • 8B • Updated Sep 1 • 2 • 6

published a model 3 months ago

InternRobotics/InternVLA-N1

Robotics • 8B • Updated 12 days ago • 742 • 36

upvoted a paper 3 months ago

T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation

Paper • 2508.17472 • Published Aug 24 • 26

liked a dataset 4 months ago

jasonzhango/SPAR-7M-RGBD

Updated Jun 15 • 460 • 6

updated a model 4 months ago

InternRobotics/InternVLA-N1-S2

8B • Updated Jul 28 • 355 • 1

published a model 4 months ago

InternRobotics/InternVLA-N1-S2

8B • Updated Jul 28 • 355 • 1

liked a model 4 months ago

moonshotai/Kimi-K2-Instruct

Text Generation • 1T • Updated 30 days ago • 172k • 2.27k

authored a paper 5 months ago

OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding

Paper • 2507.07984 • Published Jul 10 • 42

updated a dataset 5 months ago

ChaimZhu/LLaVA-3D-Data

Viewer • Updated Jul 11 • 859k • 83

published a dataset 5 months ago

ChaimZhu/LLaVA-3D-Data

Viewer • Updated Jul 11 • 859k • 83

upvoted a paper 5 months ago

OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding

Paper • 2507.07984 • Published Jul 10 • 42

commented on a paper 5 months ago

OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding

Paper • 2507.07984 • Published Jul 10 • 42

liked a dataset 5 months ago

rbler/OST-Bench

Updated 8 days ago • 138 • 4

upvoted 3 papers 5 months ago

StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling

Paper • 2507.05240 • Published Jul 7 • 47

OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion

Paper • 2507.06165 • Published Jul 8 • 58

GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning

Paper • 2506.16141 • Published Jun 19 • 27

upvoted a paper 7 months ago

GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning

Paper • 2505.17022 • Published May 22 • 27
