ZehongMa

zehongma

https://zehong-ma.github.io/

zehong-ma

AI & ML interests

MLLMs, Image/Video Generation, Multi-modal Representation Learning

Recent Activity

upvoted a paper 16 days ago

UniDDT: Unifying Multimodal Understanding and Generation with Decoupled Diffusion Transformer

upvoted a paper about 1 month ago

Representation Forcing for Bottleneck-Free Unified Multimodal Models

upvoted a paper about 2 months ago

Qwen-Image-2.0 Technical Report

Organizations

None yet

Papers 2

arxiv:2511.19365

arxiv:2506.09045

Spaces 1

DeCo

Embed and display a remote webpage

Models 3

zehongma/PixelGen

Updated Feb 1 • 1

zehongma/DeCo

Updated Nov 25, 2025 • 4

zehongma/OVMR

Updated Jun 16, 2025

Datasets 1

zehongma/ImageNet21k_OVR

Viewer • Updated Oct 10, 2024 • 20.3k • 14