arxiv:2511.19365
ZehongMa
zehongma
AI & ML interests
MLLMs, Image/Video Generation, Multi-modal Representation Learning
Recent Activity
upvoted a paper about 19 hours ago
Continuous Latent Diffusion Language Model upvoted a paper 3 days ago
Video Generation with Predictive Latents upvoted an article about 1 month ago
PRX Part 3 — Training a Text-to-Image Model in 24h!Organizations
None yet