VISIONx @ NYU

university

https://www.sainingxie.com/

AI & ML interests

None defined yet.

Recent Activity

bytetriper submitted a paper about 1 hour ago

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

AustinWang0330 published a model about 1 hour ago

nyu-visionx/webssl300m_decoder

AustinWang0330 published a model about 1 hour ago

nyu-visionx/siglip2_decoder

View all activity

Papers

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

SIMS-V: Simulated Instruction-Tuning for Spatial Video Understanding

View all Papers

Organization Card

Community About org cards

Edit this README.md markdown file to author your organization card.

Collections 7

View 7 collections

models 36

nyu-visionx/webssl300m_decoder

Updated about 12 hours ago • 3

nyu-visionx/siglip2_decoder

Updated 14 days ago • 39

nyu-visionx/Scale-RAE-Qwen7B_DiT9.8B

Text Generation • 17B • Updated 14 days ago • 3

nyu-visionx/Scale-RAE-Qwen1.5B_DiT2.4B

Text Generation • 4B • Updated 14 days ago • 135

nyu-visionx/Scale-RAE-Qwen1.5B_DiT2.4B-WebSSL

4B • Updated 14 days ago • 17

nyu-visionx/Cambrian-S-3B-S3

3B • Updated 19 days ago • 251

nyu-visionx/Cambrian-S-3B-S2

3B • Updated 19 days ago • 278

nyu-visionx/Cambrian-S-3B-S1

3B • Updated 19 days ago • 7

nyu-visionx/Cambrian-S-1.5B-S3

2B • Updated 19 days ago • 185

nyu-visionx/Cambrian-S-1.5B-S2

2B • Updated 19 days ago • 287

datasets 13

nyu-visionx/Cambrian-S-3M

Updated about 10 hours ago • 10k • 2

nyu-visionx/scale-rae-data

Updated 11 days ago • 39 • 1

nyu-visionx/VSI-Bench

Viewer • Updated Nov 11, 2025 • 10.3k • 7.9k • 58

nyu-visionx/VSI-Train-10k

Viewer • Updated Nov 7, 2025 • 10k • 663 • 3

nyu-visionx/VSI-SUPER-Count

Viewer • Updated Nov 7, 2025 • 400 • 1.05k • 4

nyu-visionx/VSI-SUPER-Recall

Viewer • Updated Nov 7, 2025 • 300 • 875 • 3

nyu-visionx/VSI-590K

Preview • Updated Nov 7, 2025 • 3.59k • 11

nyu-visionx/CV-Bench

Viewer • Updated Jul 20, 2025 • 5.28k • 4.32k • 41

nyu-visionx/pyramid_flow_ft_results

Viewer • Updated Mar 30, 2025 • 8.42k • 59

nyu-visionx/pisa-experiments

Updated Mar 18, 2025 • 148 • 2

View 13 datasets