Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs Paper • 2512.07525 • Published 2 days ago • 51
Back to Basics: Let Denoising Generative Models Denoise Paper • 2511.13720 • Published 23 days ago • 64
PAN: A World Model for General, Interactable, and Long-Horizon World Simulation Paper • 2511.09057 • Published 28 days ago • 75
Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers Paper • 2401.11605 • Published Jan 21, 2024 • 23
Advancing End-to-End Pixel Space Generative Modeling via Self-supervised Pre-training Paper • 2510.12586 • Published Oct 14 • 108
From Pixels to Words -- Towards Native Vision-Language Primitives at Scale Paper • 2510.14979 • Published Oct 16 • 65
Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference Paper • 2508.02193 • Published Aug 4 • 132
view article Article What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware Aug 8 • 29
facebook/dinov3-vith16plus-pretrain-lvd1689m Image Feature Extraction • 0.8B • Updated Aug 19 • 106k • 35
facebook/dinov3-vits16-pretrain-lvd1689m Image Feature Extraction • 21.6M • Updated Aug 19 • 315k • 56