TriAttention: Efficient Long Reasoning with Trigonometric KV Compression Paper • 2604.04921 • Published 3 days ago • 83
The Y-Combinator for LLMs: Solving Long-Context Rot with λ-Calculus Paper • 2603.20105 • Published 20 days ago • 37
HiAR: Efficient Autoregressive Long Video Generation via Hierarchical Denoising Paper • 2603.08703 • Published about 1 month ago • 32
Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence Paper • 2603.07660 • Published Mar 8 • 85
WildActor: Unconstrained Identity-Preserving Video Generation Paper • 2603.00586 • Published Feb 28 • 37
Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing Paper • 2603.03143 • Published Mar 3 • 145
TDM-R1: Reinforcing Few-Step Diffusion Models with Non-Differentiable Reward Paper • 2603.07700 • Published Mar 8 • 13
EchoTorrent: Towards Swift, Sustained, and Streaming Multi-Modal Video Generation Paper • 2602.13669 • Published Feb 14 • 2
S2DiT: Sandwich Diffusion Transformer for Mobile Streaming Video Generation Paper • 2601.12719 • Published Jan 19 • 1
DDiT: Dynamic Patch Scheduling for Efficient Diffusion Transformers Paper • 2602.16968 • Published Feb 19 • 12
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens Paper • 2603.02138 • Published Mar 2 • 151
Moonshine: Speech Recognition for Live Transcription and Voice Commands Paper • 2410.15608 • Published Oct 21, 2024 • 12
JavisDiT++: Unified Modeling and Optimization for Joint Audio-Video Generation Paper • 2602.19163 • Published Feb 22 • 14
BitDance: Scaling Autoregressive Generative Models with Binary Tokens Paper • 2602.14041 • Published Feb 15 • 53