AT^2PO: Agentic Turn-based Policy Optimization via Tree Search Paper • 2601.04767 • Published 3 days ago • 22
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published 3 days ago • 134
RedBench: A Universal Dataset for Comprehensive Red Teaming of Large Language Models Paper • 2601.03699 • Published 4 days ago • 5
NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation Paper • 2601.02204 • Published 6 days ago • 56
InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields Paper • 2601.03252 • Published 5 days ago • 94
VINO: A Unified Visual Generator with Interleaved OmniModal Context Paper • 2601.02358 • Published 6 days ago • 28
NitroGen: An Open Foundation Model for Generalist Gaming Agents Paper • 2601.02427 • Published 7 days ago • 35
VAR RL Done Right: Tackling Asynchronous Policy Conflicts in Visual Autoregressive Generation Paper • 2601.02256 • Published 6 days ago • 30
InfiniteVGGT: Visual Geometry Grounded Transformer for Endless Streams Paper • 2601.02281 • Published 6 days ago • 29
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos Paper • 2601.00393 • Published 10 days ago • 109
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation Paper • 2601.00664 • Published 9 days ago • 50
Nested Browser-Use Learning for Agentic Information Seeking Paper • 2512.23647 • Published 13 days ago • 17
Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation Paper • 2512.23705 • Published 13 days ago • 44