OneThinker: All-in-one Reasoning Model for Image and Video Paper • 2512.03043 • Published 5 days ago • 26
Thinking with Programming Vision: Towards a Unified View for Thinking with Images Paper • 2512.03746 • Published 4 days ago • 15
SR-GRPO: Stable Rank as an Intrinsic Geometric Reward for Large Language Model Alignment Paper • 2512.02807 • Published 5 days ago • 7
STP: Self-play LLM Theorem Provers with Iterative Conjecturing and Proving Paper • 2502.00212 • Published Jan 31 • 3
Guided Self-Evolving LLMs with Minimal Human Supervision Paper • 2512.02472 • Published 5 days ago • 47
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration Paper • 2511.21689 • Published 11 days ago • 95
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published 5 days ago • 172
Architecture Decoupling Is Not All You Need For Unified Multimodal Model Paper • 2511.22663 • Published 10 days ago • 28
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper • 2511.18538 • Published 14 days ago • 240
First Frame Is the Place to Go for Video Content Customization Paper • 2511.15700 • Published 18 days ago • 52
Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation Paper • 2511.14993 • Published 19 days ago • 222
view article Article Understanding Model Reasoning Through Thought Anchors: A Comparative Study of Qwen3 and DeepSeek-R1 Jul 23 • 4
Beyond Transcription: Mechanistic Interpretability in ASR Paper • 2508.15882 • Published Aug 21 • 86
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR Paper • 2508.14029 • Published Aug 19 • 118
Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs Paper • 2506.14245 • Published Jun 17 • 44