Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation Paper • 2601.20614 • Published 4 days ago • 114
daVinci-Dev: Agent-native Mid-training for Software Engineering Paper • 2601.18418 • Published 6 days ago • 123
Scaling Embeddings Outperforms Scaling Experts in Language Models Paper • 2601.21204 • Published 3 days ago • 90
ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation Paper • 2601.21420 • Published 3 days ago • 28
Group Distributionally Robust Optimization-Driven Reinforcement Learning for LLM Reasoning Paper • 2601.19280 • Published 5 days ago • 7
Guided Self-Evolving LLMs with Minimal Human Supervision Paper • 2512.02472 • Published Dec 2, 2025 • 54
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability Paper • 2601.18778 • Published 6 days ago • 38
Towards Pixel-Level VLM Perception via Simple Points Prediction Paper • 2601.19228 • Published 5 days ago • 15
Nemotron Speech Collection Open, state-of-the-art, production‑ready enterprise speech models from the NVIDIA Speech research team for ASR, TTS, Speaker Diarization and S2S • 9 items • Updated 2 days ago • 36
RWKV7-Gxx-GGUF Collection GGUF of RWKV7-G series reasoning models • 6 items • Updated 8 days ago • 10
Behavior Knowledge Merge in Reinforced Agentic Models Paper • 2601.13572 • Published 12 days ago • 23
LightOnOCR: A 1B End-to-End Multilingual Vision-Language Model for State-of-the-Art OCR Paper • 2601.14251 • Published 12 days ago • 23