PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning Paper • 2601.05593 • Published 24 days ago • 81
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability Paper • 2601.18778 • Published 7 days ago • 39
OptiMind: Teaching LLMs to Think Like Optimization Experts Paper • 2509.22979 • Published Sep 26, 2025 • 3
The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models Paper • 2601.15165 • Published 12 days ago • 69
Clara-Molecular Collection NVIDIA Clara Models for Molecular Science • 10 items • Updated 4 days ago • 7
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published 25 days ago • 214
Dr. Zero: Self-Evolving Search Agents without Training Data Paper • 2601.07055 • Published 22 days ago • 20
RelayLLM: Efficient Reasoning via Collaborative Decoding Paper • 2601.05167 • Published 25 days ago • 29
Jamba Reasoning 3B Collection AI21's top-performing reasoning model that packs leading scores on intelligence benchmarks and highly-efficient processing into a compact 3B build • 2 items • Updated Oct 8, 2025 • 6
Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 26 items • Updated 6 days ago • 98
Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits Paper • 2512.20578 • Published Dec 23, 2025 • 85
DFlash Collection Block Diffusion for Flash Speculative Decoding • 3 items • Updated 11 days ago • 12
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 69 items • Updated 6 days ago • 330
view article Article M2.1: Multilingual and Multi-Task Coding with Strong Generalization 28 days ago • 37
MiroThinker-v1.5 Collection MiroMind’s Open Source Research Agent for Prediction • 4 items • Updated 17 days ago • 24