The collection for the Paper "Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning"
Mingyang Song
Nickyang
AI & ML interests
LRMs, Long-Context LLMs, LLM Judges, Many-Shot ICL
Organizations
None yet
FastCuRL
The collection for the Paper "Curriculum Reinforcement Learning with Stage-wise
Context Scaling for Efficient Training R1-like Reasoning Models"
-
Nickyang/FastCuRL-1.5B-V3
Text Generation • 2B • Updated • 2 • 4 -
Nickyang/FastCuRL-1.5B-Preview
Text Generation • 2B • Updated • 3 • 8 -
Nickyang/FastCuRL-Data
Viewer • Updated • 82.2k • 17 • 3 -
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation • Updated • 1.45M • • 1.45k
ConciseR
The collection for the Paper "Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning"
FastCuRL
The collection for the Paper "Curriculum Reinforcement Learning with Stage-wise
Context Scaling for Efficient Training R1-like Reasoning Models"
-
Nickyang/FastCuRL-1.5B-V3
Text Generation • 2B • Updated • 2 • 4 -
Nickyang/FastCuRL-1.5B-Preview
Text Generation • 2B • Updated • 3 • 8 -
Nickyang/FastCuRL-Data
Viewer • Updated • 82.2k • 17 • 3 -
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation • Updated • 1.45M • • 1.45k
models 5
Nickyang/ConciseR-Zero-7B-Preview
Text Generation • Updated
• 3 • 1
Nickyang/ConciseR-Zero-7B
Text Generation • Updated
• 1 • 1
Nickyang/FastCuRL-1.5B-V3
Text Generation • 2B • Updated
• 2 • 4
Nickyang/FastCuRL-1.5B-V2
Text Generation • 2B • Updated
• 5 • 1
Nickyang/FastCuRL-1.5B-Preview
Text Generation • 2B • Updated
• 3 • 8