-
Open-Reasoner-Zero/Open-Reasoner-Zero-32B
Reinforcement Learning • Updated • 105 • 33 -
Open-Reasoner-Zero/Open-Reasoner-Zero-7B
Reinforcement Learning • 8B • Updated • 2.02k • 33 -
Open-Reasoner-Zero/Open-Reasoner-Zero-1.5B
Reinforcement Learning • 2B • Updated • 197 • 1 -
Open-Reasoner-Zero/Open-Reasoner-Zero-0.5B
Reinforcement Learning • 0.5B • Updated • 35
AI & ML interests
Scale up the Reasoner-Zero Training
-
Open-Reasoner-Zero/Open-Reasoner-Zero-32B
Reinforcement Learning • Updated • 105 • 33 -
Open-Reasoner-Zero/Open-Reasoner-Zero-7B
Reinforcement Learning • 8B • Updated • 2.02k • 33 -
Open-Reasoner-Zero/Open-Reasoner-Zero-1.5B
Reinforcement Learning • 2B • Updated • 197 • 1 -
Open-Reasoner-Zero/Open-Reasoner-Zero-0.5B
Reinforcement Learning • 0.5B • Updated • 35