Unlocking On-Policy Distillation for Any Model Family 📝 (Running, 81): Improve model performance by transferring knowledge between different model families.
Gpt2 Multiplication Predictor 📈 (Running on Zero, 31): Multiply large numbers using different reasoning methods.