collinear-ai/qwen3-14b-opd-rq3-subset-vllm-accelerate-step34 Text Generation • Updated about 24 hours ago • 22
collinear-ai/qwen3-14b-opd-rq3-subset-vllm-accelerate-step34 Text Generation • Updated about 24 hours ago • 22
WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks Paper • 2601.02439 • Published 22 days ago • 16
VOLD: Reasoning Transfer from LLMs to Vision-Language Models via On-Policy Distillation Paper • 2510.23497 • Published Oct 27, 2025 • 1
On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes Paper • 2306.13649 • Published Jun 23, 2023 • 29
Running 78 Unlocking On-Policy Distillation for Any Model Family 📝 78 Improve model performance by transferring knowledge between different model families