Yuxiang Zhang
TokerZ
AI & ML interests
LLM-based Agent, RL, Large Reasoning Model
Recent Activity
upvoted
a
paper
3 days ago
Black-Box On-Policy Distillation of Large Language Models
upvoted
a
paper
about 1 month ago
Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction
upvoted
a
paper
about 2 months ago
Solving a Million-Step LLM Task with Zero Errors
Organizations
None yet