Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
9
13
Xiaoyu Tan
WIlliam1900
Follow
21world's profile picture
SteveSHEN's profile picture
2 followers
ยท
5 following
https://scholar.google.com/citations?user=ftq5rBYAAAAJ&hl=en
AI & ML interests
None yet
Recent Activity
authored
a paper
3 days ago
Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning
authored
a paper
3 days ago
The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
authored
a paper
3 days ago
AURORA:Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification
View all activity
Organizations
WIlliam1900
's datasets
None public yet