Xiaoyu Tan's picture

9 13

Xiaoyu Tan

WIlliam1900

·

https://scholar.google.com/citations?user=ftq5rBYAAAAJ&hl=en

AI & ML interests

None yet

Recent Activity

authored a paper 3 days ago

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

authored a paper 3 days ago

The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward

authored a paper 3 days ago

AURORA:Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification

View all activity

Organizations

WIlliam1900 's datasets

None public yet