yuqi yang's picture

2

yuqi yang

tzteyang

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 10 months ago

Agent models: Internalizing Chain-of-Action Generation into Reasoning models

Paper • 2503.06580 • Published Mar 9, 2025 • 20

upvoted a paper 12 months ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4, 2025 • 103