12 14 5

Ding

dyyyyyyyy

AI & ML interests

None yet

Recent Activity

new activity about 1 month ago

dyyyyyyyy/FAPO-Critic:Add task categories, tags, paper link, and sample usage

new activity about 1 month ago

dyyyyyyyy/FAPO-GenRM-4B:Improve model card: Add pipeline tag, library name, paper link, and abstract

authored a paper about 1 month ago

FAPO: Flawed-Aware Policy Optimization for Efficient and Reliable Reasoning

View all activity

Organizations

New activity in dyyyyyyyy/FAPO-Critic about 1 month ago

Add task categories, tags, paper link, and sample usage

#1 opened about 1 month ago by

nielsr

New activity in dyyyyyyyy/FAPO-GenRM-4B about 1 month ago

Improve model card: Add pipeline tag, library name, paper link, and abstract

#1 opened about 1 month ago by

nielsr

authored a paper about 1 month ago

FAPO: Flawed-Aware Policy Optimization for Efficient and Reliable Reasoning

Paper • 2510.22543 • Published Oct 26 • 10

commented a paper about 1 month ago

FAPO: Flawed-Aware Policy Optimization for Efficient and Reliable Reasoning

Paper • 2510.22543 • Published Oct 26 • 10 •

updated 2 datasets about 1 month ago

dyyyyyyyy/FAPO-Reasoning-Dataset

Viewer • Updated Oct 28 • 351k • 106

dyyyyyyyy/FAPO-Critic

Viewer • Updated Oct 31 • 87k • 76

updated 2 models about 1 month ago

dyyyyyyyy/FAPO-32B

33B • Updated Oct 28 • 14 • 1

dyyyyyyyy/FAPO-GenRM-4B

Text Generation • 4B • Updated Oct 31 • 100 • 1

updated a collection about 1 month ago

FAPO

Collection

FAPO: Flawed-Aware Policy Optimization for Efficient and Reliable Reasoning. Project Page: https://fapo-rl.github.io/ • 4 items • Updated Oct 24

published a model about 1 month ago

dyyyyyyyy/FAPO-32B

33B • Updated Oct 28 • 14 • 1

published a dataset about 1 month ago

dyyyyyyyy/FAPO-Reasoning-Dataset

Viewer • Updated Oct 28 • 351k • 106

updated a collection about 1 month ago

FAPO

Collection

FAPO: Flawed-Aware Policy Optimization for Efficient and Reliable Reasoning. Project Page: https://fapo-rl.github.io/ • 4 items • Updated Oct 24

published a model about 1 month ago

dyyyyyyyy/FAPO-GenRM-4B

Text Generation • 4B • Updated Oct 31 • 100 • 1

published a dataset about 1 month ago

dyyyyyyyy/FAPO-Critic

Viewer • Updated Oct 31 • 87k • 76

upvoted a paper about 2 months ago

Revisiting Long-context Modeling from Context Denoising Perspective

Paper • 2510.05862 • Published Oct 7 • 20

authored a paper 3 months ago

SCAN: Self-Denoising Monte Carlo Annotation for Robust Process Reward Learning

Paper • 2509.16548 • Published Sep 20

commented a paper 3 months ago

SCAN: Self-Denoising Monte Carlo Annotation for Robust Process Reward Learning

Paper • 2509.16548 • Published Sep 20 •

updated a model 5 months ago

dyyyyyyyy/Qwen2.5-1.5B-GenRM-WithTemplate

2B • Updated Jun 30 • 5

Ding

AI & ML interests

Recent Activity

Organizations

dyyyyyyyy's activity

Add task categories, tags, paper link, and sample usage

Improve model card: Add pipeline tag, library name, paper link, and abstract