CodeGoat24
's Collections
UnifiedReward Training Data
updated
Unified Reward Model for Multimodal Understanding and Generation
Paper
•
2503.05236
•
Published
•
123
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement
Fine-Tuning
Paper
•
2505.03318
•
Published
•
92
CodeGoat24/UnifiedReward-2.0-T2X-score-data
Viewer
•
Updated
•
337k
•
266
CodeGoat24/ImageGen-CoT-Reward-5K
Viewer
•
Updated
•
5.54k
•
116
•
1
CodeGoat24/LLaVA-Critic-113k
Preview
•
Updated
•
172
Viewer
•
Updated
•
21.4k
•
64
CodeGoat24/ShareGPTVideo-DPO
Viewer
•
Updated
•
101k
•
38
Viewer
•
Updated
•
29k
•
195
Preview
•
Updated
•
140
Viewer
•
Updated
•
73.2k
•
46
Viewer
•
Updated
•
72.7k
•
54
Viewer
•
Updated
•
19k
•
46