ysj
sjyuxyz
AI & ML interests
None yet
december papers
- RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response
  Paper • 2412.14922 • Published • 88
- B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
  Paper • 2412.17256 • Published • 47
- OpenAI o1 System Card
  Paper • 2412.16720 • Published • 36
- Revisiting In-Context Learning with Long Context Language Models
  Paper • 2412.16926 • Published • 32
datasets
8
- sjyuxyz/AgentTraj-L-Categorized
  Viewer • Updated • 25.7k • 49
- sjyuxyz/CCPO_Medical-Reasoning
  Viewer • Updated • 500 • 12
- sjyuxyz/CCPO_Code-Reasoning
  Viewer • Updated • 469 • 34
- sjyuxyz/financial-sentiment-analysis
  Viewer • Updated • 100k • 39 • 2
- sjyuxyz/openwebtext-subset-1M-rows
  Viewer • Updated • 1M • 13
- sjyuxyz/GSM-Plus-Formatted
  Viewer • Updated • 25.9k • 119
- sjyuxyz/MMLU-Pro-with-subset
  Viewer • Updated • 26.6k • 779
- sjyuxyz/web3mmlu
  Viewer • Updated • 298 • 21 • 1