rl-rag/rl_rag_sqa_searcharena_rubrics_web_augmented_longform_averaged_outcome_with_system_prompt
Viewer
•
Updated
•
2.94k
•
3
rl-rag/rl_rag_sqa_searcharena_rubrics_web_augmented_outcome_with_new_mcp_system_prompt
Viewer
•
Updated
•
2.94k
•
4
rl-rag/gpqa_diamond_rlvr_no_prompt
Viewer
•
Updated
•
198
•
2
rl-rag/nq_rlvr_no_prompt_f1_test
Viewer
•
Updated
•
3.61k
•
3
rl-rag/tqa_rlvr_no_prompt_f1_test
Viewer
•
Updated
•
17.9k
•
4
rl-rag/hotpotqa_rlvr_no_prompt_f1_test
Viewer
•
Updated
•
7.41k
•
3
rl-rag/2wiki_rlvr_no_prompt_f1_test
Viewer
•
Updated
•
300
•
10
rl-rag/asearcher_short_form_rlvr_with_system_prompt
Viewer
•
Updated
•
70.6k
•
3
rl-rag/verified_miro_trajectories
Viewer
•
Updated
•
9.88k
•
3
rl-rag/rl_rag_sqa_openscholar_rubrics_s2_augmented_longform_averaged_outcome_with_system_prompt
Viewer
•
Updated
•
2.42k
•
1
rl-rag/combined-sft-training-data-v20250824_MiroSystemPrompt
Viewer
•
Updated
•
4.44k
•
1
Viewer
•
Updated
•
3.99k
•
1
rl-rag/rl_rag_sqa_no_retrieval_1k_longform_finegrained_with_system_prompt
Viewer
•
Updated
•
999
•
3
rl-rag/rl_rag_sqa_no_retrieval_1k_longform_averaged_outcome_with_system_prompt
Viewer
•
Updated
•
999
•
1
rl-rag/rl_rag_no_retrieval_1k_longform_rubrics_only_with_system_prompt
Viewer
•
Updated
•
999
•
1
rl-rag/gpt-oss-20b-eval-react-serper
rl-rag/verifiable_synthetic_1k_0814
Viewer
•
Updated
•
1.05k
•
4
rl-rag/verifiable_synthetic_varied_depth_o3_verified
Viewer
•
Updated
•
101
•
7
rl-rag/verifiable_synthetic_depth_one_v2_verified
Viewer
•
Updated
•
114
•
5
rl-rag/combined-sft-training-data-v20250724
Viewer
•
Updated
•
568
•
4
rl-rag/qwq_32b_factualqa_sft_data
Viewer
•
Updated
•
36.5k
•
3