rl-llm-coders/RS_GT_RM_8B_iter2 • Text Classification • 8B • Updated • 4
rl-llm-coders/Multi_SFT_8B • Text Generation • 8B • Updated • 4
rl-llm-coders/RS_SFT_8B_iter2 • Text Generation • 8B • Updated • 5
rl-llm-coders/RS_SFT_8B_iter1 • Text Generation • 8B • Updated • 2
rl-llm-coders/RS_GT_SFT_8B_iter2 • Text Generation • 8B • Updated • 6
rl-llm-coders/RS_GT_SFT_8B_iter1 • Text Generation • 8B • Updated • 1
rl-llm-coders/RS_GT_RM_8B_iter1 • Text Classification • 8B • Updated • 6
rl-llm-coders/RS_GT_RM_8B • Text Classification • 8B • Updated • 5
rl-llm-coders/Final_RS_RM_8B_iter2 • Text Classification • 8B • Updated • 9
rl-llm-coders/Final_RS_RM_8B_iter1 • Text Classification • 8B • Updated • 5
rl-llm-coders/Final_RS_GT_RM_1B_iter1 • Text Classification • 1B • Updated • 5
rl-llm-coders/Final_RS_RM_1B_iter1 • Text Classification • 1B • Updated • 7
rl-llm-coders/RS_GT_RM_1B_iter1 • Text Classification • 1B • Updated • 5
rl-llm-coders/RS_GT_RM_1B_iter0 • Text Classification • 1B • Updated • 5
rl-llm-coders/RS_RM_1B_iter2 • Text Classification • 1B • Updated • 4
rl-llm-coders/RS_RM_1B_iter1 • Text Classification • 1B • Updated • 4
rl-llm-coders/RM_1B_iter0 • Text Classification • 1B • Updated • 6
rl-llm-coders/RM_8B_iter0 • Text Classification • 8B • Updated • 6
rl-llm-coders/Dagger_SFT_8B_iter2 • Text Generation • 8B • Updated • 4
rl-llm-coders/Dagger_SFT_8B_iter1 • Text Generation • 8B • Updated • 6
rl-llm-coders/RS_GT_1B_RM_iter1 • Text Generation • 1B • Updated • 4
rl-llm-coders/RS_GT_1B_SFT_iter1 • Text Generation • 1B • Updated • 7
rl-llm-coders/RS_GT_SFT_1B_iter2 • Text Generation • 1B • Updated • 6
rl-llm-coders/RS_1B_RM_iter0 • Text Generation • 1B • Updated • 7
rl-llm-coders/RS_1B_RM_iter1 • Text Generation • 1B • Updated • 5
rl-llm-coders/RS_1B_RM_iter2 • Text Generation • 1B • Updated • 7
rl-llm-coders/RS_1B_SFT_iter3 • Text Generation • 1B • Updated • 5
rl-llm-coders/RS_1B_SFT_iter2 • Text Generation • 1B • Updated • 6
rl-llm-coders/RS_1B_SFT_iter1 • Text Generation • 1B • Updated • 5
Text Generation • 1B • Updated • 5