benchmark
lmlmcat/cmmlu • Updated Jul 13, 2023 • 24.5k • 73
nlp-waseda/JMMLU • Updated Feb 27, 2024 • 425 • 10
HAERAE-HUB/KMMLU • Updated Mar 5, 2024 • 244k • 13.2k • 95
openai/openai_humaneval • Updated Jan 4, 2024 • 164 • 106k • 355