Hugging Face
imyangyixuan's Collections
LLM_Benchmark_TestOnly
LLM Benchmark
Updated Nov 11, 2025
qwedsacf/competition_math • Viewer • Updated Jan 28, 2023 • 12.5k • 12.6k • 114
TIGER-Lab/MMLU-Pro • Benchmark • Updated 10 days ago • 12.1k • 128k • 457
ChilleD/StrategyQA • Viewer • Updated Aug 26, 2023 • 2.29k • 4.48k • 5
cais/mmlu • Viewer • Updated Mar 8, 2024 • 231k • 387k • 684
allenai/IF_multi_constraints_upto5 • Viewer • Updated Oct 2, 2025 • 95.4k • 1.05k • 23
HuggingFaceH4/MATH-500 • Viewer • Updated Dec 15, 2025 • 500 • 118k • 288