LLM Benchmark - a imyangyixuan Collection

Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

imyangyixuan 's Collections

LLM_Benchmark_TestOnly

LLM Benchmark

updated Nov 11, 2025

qwedsacf/competition_math

Viewer • Updated Jan 28, 2023 • 12.5k • 12.6k • 114
TIGER-Lab/MMLU-Pro

Benchmark • Updated 10 days ago • 12.1k • 128k • 457
ChilleD/StrategyQA

Viewer • Updated Aug 26, 2023 • 2.29k • 4.48k • 5
cais/mmlu

Viewer • Updated Mar 8, 2024 • 231k • 387k • 684
allenai/IF_multi_constraints_upto5

Viewer • Updated Oct 2, 2025 • 95.4k • 1.05k • 23
HuggingFaceH4/MATH-500

Viewer • Updated Dec 15, 2025 • 500 • 118k • 288

Collection guide
Browse collections

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs