arxiv:2508.07485
Tyler Marques
tmarques
AI & ML interests
None yet
Recent Activity
updated a dataset about 1 hour ago
GoodStartLabs/gsl-benchmark-logs
authored a paper 9 months ago
Democratizing Diplomacy: A Harness for Evaluating Any Large Language Model on Full-Press Diplomacy