Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Alex Shaw's picture
137 1 1

Alex Shaw

alexgshaw
lincolnhuj's profile picture mohanz's profile picture evalstate's profile picture
·
https://www.tbench.ai/
  • alexgshaw
  • alexgshaw
  • alexgshaw

AI & ML interests

None yet

Recent Activity

new activity about 18 hours ago
harborframework/terminal-bench-2-leaderboard:Add LemonHarness(GPT-5.3-Codex) submission - 84.5%
new activity 1 day ago
harborframework/terminal-bench-2-leaderboard:Add Capy GPT-5.5 submission
new activity 2 days ago
harborframework/terminal-bench-2-leaderboard:Add Wecode GPT-5.5 Terminal-Bench 2.0 submission
View all activity

Organizations

Perception, Control, and Cognition Lab's profile picture  ML Foundations Development's profile picture Laude Institute's profile picture DCAgent's profile picture Harbor's profile picture Terminal-Bench's profile picture Harbor Framework's profile picture

upvoted a paper 3 months ago

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

Paper • 2601.11868 • Published Jan 17 • 36
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs