HCAST: Human-Calibrated Autonomy Software Tasks | grantmaking.ai

EquiStamp
HCAST: Human-Calibrated Autonomy Software Tasks

active

A benchmark of 189 machine-learning, cybersecurity, software engineering, and reasoning tasks with over 1,500 hours of human baselines, used to evaluate autonomous AI agents. EquiStamp-affiliated researchers contributed to it alongside METR.

Endorsements support EquiStamp.

People

Updated 05/18/26 · By grantmaking.ai

Co-author

Daniel O'Connell

Co-author

Grants Received: no grants recorded
