Thomas Broadley
About me
I'm a Canadian living in Berkeley, California.
I work as a Member of Technical Staff at METR. METR's mission is to assess the capabilities of, and the risk of catastrophe from, frontier AI systems.
I'm a co-author of three of METR's recent papers:
- "Measuring AI Ability to Complete Long Tasks", in which we show that the length of software tasks AI systems can complete has been doubling roughly every seven months
- "HCAST: Human-Calibrated Autonomy Software Tasks", a software development benchmark
- "RE-Bench: Evaluating frontier AI R&D capabilities of language model agents against human experts", a machine learning research engineering benchmark
Before METR, I spent 4.5 years as a full-stack web developer at Faire, an online wholesale marketplace. I've also interned at Datadog, Zeitspace, Shopify Plus, and Boltmade (since acquired by Shopify).
I have a Bachelor of Computer Science from the University of Waterloo. In September 2020, I attended a one-week mini-batch at the Recurse Center.
Contact me
Email: htbroadley@outlook.com