HomeJobsFull Time

Benchmark Testing and Analysis Lead

ARC Prize Foundation LogoARC Prize Foundation


Date Posted

29 April, 2026

Work Location

United States

Salary Offered

$150,000 — $250,000 yearly

Job Type

Full Time

Experience Required

3+ years

Remote Work

Allowed

Stock Options

No

Vacancies

1 available


A technical researcher to own how we evaluate frontier models on the ARC-AGI benchmarks. This person will run new models end-to-end, mine the data exhaust from every run, and translate what we learn into reports and public communication that shape the conversation on where model capability is heading. This is a remote, full-time role.

What You'll Do:

  • Own our model benchmarking and testing process, and run new frontier models against ARC-AGI-1, ARC-AGI-2, and ARC-AGI-3 as they ship
  • Build and own the ARC Prize Analysis Package - a repeatable report produced for every new frontier model, turning raw logs into insight on capability, failure modes, and gaps
  • Own the official and community leaderboards end-to-end - from scoring pipeline to public page
  • Serve as primary contact for new labs testing on ARC-AGI, and communicate findings externally via Twitter, newsletter, and policy and partner briefings

What We're Looking For:

  • Research background with hands-on model evaluation experience - you've run evals before and know how to read the results (model training experience not required)
  • Deep understanding of how modern models work and fail, and comfortable building your own tooling and analysis to answer the questions you care about
  • Strong ownership instinct and clear technical communicator

Example outputs this role would produce: a model score announcement and a model analysis blog post.

About ARC Prize Foundation

ARC Prize Foundation Logo

AI benchmarks that measure general intelligence and inspire new ideas

Company Size: 1 - 5 People
Year Founded: 2024
Country: United States

BEAMSTART

BEAMSTART is a global entrepreneurship community, serving as a catalyst for innovation and collaboration. With a mission to empower entrepreneurs, we offer exclusive deals with savings totaling over $1,000,000, curated news, events, and a vast investor database. Through our portal, we aim to foster a supportive ecosystem where like-minded individuals can connect and create opportunities for growth and success.

© Copyright 2026 BEAMSTART. All Rights Reserved.