evals Jobs - June 2026
Search by company, role, stack, location, salary signal, source, and work setup.
-
Tetsuwan Scientific
Software Engineer
Posted 2 weeks, 3 days ago
We work with robots that run biology experiments. The robots are accurate, but programming them by hand takes so long that most labs don't bother. So we wrote a compiler and visual editor a scientist can use. They describe their protocol in plain languag…
Location
San Francisco (SoMa)
Work setup
Full-time · On-site
Compensation
$140K–$180K + equity
Benefits
Equity, $140K–$180K salary
-
kinxshn
Forward Deployed Engineer
Posted 3 weeks ago
Small stealth team in the property management space, building AI agents that run real operations. Agents and humans work as one team. We want a senior backend engineer to own a client-facing use case end to end: build the backend tools and workflows our agen…
Location
Remote
Work setup
Full-time · Fully remote; time zones UTC-3 to UTC+3
Compensation
Competitive compensation
Benefits
Fully remote, Competitive compensation
-
Scribd, Inc.
AI Developer Platform
Posted 3 weeks ago
Scribd, Inc. is on a mission to advance human understanding. Scribd, Slideshare, Everand, and Fable help billions of people move beyond access into insight, application, and expertise. Scribd is hiring a highly technical PM to own its AI Developer Platform, …
Location
US, Canada
Work setup
Full-time · Remote
Compensation
Competitive salary and great benefits.
Benefits
Competitive salary, Great benefits, Real work-life balance
-
Stealth Startup
Founding Engineer
Posted 1 month, 1 week ago
Founding Engineering role with early IC focus (first 3–6 months) to build a multi-modal AI agent product (voice + SMS + email). Responsibilities include building real-time voice systems (streaming audio, low latency, turn-taking), SMS and email automation wit…
Location
NYC (in-person 5x/week)
-
MemberPress
AI Engineer
Posted 1 month, 1 week ago
We’re looking for a sharp AI engineer with real experience shipping production LLM systems. You will own the AI core of a new product: building inference pipelines, writing evals, keeping costs in check, and making the system measurably better over time.
-
AirOps
AirOps hiring
Posted 1 month, 2 weeks ago
AirOps is the runtime marketing teams at Ramp, Chime, Carta, and Rippling use to earn citations in responses from ChatGPT, Claude, and Perplexity. It provides orchestrated LLMs, RAG, JavaScript/Python code steps, evals, and human-in-the-loop systems. Backed b…
Location
SF, NYC, Montevideo