OPEN ROLE

Staff Data Scientist: AI Evaluation & Context Systems

"Lead architect for Evaluation & Context Systems. Define the mathematical standard of 'Truth' for autonomous commerce agents."

Role Overview

Location: HQ - Los Angeles

Ownership: All evaluation architecture, GraphRAG systems fusing Knowledge Graph with vector search, and Risk-Adjusted Value per Token optimization.

Duration: Ongoing

Compensation: $250,000 - $380,000 Base + PIUs

Deliverable: Architect the Proprietary Evaluation Harness (Golden Sets, adversarial loops), Neuro-Symbolic Retrieval System (GraphRAG), and Unit Economics of Intelligence modeling.

Outcome: Build the "Judge" that measures epistemic validity of AI agents and ensures they adjudicate reality rather than hallucinate it.

👉 READ: Guide to Working at Demand.io | What You're Actually Signing Up For

Note: Please read this. If it scares you, we just saved you 15 minutes. If it excites you, you're in the right place.

Complete a brief video questionnaire to apply for this role. It takes 12 minutes to calibrate initial compatibility. Your only required preparation is reading the Guide linked above. Beyond that, just bring clarity of mind and a readiness to share who you are.

We recommend using the Chrome browser on any laptop or smartphone (if you are using an iPhone, please use Safari). For best results, ensure you are in a quiet room with a stable internet connection. You’ll get 30 seconds to read the questions and think through your response before each recording begins. We look forward to hearing what you share.