Eval Harness Development Jobs
Browse 16 Eval Harness Development jobs on FDE Jobs.
16 jobs
MAApplied AI, Evaluation Engineer
Mistral AI·Paris, Paris, France (On-site)
Mistral AI
Paris, Paris, France (On-site)
4h ago
HASoftware Engineer, Agents
Harvey·Worldwide (Remote)
Harvey
Worldwide (Remote)
$161.3K – $241.9K Yearly5h ago
CAResearcher, Evals
Cartesia·San Francisco, California, United States (On-site)
Cartesia
San Francisco, California, United States (On-site)
$220K – $350K Yearly5h ago
HASoftware Engineer, Agents
Harvey·Worldwide (Remote)
Harvey
Worldwide (Remote)
$161.3K – $241.9K Yearly5h ago
HAStaff Software Engineer, Agents
Harvey·Worldwide (Remote)
Harvey
Worldwide (Remote)
$231K – $340K Yearly5h ago
OPResearch Engineer, Frontier Evals & Environments
OpenAI·San Francisco, California, United States (On-site)
OpenAI
San Francisco, California, United States (On-site)
$205K – $380K Yearly5h ago
HAStaff Software Engineer, Agents
Harvey·Worldwide (Remote)
Harvey
Worldwide (Remote)
$231K – $340K Yearly5h ago
HASenior Software Engineer, Agents
Harvey·United States (Remote)
Harvey
United States (Remote)
$193.4K – $290K Yearly5h ago
HAMid/Senior/Staff Software Engineer, Agents
Harvey·Worldwide (Remote)
Harvey
Worldwide (Remote)
$193.4K – $290K Yearly5h ago
ANSoftware Engineer, Safeguards Evals
Anthropic·San Francisco, California, United States (Hybrid)
Anthropic
San Francisco, California, United States (Hybrid)
$320K – $485K Yearly13h ago
OPResearcher: Agent Post-Training, API & Power-Users
OpenAI·California, United States (Remote)
OpenAI
California, United States (Remote)
$295K – $445K Yearly5h ago
ANEngineering Manager, Agent Prompts & Evals
Anthropic·San Francisco, California, United States (Hybrid)
Anthropic
San Francisco, California, United States (Hybrid)
$320K – $405K Yearly4h ago
COProduct Manager, Agent Harness & Modelling
Cohere·Canada + 4 more (Remote)
Cohere
Canada + 4 more (Remote)
13h ago
ITAI QA Trainer – LLM Evaluation
Invisible Technologies·Worldwide (Remote)
Invisible Technologies
Worldwide (Remote)
$6 – $65 Hourly4h ago
MAModel Behavior Architect- Function Calling
Mistral AI·London, England, United Kingdom (Hybrid)
Mistral AI
London, England, United Kingdom (Hybrid)
5h ago
OPResearcher, Artifacts - Agent Post-Training
OpenAI·California, United States (Remote)
OpenAI
California, United States (Remote)
$250K – $380K Yearly5h ago