Engineering Manager, Evals
On-site · New York City, New York, United States or San Francisco, California, United States
Job Summary
Engineering Manager, Evals at Cursor leads a high-impact team responsible for creating high-signal evaluation datasets and developer-friendly tools to write and run evals. You will own the end-to-end evaluation roadmap, guide CursorBench evolution to reflect real developer workflows, and define online quality signals that translate into shipping decisions and model training loops. The role emphasizes leadership, cross-disciplinary alignment across research, product, data, and infrastructure, and hands-on experience with production evaluation systems to measure and improve agent quality.
Required Qualifications
- Led engineering teams shipping production systems
- Strong people leadership and coaching skills
- Ability to align research, product, data, and infrastructure on what 'good' means and turn into durable metrics and processes
- Experience building and operating evaluation or measurement systems (AI evals, experimentation platforms, ranking/relevance, search quality, or reliability instrumentation)
- Strong data acumen and collaboration with data scientists and researchers
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.