Applied AI Scientist, Clinical AI Agents
$140,000–$300,000 per year
Remote · United States or New York City, New York, United States
Job Summary
As an Applied AI Scientist/Engineer, you will build, evaluate, and productionize clinical AI agents and supervised ML models within healthcare workflows. You will design and implement agentic AI systems that read complex medical records, apply clinical and coding context, call tools, produce auditable outputs, and drive improvements from real-world data and failures.
You will own the work end to end, from problem framing through agent design, evaluation, deployment, trace analysis, and ongoing improvement, working at the intersection of software engineering, LLM systems, evaluation, and healthcare workflow understanding. You will develop long-horizon agent behavior across context construction, retrieval, tool use, memory, routing, verification, escalation, and human-in-the-loop review. You will also define success criteria, build rigorous evaluation loops using production logs and benchmarks, and collaborate with research and ML engineering teams on model selection, fine-tuning, reward modeling, distillation, and internal AI infrastructure.
The role emphasizes production impact, auditable outputs, and measurable improvements in clinical and operational settings. Levels range from L2 to Staff, with a base salary range of $140,000–$300,000 per year.
Required Qualifications
- 4+ years of software/ML engineering experience
- Proficient in Python
- Experience turning real-world workflows into structured AI problems
- Experience building evaluation systems and benchmarks for AI systems
- Experience deploying AI systems into production
- Familiarity with LLMs, agents, RAG, tool calling, prompting, and evaluation
- Healthcare domain experience is a plus
Desired Qualifications
- Experience turning messy real-world workflows into structured AI problems
- Experience turning model potential into reliable, correct, safe systems
- Experience building production AI systems
- Experience with evaluation systems, benchmarks, or regression tests for AI systems
- Experience with LLMs, agents, RAG, tool calling, prompting, or evaluation
- Healthcare domain experience or familiarity with clinical workflows is a plus