Principal AI Engineer - Agent Ops / SRE
$168,400–$220,000 year
Hybrid · Chicago, Illinois, United States or Charlotte, North Carolina, United States
Job Summary
Principal AI Engineer - Agent Ops/SRE responsible for leading the AI delivery lifecycle with emphasis on automation, scalable distributed systems, and reliability. Responsibilities include liaising between AI COE and Platform/SRE teams, designing and maintaining scalable AI systems with performance, latency, and observability targets, implementing CICD and IAC modules, collaborating on governance and deployment tooling, building deployment templates and runbooks, and ensuring production systems meet enterprise SLOs. Requires senior-level expertise in DevOps/MLOps, agent-based AI solutions, and experience with Terraform, GITOps, observability tools, and multi-cloud environments. Hybrid or Remote work arrangement with on-site presence 3 days a week for offices in Hartford, CT; Columbus, OH; Charlotte, NC; Chicago, IL; US work authorization without sponsorship.
Required Qualifications
- Bachelor's degree in Computer Science, Computer Engineering, or a technical field.
- 10+ years building and shipping software and/or platform solutions for enterprises.
- Programming experience with Python is required.
- 3+ years of experience with IAC (Terraform).
- 5+ years of experience owning production CICD, GitOps and release management gating.
- 3+ years of experience in implementing observability, performance & reliability solutions: SLO, P99-95 latency, alert tuning, & dashboards.
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.