Principal SRE
$180,000–$240,000 year
On-site · Seattle, Washington, United States
Job Summary
Principal Site Reliability Engineer at Gradial responsible for the reliability, scalability, and operational health of Gradial's production platform. You will lead the evolution of Kubernetes, CI/CD, observability, and infrastructure as code across the stack, build tooling and automation to accelerate engineering velocity, and partner with the engineering team to identify and mitigate scaling risks. This hands-on IC leadership role requires 5+ years of SRE/DevOps/infra experience, deep Kubernetes and cloud-native expertise, strong IaC/GitOps/CI/CD capabilities, and a track record of influencing engineering teams toward reliable, scalable systems. The role emphasizes AI-native infrastructure considerations and may include opportunities to work with AI tooling and compute-intensive workloads. The salary range is $180,000–$240,000 annually, with potential bonuses and equity. The company highlights a fast-paced, ownership-driven environment and a focus on meaningful impact.
Required Qualifications
- 5+ years of experience in SRE, DevOps, platform engineering, or infrastructure roles with direct ownership of production systems
- Proven success designing and operating production-grade infrastructure in fast-moving, high-growth environments
- Deep expertise in Kubernetes, cloud-native architecture, and container orchestration
- Strong experience with infrastructure as code, GitOps, CI/CD workflows, and modern deployment practices
- Strong command of observability and reliability fundamentals across metrics, logging, tracing, alerting, and incident response
- A track record of leading through influence, making sound technical decisions, and raising the bar across engineering teams
- Nice to Have familiarity with AI or ML infrastructure, including GPU provisioning, model deployment, or compute-intensive workloads
- Experience supporting cloud or multi-cloud environments with a focus on resilience and scale
- Comfort with TypeScript or Python for internal tooling and operational automation
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.