Gradial · 2 months ago

Principal SRE

$180,000–$240,000 per year

On-site · Seattle, Washington, United States

Type: Full Time
Level: Senior Level
Education: Not Specified
Company size: Unknown

Job Summary

The Principal Site Reliability Engineer at Gradial is responsible for the reliability, scalability, and operational health of Gradial's production platform. You will lead the evolution of Kubernetes, CI/CD, observability, and infrastructure as code across the stack, build tooling and automation to accelerate engineering velocity, and partner with the engineering team to identify and mitigate scaling risks. This hands-on individual contributor (IC) leadership role requires 5+ years of SRE, DevOps, or infrastructure experience, deep Kubernetes and cloud-native expertise, strong IaC, GitOps, and CI/CD capabilities, and a track record of influencing engineering teams toward reliable, scalable systems. The role emphasizes AI-native infrastructure considerations and may include opportunities to work with AI tooling and compute-intensive workloads. The salary range is $180,000–$240,000 annually, with potential bonuses and equity. The company highlights a fast-paced, ownership-driven environment and a focus on meaningful impact.

Required Qualifications

  • 5+ years of experience in SRE, DevOps, platform engineering, or infrastructure roles with direct ownership of production systems
  • Proven success designing and operating production-grade infrastructure in fast-moving, high-growth environments
  • Deep expertise in Kubernetes, cloud-native architecture, and container orchestration
  • Strong experience with infrastructure as code, GitOps, CI/CD workflows, and modern deployment practices
  • Strong command of observability and reliability fundamentals across metrics, logging, tracing, alerting, and incident response
  • A track record of leading through influence, making sound technical decisions, and raising the bar across engineering teams

Nice to Have

  • Familiarity with AI or ML infrastructure, including GPU provisioning, model deployment, or compute-intensive workloads
  • Experience supporting cloud or multi-cloud environments with a focus on resilience and scale
  • Comfort with TypeScript or Python for internal tooling and operational automation