Staff Site Reliability Engineer
On-site · Bengaluru, Karnataka, India
Job Summary
Staff Site Reliability Engineer to design, build, and operate highly scalable, reliable, and secure infrastructure powering production systems across AWS and GCP. Lead major reliability and modernization initiatives including container platform migrations (ECS to EKS/GKE) and microservice enablement across multi-cloud environments. Serve as a technical authority in Kubernetes (EKS and GKE), cloud infrastructure (AWS and GCP), and modern CI/CD practices (GitOps, automation pipelines). Partner with development teams to architect and enable microservice-based applications for production readiness, scalability, and observability. Implement and manage infrastructure as code (Terraform, Ansible) across multiple cloud providers. Drive observability, performance, and cost efficiency through monitoring, logging, and alerting. Champion SRE best practices—defining SLOs/SLIs, conducting blameless postmortems, and improving incident response. Lead complex technical projects from conception to completion, mentor engineers, and collaborate with security/compliance partners. Participate in on-call rotations and advocate for reliability and operational excellence.
Required Qualifications
- Bachelor's degree in Computer Science or equivalent hands-on experience
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.