Senior Site Reliability Engineer
$100,000–$125,000 per year
On-site · Bengaluru, Karnataka, India
Job Summary
Seeking a Senior Site Reliability Engineer to lead SRE initiatives, mentor engineers, and architect solutions for system resilience and operational excellence. Responsibilities include defining SRE best practices, managing incidents, and enhancing disaster recovery strategies. Ideal candidates have over 5 years of SRE experience, expertise in cloud platforms, container orchestration, and monitoring tools, along with strong coding skills in Python or Go.
Required Qualifications
- Bachelor's or Master's degree in Computer Science or a related field.
- 5+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering.
- Strong experience with cloud platforms (AWS, GCP, or Azure) and cloud-native technologies.
- Expertise in Kubernetes and container orchestration.
- Expertise with log management tools like ELK or Graylog.
- Strong coding/scripting skills in Python, Go, or Bash for automation.
- Deep understanding of networking, DNS, CDN, load balancing, and security.
- Proven experience with observability tools (Prometheus, Grafana, ELK, OpenTelemetry).
- Hands-on experience in performance tuning, high availability, and DR strategies.
- Strong knowledge of incident management frameworks and reliability metrics (SLOs, SLIs, SLAs).
- Experience leading cross-functional reliability initiatives.
Desired Qualifications
- Experience in a fast-paced, agile development environment.
- SRE certifications from Datadog or Google.
- Experience with Chaos Engineering.
- Exposure to AIOps and ML-based observability.
- Experience in leading SRE transformations at scale.
- Experience in multi-timezone support.
- Experience building B2B products from scratch.