Site Reliability Engineer (AWS & Kubernetes)
On-site · Bengaluru, Karnataka, India or Gurugram, Haryana, India
Job Summary
Site Reliability Engineer to support the reliability, monitoring and operational excellence of cloud-native platforms. You’ll work with AWS and Kubernetes (EKS), contribute to monitoring/observability with Grafana and Prometheus, assist in incident management and root-cause analysis, participate in on-call rotations, and implement infrastructure changes using Terraform and GitOps workflows. You’ll support CI/CD pipelines (GitLab, Argo CD), help improve system performance and stability, and ensure alignment with security, compliance and operational standards; collaborate with DevOps and engineering teams to deliver observable, scalable and resilient services. The role is at associate level with 45 hours weekly, based in Bengaluru, Chennai, or Gurugram, India.
Required Qualifications
- Experience with AWS and Kubernetes (EKS) in production or pre-production.
- Familiarity with monitoring and observability tools such as Grafana and Prometheus.
- Understanding of CI/CD pipelines and Git-based workflows (GitLab preferred).
- Exposure to Terraform or infrastructure-as-code concepts.
- Basic understanding of SRE practices and production support models.
- Experience troubleshooting applications or infrastructure issues.
- Awareness of networking and security fundamentals in cloud environments.
- Willingness to participate in on-call rotations and incident response.
- Strong problem-solving mindset and eagerness to learn.
- Good communication and collaboration skills.
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.