Site Reliability Engineer with Splunk
On-site · Kolkata, West Bengal, India
Job Summary
Seeking a Site Reliability Engineer with Splunk experience to design, implement, and optimize enterprise observability solutions across applications, infrastructure, and cloud environments. You will develop dashboards, alerts, and telemetry frameworks that provide real-time visibility into system health; build automation to remove repetitive tasks; enable runbook automation and self-healing incident workflows; define SLIs/SLOs and alerting strategies to improve reliability; and use telemetry-driven insights to reduce MTTD and MTTR. The role emphasizes integrating observability platforms with CI/CD pipelines, cloud services, and ITSM tools such as ServiceNow; collaborating with engineering and operations teams to establish standards; and proactive monitoring, anomaly detection, and AIOps capabilities. A Bachelor's degree and hands-on experience with Splunk, Dynatrace, Grafana, OpenTelemetry, AWS/GCP, Python, Terraform, and MELT telemetry (metrics, events, logs, traces) are required.
Required Qualifications
- 3+ years of experience in Observability Engineering, Site Reliability Engineering, or related domains
- Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent experience)
- Hands-on experience with observability platforms such as Splunk, Dynatrace, Grafana, and OpenTelemetry
- Proficiency in Python for automation and operational tooling
- Experience implementing metrics, events, logs, and distributed traces (MELT) across distributed systems
- Hands-on experience with Terraform and Infrastructure as Code practices
- Strong understanding of SLIs, SLOs, alerting strategies, and incident response frameworks