Site Reliability Engineer
On-site · Pune, Maharashtra, India
Job Summary
Site Reliability Engineer embedded with product teams to design, code, test, and evolve systems to increase payments reliability and overall efficiency. Drive reliability practices across the software development lifecycle, including architecture reviews, capacity planning, chaos testing, SLOs, and incident retrospectives. Maintain service health with monitoring, alerting, self-healing, and follow-the-sun incident response; promote blameless post-incident reviews and codify learnings. Serve as a technical and cultural leader across the assigned line of business and contribute to resilience and deployment standards. Interface with key clients as needed and support go-forward offers with BizDev and DevOps teams. Requires BS in Computer Science or related field (or equivalent practical experience), 5+ years in production environments, and proficiency in Java/Go/Shell/Python; experience with PostgreSQL/Oracle, Cassandra, Kafka/RabbitMQ; Azure cloud and Linux environments. Willingness to travel up to 15% (domestic or international).
Required Qualifications
- BS degree in Computer Science, related technical field, or equivalent practical experience
- Experience writing code in Java, Go, Shell, Python, or a similar language
- Experience in data structures, database systems algorithms, and software design
- Practical knowledge with RDBs (such as PostgreSQL, Oracle), NoSQL KV stores (such as Cassandra) and messaging systems (such as Kafka, RabbitMQ and MQ)
- Strong experience in Production level, mission critical environments. Azure Cloud experience running Production workloads preferred
- Strong experience working in virtualized Linux operating environments
- 5+ years of real-world experience
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.