Site Reliability Engineer
On-site · Singapore, Singapore
Job Summary
Grasshopper is hiring a Site Reliability Engineer for its Infrastructure Team in Singapore.
Responsibilities: Design, implement, and maintain robust observability systems (monitoring, logging, tracing, alerting) to ensure high availability and rapid incident detection. Architect and maintain scalable solutions on Google Cloud and on-premise infrastructure. Advance the capabilities of the research platform. Investigate production issues. Collaborate with developers to improve CI/CD and toolchains. Drive an SRE mindset within the organization.
Required: 3–5 years of hands-on experience in Platform, SRE, or Infrastructure Engineering; experience with Linux, networking, and distributed systems; experience running Kubernetes in production; GitOps with Argo CD; cloud experience (GCP or AWS); programming in Python or Go; strong collaboration and communication skills; a self-motivated and adaptable working style.
Beneficial: experience with Kubernetes operators, Terraform, on-premise bare metal, Puppet/Chef/Ansible, Argo CD/Argo Workflows, Prometheus/OpenTelemetry, Red Hat/CentOS, and large-scale compute workloads.
What we offer: 21 days of annual leave, comprehensive insurance, a wellness budget, gym access, and more.
Grasshopper is an equal opportunity employer.