Site Reliability Engineer
On-site · Berlin, State of Berlin, Germany
Job Summary
Site Reliability Engineer role with Sony Interactive Entertainment focusing on reliability and scalability of cloud gaming services. Lead team discussions on improvements, create High Level Designs for new products, mentor junior SRE staff, lead incident response and post-mortem activities, collaborate with cross-functional teams to address technical debt and toil, contribute to code to improve reliability, and implement automation to reduce ongoing toil. Requires 5+ years in software development or Linux systems administration, strong communication, on-call availability, and expertise across Linux production systems, programming languages (Python, Bash, Go, Java, C++, Rust), distributed storage, NoSQL, data aggregation (ElasticSearch, Kafka), RDBMS HA, monitoring/alerting (Prometheus, Grafana), Kubernetes and AWS, packaging, configuration management, and performance analysis. Note: background checks at offer stage and privacy notice are mentioned.
Required Qualifications
- Minimum of 5+ years working experience in Software Development and/or Linux Systems Administration role.
- Strong interpersonal, written and verbal communication skills.
- Available to be scheduled in on-call rotation.
- Proficient as a Linux Production Systems Engineer, with experience managing large scale Web Services infrastructure.
- Development experience in one or more of the following programming languages: Python (preferred), Bash, Go, Java, C++, or Rust
- Experience with distributed data storage at scale (Hadoop, Ceph)
- NoSQL at scale (MongoDB, Redis, Cassandra)
- Data Aggregation technologies (ElasticSearch, Kafka)
- Scaling and running traditional RDBMS (PostgreSQL, MySQL) with High Availability
- Monitoring & Alerting (Prometheus, Grafana), and Incident Management toolsets
- Kubernetes and/or AWS (deployment and management)
- Software Distribution (Package management and distribution at scale)
- Configuration Management (ansible, saltstack, puppet, chef)
- S/W Performance analysis and load testing (QA or SDET experience: a plus)
- Background checks may be conducted at offer stage and privacy notice referenced
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.