Senior Service Reliability Engineer
On-site · Berlin, State of Berlin, Germany
Job Summary
Senior SRE role focused on leadership in reliability and scalability, defining KPIs and processes, influencing architecture, mentoring junior staff, and driving platform-wide solutions with strong Linux and distributed systems expertise. Requires deep experience across large-scale Web services, on-call readiness, and proficiency with modern tech stacks (Python, Kubernetes, AWS, NoSQL/SQL databases, monitoring, and incident management).
Required Qualifications
- Minimum of 7+ years working experience in Software Development and/or Linux Systems Administration role.
- Strong interpersonal, written and verbal communication skills.
- Available to be scheduled in on-call rotation.
- Proficient as a Linux Production Systems Engineer, with experience managing large scale Web Services infrastructure.
- Development experience in one or more of the following programming languages: Python (preferred), Bash, Go, Java, C++, or Rust
- Experience with at least 3 of the following topics: Distributed data storage at scale (Hadoop, Ceph); NoSQL at scale (MongoDB, Redis, Cassandra); Data aggregation technologies (ElasticSearch, Kafka); Scaling and running traditional RDBMS (PostgreSQL, MySQL) with High Availability; Monitoring & alerting (Prometheus, Grafana), and Incident Management toolsets; Kubernetes and/or AWS (deployment and management); Software distribution (Package management and distribution at scale); Configuration management (ansible, saltstack, puppet, chef); Software performance analysis and load testing (QA or SDET experience: a plus)
- Note about background checks and equal opportunity statements (not qualifications)
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.