Senior Site Reliability Engineer
Remote · Spain or EU
Job Summary
Senior Site Reliability Engineer responsible for owning incident management end-to-end, maintaining and evolving observability stacks, keeping production applications running through on-call rotations, developing automations for reliability, and collaborating with product engineers to promote SRE practices. Must have solid programming experience (Python with Django/AsyncIO and/or Java with Spring Boot), experience with observability tools (LGTM - Loki, Grafana, Tempo, Mimir), production Python services, strong AWS and Kubernetes skills, relational databases (PostgreSQL) and messaging systems (RabbitMQ, NATS, Kafka); experience as an on-call SRE; hands-on troubleshooting of distributed systems; ownership mindset; strong communication and knowledge sharing. Remote-first with hybrid options from Kyiv, Warsaw, and Lisbon; globally distributed team with EU time zone alignment. Proficiency in English required.
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.