Site Reliability Engineer (Python)
On-site · Kuala Lumpur, Kuala Lumpur, Malaysia
Job Summary
Site Reliability Engineer to design, implement, and maintain scalable, secure, and reliable Python-based services and microservices; build and maintain monitoring, observability, and automation tooling; participate in incident management with runbooks and post-incident reviews; collaborate with Python development squads to ensure reliability across the software lifecycle; leverage Python-based tooling, scripting, and automation to improve performance and CI/CD pipelines.
Required Qualifications
- Bachelor's degree in Computer Science, Engineering, or equivalent practical experience
- Minimum 2 years in SRE, DevOps, or similar roles
- Strong Python proficiency — including async frameworks (asyncio, FastAPI), ORM frameworks (Django), testing (pytest), packaging (Poetry/pip), and scripting
- Experience with cloud platforms (AWS, GCP, or Azure) and container orchestration (Kubernetes, Docker)
- Familiarity with Infrastructure-as-Code tools such as Terraform or CloudFormation
- Strong problem-solving skills and ability to work effectively under pressure
- Excellent communication and collaboration skills for cross-functional teamwork
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.