SRE Leader
On-site · Kuala Lumpur, Kuala Lumpur, Malaysia
Job Summary
The Bybit SRE Leader role focuses on building and operating a reliable, cost-efficient, automated, and scalable SRE organization. Responsibilities include establishing company-wide SLO/SLA systems, implementing MTTD/MTTR measurement and fault self-healing capabilities, fostering chaos engineering practices, standardizing change risk controls, and driving a data-driven FinOps cost governance model. Key efforts cover operations and maintenance automation, IaC-driven infrastructure, cross-region and multi-cloud considerations, and security and compliance that meet financial-grade requirements. The role also leads a team of senior SREs to reduce toil and improve incident response. Candidates must bring extensive hands-on experience with large-scale infrastructure, cost management, and automation tooling in Go/Python, along with experience with cloud platforms (AWS preferred) and multi-account/VPC architectures. Benefits emphasize growth, global collaboration, and internal mobility.
Required Qualifications
- More than 10 years of experience in infrastructure/operations/SRE
- More than 5 years of experience leading an SRE/infrastructure team of more than 10 people
- Deep understanding of SRE methodology: SLO/SLI/Error Budget, Toil Management, Capacity Planning, Incident Management
- Hands-on experience managing costs at scale in environments with high cloud spend (e.g., more than $5 million per year)
- Hands-on experience applying FinOps practices systematically
- Capacity modeling based on business metrics
- Proficient in IaC tools (Terraform/Pulumi/CloudFormation) and large-scale implementations
- Experience automating operations and maintenance
- Experience in the cryptocurrency exchange, securities, or payments industry is a plus
- Programming ability in Go/Python for building automation tools