Cloud Operations Engineer - Infrastructure
On-site · Irvine, California, United States
Job Summary
Cloud Operations Engineer responsible for designing, building, and maintaining scalable, secure cloud-native infrastructure for large-scale production workloads. Manage multi-account AWS environments with Infrastructure as Code (Terraform), operate production Kubernetes clusters, deploy and support Kubernetes ecosystem components (CRDs, Helm, HPA, Cluster Autoscaler, CoreDNS), and implement GitOps-based deployment workflows (FluxCD or ArgoCD). Improve Istio service mesh capabilities, define reliability practices (SLOs, error budgets, monitoring, alerting, post-mortems), participate in an on-call rotation, and drive automation across provisioning, configuration management, CI/CD, and observability using Terraform, Go, or Python. Collaborate with cross-functional teams to enhance infrastructure reliability, scalability, and operational efficiency.
Required Qualifications
- Bachelor’s degree in Computer Science, Software Engineering, Information Technology, or a related field
- 2+ years of hands-on experience in cloud infrastructure, Kubernetes operations, platform engineering, SRE, or related areas
- Strong knowledge of AWS services (EKS, IAM, VPC, EC2, S3)
- Hands-on experience operating Kubernetes in production environments
- Familiarity with Kubernetes ecosystem tools (CRDs, Helm, Cluster API, HPA, Cluster Autoscaler, CoreDNS)
- Experience with GitOps tools (FluxCD or ArgoCD)
- Solid Linux administration and troubleshooting skills
- Experience with CI/CD and Terraform, Go, Python or similar tools
- Understanding of reliability engineering practices (SLOs, incidents, monitoring, post-mortems)
- Willingness to participate in scheduled on-call rotation
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.