AI DevOps Engineer (MLOps & Cloud)
On-site · Petaling Jaya, Selangor, Malaysia
Job Summary
Design, implement, and maintain scalable, secure cloud infrastructure for AI/ML solutions. Build and manage Infrastructure as Code (IaC) using Terraform or CloudFormation. Develop and maintain CI/CD pipelines for AI applications and model deployment. Support end-to-end MLOps lifecycle (training, versioning, deployment, monitoring). Automate deployments using blue/green, canary releases, and rollback strategies. Configure monitoring, alerting, and observability for production systems. Optimize performance, scalability, and cost (compute, storage, GPU usage). Use AI tools to accelerate scripting, automation, and incident resolution. Troubleshoot production issues and ensure high system reliability and availability. Support developers and data scientists on DevOps tooling and best practices. Implement DevSecOps standards, including security, secrets management, and compliance. Maintain clear technical documentation for infrastructure and processes.
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.