Senior Machine Learning Engineer - Platform
Remote
Job Summary
Zendesk is hiring a Senior Machine Learning Engineer - Platform to build the production AI platform that enables agentic services at scale. You will lead the design of the shared platform services that underpin agentic workflows, including runtime orchestration, workflow execution, state management, retries, and fault recovery. You will define and evolve platform APIs and internal tooling, partnering with product, AI, and infrastructure teams to keep the platform usable, secure, and consistent. You will design platform-level infrastructure for memory and context across concurrent sessions, with emphasis on consistency, isolation, performance, and cost. You will contribute to reliability, observability, and evaluation systems that help teams understand system behavior, detect regressions, and enforce quality gates in CI/CD. You will also help design the governance, guardrails, and policy enforcement needed to operate agentic services safely in enterprise environments.

The role requires 5+ years of backend experience (Java, Go, or Python), along with experience in agent architectures (planning, tool dispatch, memory, error recovery), API design, security and sandboxing for tool execution, and building foundational systems used by multiple teams.

Zendesk emphasizes a hybrid working model with global offices and remote options, and explicitly notes that AI or automated decision systems may be used to screen applications. The posting highlights a commitment to fairness, inclusion, and accessibility, with AA/EEO/Veterans/Disabled employment considerations and accommodation options for applicants with disabilities.