Senior Product Manager, Compute Platform
$280,540–$330,950 per year
On-site · San Mateo, California, United States
Job Summary
The Senior Product Manager, Compute Platform at Roblox defines the strategy and roadmap for next-generation AI infrastructure, spanning GPU/CPU fleets across on-prem, edge data centers, and public cloud. The role owns products that turn raw GPU hosts into reliable, production-ready AI compute (drivers, firmware management, fleet health and performance abstractions) and collaborates across Roblox teams to enable compute primitives. Key duties include driving strategy for Compute Platform (Managed Kubernetes, Managed Compute Services, and fleet APIs), scaling GPU infrastructure for training and inference, ensuring platform reliability, and partnering with multiple platform teams so that users can build on Compute Platform primitives. Required expertise covers Kubernetes internals, GPU/accelerator scheduling, cloud-native networking, and experience building scalable compute infrastructure with a focus on efficiency and developer experience. A strong builder mindset, cross-functional leadership, and experience across AWS/GCP/Azure are highly valued. The role may require on-site work in San Mateo, CA; compensation falls within the stated salary range and includes equity eligibility.
Required Qualifications
- 7+ years of product management experience focused on Compute infrastructure or distributed systems at scale
- Deep practical understanding of Kubernetes internals and control plane components
- Experience productizing Kubernetes Operators, Controllers, and CRDs
- Deep familiarity with GPU/accelerator architecture and scheduling challenges
- Familiarity with cloud-native service networking
- Track record of building production-grade compute platforms with emphasis on efficiency and reliability
- Ability to balance the needs of multiple users and stakeholders, weighing tradeoffs among utilization, latency, cost, and time-to-ship
- Builder mindset, including prototyping and leveraging AI for ideation
- Experience building Compute infrastructure on AWS, GCP, or Azure
- Background in AI model development, training, and inference (preferred)
- Kernel-level experience or familiarity with custom kernel drivers (preferred)
- Experience building agentic systems for Compute or infrastructure (preferred)
- For US-based roles: H-1B visa sponsorship may be subject to limitations
- Annual Salary Range: $280,540–$330,950 USD