Inference Technical Lead, Sora
Hybrid · San Francisco, California, United States
Job Summary
The Inference Technical Lead role at OpenAI focuses on improving model serving efficiency for the Sora team. Responsibilities include engineering efforts to enhance model serving and inference performance, driving optimizations for system throughput and reliability, partnering with research and product teams, and designing critical serving infrastructure. The ideal candidate will bring expertise in model performance optimization and a strong foundation in data movement and kernel-level systems, contributing to the performance of multimodal AI systems.
Desired Qualifications
- Deep expertise in model performance optimization, particularly at the inference layer
- Strong background in kernel-level systems, data movement, and low-level performance tuning
- Excitement about scaling high-performing AI systems that serve real-world, multimodal workloads
- Ability to navigate ambiguity, set technical direction, and drive complex initiatives to completion
Additional Requirements
- Background checks for applicants will be administered in accordance with applicable law
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.