Senior Engineer 2: Inference Optimizations
$167,200–$209,000 per year
Remote (US) or Seattle, Washington, United States
Job Summary
As a Senior Engineer 2 at DigitalOcean, you will lead the technical strategy for benchmarking and performance optimization on the AI Inference Optimization team, with a focus on maximizing throughput and minimizing latency for large models. Responsibilities include engineering solutions to performance issues, implementing cutting-edge optimization techniques, and advising on GPU hardware and software integration. Ideal candidates have 5+ years of experience in high-performance computing or AI infrastructure, deep familiarity with the Gen AI landscape, and the leadership skills to guide the team's technical direction.
Required Qualifications
- 5+ years of experience in high-performance computing or AI infrastructure
- Proven track record of solving compute utilization and memory bandwidth bottlenecks
- Experience acting as a technical lead, driving design and delivery through cross-functional alignment and expert-level delegation
Desired Qualifications
- Deep familiarity with the Gen AI landscape (LLM, VLM, LMM)
- Hands-on experience with attention-layer optimizations and parallelization strategies across distributed GPU environments
- Comprehensive understanding of NVIDIA and AMD GPU architectures and software ecosystems (CUDA, ROCm, etc.)
- Extensive experience integrating and contributing to open-source software projects
- Excellent system design skills related to low-level GPU programming
- Experience acting as a technical lead
Additional Requirements
- You may apply to a maximum of 3 positions within any 180-day period.