Staff Software Engineer, ML Acceleration
Remote · United States or Pittsburgh, Pennsylvania, United States
Job Summary
Staff Software Engineer, ML Acceleration role focuses on accelerating ML iterations for Stack AV by profiling, optimizing, and fine-tuning ML models; integrate OSS tooling to enable self-sufficient profiling; streamline model deployment across diverse hardware; collaborate with ML researchers to balance accuracy and speed; implement optimizations using CUDA, Triton, and custom kernels; promote engineering excellence within the team.
Required Qualifications
- Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field
- 5+ years of experience with GPU programming and optimization
- Strong programming skills in C++ and Python
- Proven experience in GPU programming and optimization
- Familiarity with deep learning frameworks, especially PyTorch
- CUDA programming
- Triton language for GPU kernels
- PyTorch optimization techniques
- TensorRT implementation
- ONNX model conversion and deployment
- Custom GPU kernel development
- Deep understanding of GPU architectures and performance optimization
- Problem-Solving: Strong analytical and problem-solving skills
- Communication: Excellent verbal and written communication skills
- Autonomous vehicles (AV) experience is a bonus
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.