Deep Learning Performance Software Engineer
On-site · Shanghai, Shanghai, China
Job Summary
Develop compilers and DSLs for deep learning workloads. Design and implement highly optimized deep learning kernels. Continuously improve the compiler architecture for current and next-generation chips. Perform performance analysis on emerging AI workloads and integrate with AI frameworks.
Required Qualifications
- Master's or Ph.D. degree (or equivalent experience) in CE, CS&E, CS, or AI
- Excellent C/C++ programming and software design skills
- Experience with XLA, TVM, MLIR, LLVM, deep learning models and algorithms
- 3+ years of relevant work experience