Research Engineer / Research Scientist (Pre-training)
On-site · New York City, New York, United States or Toronto, Ontario, Canada
Job Summary
In this role, you’ll push the frontier of visual generative models by working on large-scale pre-training for our text-to-image foundation models, shaping objectives, algorithms, data, and systems, and turning novel ideas into models that power products used by millions of users. You’ll collaborate with a creative and ambitious team of researchers and engineers building the future of the creative economy. The ideal candidate holds a PhD or Master’s in CS and has 5+ years of AI research experience with publications in top venues, strong proficiency in PyTorch or JAX, and demonstrated ability to communicate complex research, develop and optimize ML systems, and contribute across programming languages. Location options include New York, NY (US) and Toronto, ON (Canada), with in-person work arrangements. Knowledge of diffusion/foundation models, and experience with large-scale pre-training are valued, as is a track record of impactful research contributions and the ability to operate in a fast-paced, ownership-driven team.
Required Qualifications
- PhD or Master’s degree in Computer Science or equivalent industry experience
- 5+ years of experience in AI research, including training, fine-tuning, and experimenting with foundation models beyond black-box use
- Track record of first-author publications at top-tier AI conferences (e.g., NeurIPS, ICML, ICLR, CVPR, ECCV, ICCV, ACL, EMNLP)
- Strong proficiency in one or more deep learning frameworks (e.g., JAX, PyTorch)
- Experience communicating complex research to peers
- Solid knowledge of programming languages and experience in developing, debugging, and optimizing beyond ML systems
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.