Senior Research Engineer, Olmo + Molmo
$146,880–$220,320 per year
On-site · Seattle, Washington, United States
Job Summary
Senior Research Engineer responsible for end-to-end delivery of training and infrastructure for Ai2’s flagship open models (e.g., Olmo, Molmo). Build and optimize LLM, multimodal, and agentic research pipelines, including data curation, preprocessing, training, and inference; design multimodal (vision + language) models and agentic workflows; scope and lead high-impact research projects; apply strong software engineering practices to bridge research and production; contribute to open-source model releases, datasets, public APIs, and technical reports. Requires 4+ years in ML infrastructure, end-to-end model development experience, familiarity with MoEs and long-context LLMs, Python proficiency with PyTorch/JAX/TensorFlow, cloud/container skills, and BS/MSc in a quantitative field (or equivalent) with 2+ years software development experience. This position is based in Seattle with on-site work, and Ai2 invites applicants from outside the United States.
Required Qualifications
- BS or MSc in Computer Science, Statistics, Engineering, Applied Mathematics, or a related quantitative field (or equivalent experience)
- A minimum of 2 years of software development experience
- 4+ years of ML infrastructure experience — data preprocessing, model training, evaluation, inference, and deployment
- Experience with end-to-end model development — dataset construction, training, fine-tuning, evaluation, profiling, and monitoring
- Familiarity with modern model architectures, including LLMs (MoEs, long-context models) and vision-language models (e.g., Molmo, LLaVA), plus experience training and evaluating both
- Agentic systems knowledge — tools, memory, and long-running workflows
- Strong software engineering fundamentals — performant, scalable systems and confident debugging
- Proficiency in Python and a major ML framework (PyTorch, JAX, or TensorFlow)
- Familiarity with cloud and containerization (e.g., GCP, AWS, Docker)