The Allen Institute logo
The Allen Institute1 week ago

Senior Research Engineer, Olmo + Molmo

$146,880–$220,320 year

On-site · Seattle, Washington, United States

Type
Full Time
Level
Senior Level
Education
Masters Degree
Company size
Unknown
Industry
Artificial Intelligence

Job Summary

Senior Research Engineer responsible for end-to-end delivery of training and infrastructure for Ai2’s flagship open models (e.g., Olmo, Molmo). Build and optimize LLM, multimodal, and agentic research pipelines, including data curation, preprocessing, training, and inference; design multimodal (vision + language) models and agentic workflows; scope and lead high-impact research projects; apply strong software engineering practices to bridge research and production; contribute to open-source model releases, datasets, public APIs, and technical reports. Requires 4+ years in ML infrastructure, end-to-end model development experience, familiarity with MoEs and long-context LLMs, Python proficiency with PyTorch/JAX/TensorFlow, cloud/container skills, and BS/MSc in a quantitative field (or equivalent) with 2+ years software development experience. This position is based in Seattle with on-site work, and Ai2 invites applicants from outside the United States.

Required Qualifications

  • BS or MSc in Computer Science, Statistics, Engineering, Applied Mathematics, or a related quantitative field (or equivalent experience)
  • A minimum of 2 years of software development experience
  • 4+ years of ML infrastructure experience — data preprocessing, model training, evaluation, inference, and deployment
  • Experience with end-to-end model development — dataset construction, training, fine-tuning, evaluation, profiling, and monitoring
  • Familiarity with modern model architectures — including LLMs (MoEs, long-context models), vision-language models (e.g., Molmo, LLaVA), and experience training and evaluating both
  • Agentic systems knowledge — tools, memory, and long-running workflows
  • Strong software engineering fundamentals — performant, scalable systems and confident debugging
  • Proficiency in Python and a major ML framework (PyTorch, JAX, or TensorFlow)
  • Familiarity with cloud and containerization (e.g., GCP, AWS, Docker)
  • BS or MSc in Computer Science, Statistics, Engineering, Applied Mathematics, or a related quantitative field (or equivalent experience)
Sorce

Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.

Hiring someone like this?

Get your role in front of qualified candidates on Sorce.

Get started

$147k – $220k / yr

Senior Research Engineer, Olmo + Molmo · The Allen Institute

Apply on Sorce