Placeholder Company for Greater China logo
Placeholder Company for Greater China2 months ago

Research Internship – Reinforcement Learning for Large Foundation Models

$80,168–$124,800 year

On-site · Bellevue, Washington, United States

Type
Internship
Level
Entry Level
Education
Doctorate Or Professional Degree
Company size
Startup

Job Summary

Research Internship in Reinforcement Learning for Large Foundation Models: Develop stable and efficient RL algorithms for large foundation models, conduct large-scale RL experiments in complex reasoning and autonomous agent scenarios, and explore RL algorithms, reward modeling, and world models. Collaborate with Tencent AI Lab researchers to deliver impactful algorithms for real-world applications and produce influential publications. Ideal candidates hold a Ph.D. in Computer Science, Machine Learning, Artificial Intelligence, or related fields from a top university, with research experience in NLP or ML, proficiency in Python and PyTorch, and a strong publication track record, plus excellent communication and teamwork for cross-functional collaboration. Internship lasts 3 months with possible extension; can start anytime in 2026; base pay range between $80,168.40 and $124,800 per year; paid sick leave, holidays, and company-sponsored medical plan eligibility for full-time interns.

Required Qualifications

  • Ph.D. in Computer Science, Machine Learning, Artificial Intelligence, or related fields from a top university
  • Research experiences in natural language processing or machine learning
  • Proficiency in Python programming
  • Experience developing with deep learning frameworks such as PyTorch
  • Good publication track record and history of creativity and intellectual flexibility
  • Excellent communication and teamwork skills; capable of collaborating with cross-functional teams to drive project success and innovation

Desired Qualifications

  • Experience in natural language processing or machine learning research (preferred)
  • Good publication track record (preferred)
  • Proficiency in Python programming (preferred)
  • Experience developing with deep learning frameworks such as PyTorch (preferred)
  • Excellent communication and teamwork skills (preferred)
Sorce

Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.

Hiring someone like this?

Get your role in front of qualified candidates on Sorce.

Get started

$80k – $125k / yr

Research Internship – Reinforcement Learning for Large Foundation Models · Placeholder Company for Greater China

Apply on Sorce