Placeholder Company for Greater China2 months ago

Research Internship – Reinforcement Learning for Large Foundation Models

Placeholder Company for Greater China

$80,168–$124,800 year

On-site · Bellevue, Washington, United States

Bellevue, Washington, United StatesOn-siteInternship$80,168–$124,800 yearEntry LevelDoctorate Or Professional DegreeStartup

Type

Internship

Level

Entry Level

Education

Doctorate Or Professional Degree

Company size

Startup

Job Summary

Research Internship in Reinforcement Learning for Large Foundation Models: Develop stable and efficient RL algorithms for large foundation models, conduct large-scale RL experiments in complex reasoning and autonomous agent scenarios, and explore RL algorithms, reward modeling, and world models. Collaborate with Tencent AI Lab researchers to deliver impactful algorithms for real-world applications and produce influential publications. Ideal candidates hold a Ph.D. in Computer Science, Machine Learning, Artificial Intelligence, or related fields from a top university, with research experience in NLP or ML, proficiency in Python and PyTorch, and a strong publication track record, plus excellent communication and teamwork for cross-functional collaboration. Internship lasts 3 months with possible extension; can start anytime in 2026; base pay range between $80,168.40 and $124,800 per year; paid sick leave, holidays, and company-sponsored medical plan eligibility for full-time interns.

Required Qualifications

Ph.D. in Computer Science, Machine Learning, Artificial Intelligence, or related fields from a top university
Research experiences in natural language processing or machine learning
Proficiency in Python programming
Experience developing with deep learning frameworks such as PyTorch
Good publication track record and history of creativity and intellectual flexibility
Excellent communication and teamwork skills; capable of collaborating with cross-functional teams to drive project success and innovation

Desired Qualifications

Experience in natural language processing or machine learning research (preferred)
Good publication track record (preferred)
Proficiency in Python programming (preferred)
Experience developing with deep learning frameworks such as PyTorch (preferred)
Excellent communication and teamwork skills (preferred)

Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.

Hiring someone like this?

Get your role in front of qualified candidates on Sorce.

Get started