OpenAI3 days ago

Researcher, Agent Post-Training, Personality

OpenAI

$295,000–$445,000 year

On-site · San Francisco, California, United States

San Francisco, California, United StatesOn-siteFull Time$295,000–$445,000 yearMid LevelNot SpecifiedAI ServicesLarge

Type

Full Time

Level

Mid Level

Education

Not Specified

Company size

Large

Industry

AI Services

Job Summary

Researcher for the Agent Post-Training Personality team at OpenAI to help make OpenAI’s agents exceptional collaborators. Study what makes an agent thoughtful, clear, perceptive, appropriately proactive, and easy to work with, then translate those insights into evals, training data, reward signals, and model improvements. Responsibilities include developing rigorous understandings of collaborative behaviors across professional, creative, technical, and everyday contexts; turning qualitative judgments about model behavior into falsifiable hypotheses, evals, graders, and training interventions; studying user signals to determine which behaviors build trust and satisfaction; producing high-quality rollout data and preference data; improving reward models and RL objectives; collaborating with pretraining and post-training teams to shape upstream data choices; building pipelines for updating training data; partnering with product teams to turn consumer insight into model improvements and validating them in real workflows; owning projects end to end from observation to launch.

Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.

Hiring someone like this?

Get your role in front of qualified candidates on Sorce.

Get started