Research Scientist / Engineer - Data & Evaluation
On-site · Palo Alto, California, United States
Job Summary
Research Scientists and Research Engineers to build the data and evaluation foundations for a web-scale video action model. You will design, implement, and scale data curation pipelines for web-scale video pretraining, develop video-specific annotation frameworks and quality filters, and build evaluation benchmarks to measure model capabilities. Collaborate with pre-training and post-training teams to ensure data quality informs research decisions, track model capability trends across training runs, and push forward scalable video understanding and generation quality at web scale.
Required Qualifications
- Strong understanding of data-centric ML and how web video data quality affects large generative model performance
- Experience building large-scale video data pipelines: ingestion, filtering, deduplication, and quality scoring
- Familiarity with video-specific data characteristics: temporal structure, motion quality, scene diversity, and action content
- Solid ML fundamentals with hands-on experience training or evaluating large generative models
- Ability to design evaluations for video generation models that are diagnostic, reproducible, and actionable
- Staff-level candidates are expected to define technical direction and drive research strategy independently; senior/MTS candidates execute complex projects with strong fundamentals and growing scope
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.