Senior Scientist, Synthetic Data and Privacy
$168,000–$304,750 year
Remote · New York City, New York, United States or California, United States
Job Summary
Senior Scientist focused on synthetic data and privacy for LLMs. Build LLM-based methods for synthetic data generation, privacy, and context-aware anonymization; optimize low-latency, high-throughput inference (distillation, quantization); scale frameworks for real-time usage; design and maintain open-source libraries and SDKs with robust APIs and documentation; publish original research at top conferences to sustain NVIDIA’s leadership; mentor interns and junior researchers. Collaborate with research, engineering, product teams, and external labs; contribute to NVIDIA NeMo ecosystem; pursue multilingual and multimodal data evaluation and privacy-preserving techniques; drive software excellence with modern tooling and CI/CD. PhD or equivalent and a track record of impactful research and software contributions are expected.
Required Qualifications
- PhD in Computer Science, Machine Learning, Statistics, or a related field, or equivalent experience
- 2+ years of applied LLM/NLP research and engineering
- Experience developing or maintaining software libraries used by a broad developer community
- Strong publication record at top venues (e.g., NeurIPS, ICML, ICLR, ACL)
- Contributions to open-source projects in ML, security, or privacy domains
- Ability to build and optimize scalable data processing pipelines for large-scale models
- Knowledge of GDPR/CCPA (preferred)
- Excellent collaboration across research, engineering, and product teams
- Mentorship experience (interns/junior researchers)
- Experience with inference optimization (quantization, distillation) and real-time frameworks
- Familiarity with NeMo ecosystem and related open-source tooling
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.