Data Science - AI Document Understanding, Co-op
Remote · United States
Job Summary
Data Science - AI Document Understanding Co-op at Ancestry who will design and implement AI-native, autonomous, multi-agent workflows to extract and organize text and image information from genealogical records. The role focuses on OCR/HTR, NER, Relation Extraction, Coreference Resolution, Summarization, and Knowledge Graphs across diverse historical collections, with collaboration on cloud deployment, model evaluation, and presenting insights to technical and non-technical audiences. The position is part-time/work-study for active master's or PhD students and requires advanced studies in a quantitative field with AI/LLM specialization.
Required Qualifications
- Currently pursuing an advanced degree (Master's or PhD preferred) in Computer Science, Data Science, Statistics, Mathematics, Linguistics, Engineering or related quantitative field
- Specialization in AI & LLMs including familiarity with foundational models such as GPT, Gemini, Qwen, Llama, Claude, etc.
- Experience with inference optimization, vLLM, LoRA, QLoRA, quantization, etc.
- Familiar with embeddings, vector databases, transformer models, with software development experience
- Strong proficiency in Python and relevant tools and libraries, including transformer models, multi-modal models, and general NLP (e.g., Hugging Face Transformers, agentic frameworks andworkflows, LangChain, LangGraph, CrewAI, AgentCore)
- Familiarity with cloud platforms and related AI/ML services such as Google Cloud Platform, GCP, Gemini API, Vertex AI, AWS EC2, S3, SageMaker, Model Registry, or Bedrock is a plus
- Ancestry is an Equal Opportunity Employer... background check requirements
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.