Advisor - Data Architect, Data Foundry
$151,500–$244,200 year
On-site · San Francisco, California, United States or Boston, Massachusetts, United States
Job Summary
Data Foundry Architect at Lilly responsible for designing schemas, ontologies, data models, and platform architectures to enable AI-native drug discovery. Lead work across data modeling, data platform and lakehouse architecture, and knowledge graph design; implement semantic standards (RDF/OWL/SPARQL), build and optimize ETL/ELT pipelines, and develop machine-actionable data assets. Partner with scientific software and Tech@Lilly teams to scale and productionize data architectures, ensure compliance, and support analytical model deployment. The role involves working with data from chemical, biological, and automation-generated sources, applying modern data platforms (Databricks, Snowflake) and graph/document/vector databases, and delivering end-to-end data infrastructure to support discovery workflows and AI agents.
Required Qualifications
- MS or PhD in Computer Science, Data Science, Bioinformatics, Computational Biology, Information Science, or related STEM field
- 6+ years data architecture, data engineering, or scientific informatics experience (MS) OR 2+ years (PhD)
- Deep expertise in data modeling/ontologies, data platform/lakehouse architecture, or knowledge graphs
- Experience with relational, graph, document, and vector databases; semantic web technologies; ETL/ELT; streaming platforms
- Familiarity with cloud platforms (AWS/Azure/GCP) and LIMS/ELN integrations
- FAIR data principles and scientific data standards in chemistry/biology
- Strong communication skills and ability to translate concepts to technical and scientific audiences
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.