Staff Data Engineer (TS/SCI with CI Polygraph) {S}
On-site · King of Prussia, Pennsylvania, United States
Job Summary
Staff Data Engineer/Scientist role focusing on developing AI/ML algorithms across various disciplines including large language models and predictive analytics. Responsibilities include leading an interdisciplinary team, implementing ETL pipelines, assembling complex data sets, building data infrastructure, maintaining data for metrics, and improving internal processes. Required skills include a B.S. degree and six years of experience as a Data Engineer/Scientist, proficiency in data pipeline development with Python, and an active TS/SCI clearance. Preferred qualifications focus on advanced degrees and extensive experience with AI/ML methodologies and collaborative tools.
Required Qualifications
- B.S. in data science, AI/ML, computer science, or related field
- Minimum six (6) years of relevant experience as a Data Engineer/Scientist
- Experience developing data pipelines and normalizing data with canonical Python packages (e.g. NumPy, Pandas, Polars)
- Experience contributing on a team using version control (e.g. git, GitLab, Bitbucket)
- Active TS/SCI U.S. Government Security Clearance with a recent Counter Intelligence Polygraph (CI)
Desired Qualifications
- M.S. or PhD in Data Science, AI/ML, Computer Science, or related field
- Experience with GitLab, DevSecOps utilizing test-driven development, containers (e.g. Docker, Docker Compose), cloud services (e.g. AWS), tools for distributed computing (e.g. Spark, Pyspark)
- Experience leading an interdisciplinary team of researchers and software developers
- Experience with Large Language Models and identifying ways to incorporate them into new domains and applications
- Applying Transformer-based architectures to domains outside of NLP such as computer vision
- Natural Language Processing algorithms such as BERT
- Reinforcement learning familiarity with Gymnasium Gym, OpenEnv, TorchRL, RLlib, and Stable Baselines
- Applying clustering algorithms and/or deep neural networks to real life problems
- Implementing tracking and pattern-of-life algorithms
- Experience with GenAI Ops techniques and frameworks (e.g. LangFuse, MLFlow, Arize Phoenix)
- Experience with Machine Learning libraries such as HuggingFace and LangChain
- Experience with Linux
- Familiarity with using AWS cloud computing resources such as EC2, S3, Lambda, Bedrock, etc.
- Experience with additional languages: Java, C++, Rust, Go, and/or C#
- Experience implementing algorithms on the GPU in Python or C++ using CUDA
- Experience in application deployment, virtualization, and containerization (e.g. Podman, Docker, Kubernetes, Rancher)
- Experience shaping and writing proposals
Additional Requirements
- This position requires an active TS/SCI U.S. Government Security Clearance with a recent Counter Intelligence Polygraph (CI)
- No visa sponsorship is available for this position
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.