ARKA logo
ARKA3 months ago

Staff Data Engineer (TS/SCI with CI Polygraph) {S}

On-site · King of Prussia, Pennsylvania, United States

Type
Full Time
Level
Mid Level
Education
Bachelors Degree
Company size
Unknown

Job Summary

Lead and mentor a team working on the implementation of ETL pipelines to support various AI/ML and large language model (LLM) solutions. Responsibilities include assembling complex data sets, building infrastructure for optimal data extraction, implementing AI/ML algorithms, and improving internal processes for scalability. Candidates should have a B.S. in data science or a related field with at least six years of experience, including data pipeline development and using Python. An active TS/SCI security clearance with a CI polygraph is required.

Required Qualifications

  • B.S. in data science, AI/ML, computer science, or related field
  • Minimum six (6) years of relevant experience as a Data Engineer/Scientist
  • Experience developing data pipelines and normalizing data with canonical Python packages (e.g. NumPy, Pandas, Polars)
  • Experience contributing on a team using version control (e.g. git, GitLab, Bitbucket)
  • Active TS/SCI U.S. Government Security Clearance with a recent Counter Intelligence Polygraph (CI)

Desired Qualifications

  • M.S. or PhD in Data Science, AI/ML, Computer Science, or related field
  • Experience with Gitlab, DevSecOps utilizing test-driven development
  • Experience with containers (e.g. Docker, Docker Compose)
  • Experience with cloud services (e.g. AWS)
  • Experience with distributed computing tools (e.g. Spark, Pyspark)
  • Experience leading an interdisciplinary team of researchers and software developers
  • Experience with Machine Learning libraries and frameworks such as HuggingFace and LangChain
  • Experience with Linux
  • Familiarity with using AWS cloud computing resources such as EC2, S3, Lambda, Bedrock, etc.
  • Experience with additional programming languages such as Java, C++, Rust, Go, and/or C#
  • Experience implementing algorithms on the GPU in Python or C++ using CUDA and other CUDA libraries
  • Experience in application deployment, virtualization, and containerization (e.g. Podman, Docker, Kubernetes, Rancher)
  • Experience shaping and writing proposals

Additional Requirements

  • No visa sponsorship is available for this position
Sorce

Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.

Hiring someone like this?

Get your role in front of qualified candidates on Sorce.

Get started

ARKA

Staff Data Engineer (TS/SCI with CI Polygraph) {S}

Apply on Sorce