Protege logo
Protege1 week ago

Senior Software Engineer, Data Processing

Remote · United States

Type
Full Time
Level
Senior Level
Education
Not Specified
Company size
Unknown

Job Summary

Senior Software Engineer, Data Processing role focusing on the ingestion and processing layer for large-scale multimodal data (medical imaging, audio, video) to produce AI-ready datasets. You’ll own end-to-end ingestion pipelines, build parsers/validators/normalization logic, optimize for high volume and reliability, handle sensitive data (PHI) with proper security and compliance, track provenance and usage constraints, and collaborate with product and Data Lab to support new modalities. Must-haves include 5+ years in production backend/data systems, Python proficiency, distributed data processing, and AWS expertise; nice-to-haves cover modalities like medical imaging, HIPAA/PHI handling, Airflow/Dagster, GCP/Azure, startup experience, and ML/NLP/LLM familiarity.

Required Qualifications

  • 5+ years building and operating production backend or data systems, with real experience in data processing at scale
  • Hands-on experience designing and running large-scale data pipelines
  • Strong programming skills in Python
  • Experience with distributed data processing
  • Strong proficiency with AWS
  • Comfort with messy, varied, high-volume data and high ambiguity, with a knack for finding patterns in complex environments
  • Attention to detail without losing speed, and a bias to action
  • Excited to work on a product built around moving and processing large volumes of data
  • Curious, tenacious, and proactive
Sorce

Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.

Hiring someone like this?

Get your role in front of qualified candidates on Sorce.

Get started

Protege

Senior Software Engineer, Data Processing

Apply on Sorce