Software Engineer, Data Infrastructure & Acquisition - Alexandria, VA, USA
$140,000–$200,000 per year
On-site · Remote, Oregon, United States
Job Summary
Software Engineer on Speechify's Data Infrastructure & Acquisition team, which is responsible for all aspects of data collection to support model training. The role involves building and operating a petabyte-scale data ingestion pipeline, extending cloud infrastructure (GCP, Terraform), collaborating with AI Scientists to improve cost, throughput, and quality, and helping craft the dataset roadmap for next-generation products. The ideal candidate has a BS/MS/PhD in CS, 5+ years in software development, strong Bash/Python skills in Linux, Docker and Infrastructure-as-Code experience, and familiarity with web crawlers and large-scale data processing. The role offers a fast-growing, entrepreneurial environment with competitive compensation and the opportunity to shape products built for people with learning differences, as part of a fully distributed team.
Required Qualifications
- BS/MS/PhD in Computer Science or related field
- 5+ years of industry software development experience
- Proficiency with Bash/Python scripting in Linux environments
- Proficiency in Docker and Infrastructure-as-Code concepts
- Professional experience with at least one major Cloud Provider (GCP)
- Experience with web crawlers and large-scale data processing workflows is a plus
- Strong communication skills, both written and verbal