Software Engineer, Data Infrastructure & Acquisition - Pittsburgh, PA, USA
$140,000–$200,000 per year
On-site · Remote, Oregon, United States
Job Summary
The Software Engineer on Speechify's Data Infrastructure & Acquisition team builds and scales data collection for model training. Responsibilities include identifying new audio data sources, bringing them into the ingestion pipeline, operating and extending cloud infrastructure (GCP, Terraform), and collaborating with Scientists to optimize cost, throughput, and data quality. The role works with the AI Team and Leadership to define the dataset roadmap powering Speechify's next-generation products, with emphasis on high-quality, petabyte-scale data at low cost and on tight integration of infrastructure, engineering, and research. The ideal candidate has a BS/MS/PhD in Computer Science or a related field, 5+ years of software development experience, strong Bash/Python scripting skills on Linux, Docker proficiency, Infrastructure-as-Code experience, and familiarity with large-scale data processing and web crawlers.
Required Qualifications
- BS/MS/PhD in Computer Science or related field
- 5+ years of industry experience in software development
- Proficiency with Bash/Python scripting in Linux environments
- Proficiency in Docker
- Experience with Infrastructure-as-Code concepts (Terraform)
- Professional experience with at least one major Cloud Provider (GCP)