Software Engineer, Data Infrastructure & Acquisition - Santiago, Chile
On-site · Remote, Oregon, United States
Job Summary
Software Engineer role in Data Infrastructure & Acquisition at Speechify. The role is responsible for all aspects of data collection to support model training, building high-quality datasets at petabyte scale, and integrating ingestion pipelines with cloud infrastructure (GCP, Terraform). The engineer collaborates with AI Scientists to optimize cost, throughput, and data quality, and contributes to the dataset roadmap for next-generation products. Proficiency in bash/Python on Linux, Docker, and IaC is required, along with experience on GCP; experience with web crawlers and large-scale data processing is a plus. The ideal candidate has a BS/MS/PhD in CS or a related field, 5+ years of software development experience, strong communication skills, and the ability to work in a distributed, fast-growing team.
Required Qualifications
- BS/MS/PhD in Computer Science or a related field
- 5+ years of industry software development experience
- Proficiency with bash/Python scripting in Linux environments
- Proficiency with Docker and Infrastructure-as-Code concepts
- Professional experience with a major cloud provider (GCP)
- Experience with web crawlers and large-scale data processing workflows is a plus
- Ability to handle multiple tasks and adapt to changing priorities
- Strong communication skills, both written and verbal