Software Engineer, Data Infrastructure & Acquisition - Taipei, Taiwan
On-site · Remote, Oregon, United States
Job Summary
Software Engineer for Speechify's Data Infrastructure & Acquisition on the AI Data team. Role focuses on data collection and ingestion to support petabyte-scale model training; responsibilities include sourcing new audio data, extending and operating the cloud ingestion pipeline (GCP, Terraform), collaborating with AI Scientists to optimize cost/throughput/quality, and helping craft the dataset roadmap for next-generation products. Ideal candidate should have 5+ years of software development experience, proficiency with Python and Bash in Linux, Docker, and Infrastructure-as-Code, and hands-on experience with cloud providers (GCP). Familiarity with web crawlers and large-scale data processing is a plus, along with strong written and verbal communication skills.
Required Qualifications
- BS/MS/PhD in Computer Science or related field
- 5+ years of industry experience in software development
- Proficiency with bash/Python scripting in Linux environments
- Proficiency in Docker and Infrastructure-as-Code concepts
- Professional experience with at least one major Cloud Provider (GCP)
- Experience with web crawlers, large-scale data processing workflows is a plus
- Strong communication skills, both written and verbal
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.