Software Engineer, Data Infrastructure & Acquisition - Nagoya, Japan
On-site · Remote, Oregon, United States
Job Summary
Software Engineer for Speechify's Data Infrastructure & Acquisition role focusing on building and maintaining data collection and ingestion pipelines to support large-scale model training. Responsibilities include sourcing new audio data, expanding the ingestion pipeline, operating cloud infrastructure (GCP) and Terraform, collaborating with AI Scientists to optimize cost, throughput, and data quality, and shaping the AI Team’s dataset roadmap for consumer and enterprise products. The ideal candidate has a PhD/MS/BS in CS or related field, 5+ years of software development experience, strong Bash/Python skills, Docker and IaC expertise, and experience with cloud providers and data-processing workflows. The role emphasizes data collection, ingestion, scalable infrastructure, collaboration with researchers, and contributing to a fast-growing, entrepreneurial environment.
Required Qualifications
- BS/MS/PhD in Computer Science or related field
- 5+ years of industry experience in software development
- Proficiency with bash/Python scripting in Linux environments
- Proficiency in Docker and Infrastructure-as-Code concepts
- Professional experience with at least one major Cloud Provider (GCP)
- Experience with web crawlers, large-scale data processing workflows is a plus
- Strong communication skills, both written and verbal
- Ability to handle multiple tasks and adapt to changing priorities
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.