Software Engineer, Data Infrastructure & Acquisition - Melbourne, Australia
On-site · Remote, Oregon, United States
Job Summary
Be scrappy to find new sources of audio data and bring it into our ingestion pipeline. Operate and extend the cloud infrastructure for the ingestion pipeline, currently running on GCP and managed with Terraform. Collaborate closely with Scientists to shift cost/throughput/quality frontier, delivering richer data at bigger scale and lower cost to power Speechify’s next-generation models. Work with the AI Team and Speechify Leadership to craft the AI Team’s dataset roadmap powering our consumer and enterprise products. Ideal candidate has BS/MS/PhD in Computer Science or related field, 5+ years of software development experience, proficiency with bash/Python on Linux, Docker and IaC, and experience with at least one major cloud provider (GCP); experience with web crawlers and large-scale data processing is a plus.
Required Qualifications
- BS/MS/PhD in Computer Science or a related field
- 5+ years of industry experience in software development
- Proficiency with bash/Python scripting in Linux environments
- Proficiency in Docker and Infrastructure-as-Code concepts
- professional experience with at least one major Cloud Provider (GCP)
- Experience with web crawlers, large-scale data processing workflows is a plus
- Strong communication skills, both written and verbal
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.