Software Engineer, Data Infrastructure & Acquisition - Brighton, United Kingdom
On-site · Remote, Oregon, United States
Job Summary
Data-side Software Engineer role focused on data collection to support model training. You will be scrappy in finding new audio data sources and bringing them into the ingestion pipeline, and you will manage and extend the cloud infrastructure for ingestion (GCP, Terraform). You will collaborate with AI Scientists to optimize cost, throughput, and data quality at scale, and partner with the AI Team and leadership to define the dataset roadmap powering Speechify’s next-generation products. The ideal candidate has a BS/MS/PhD in CS, 5+ years of software development experience, and proficiency in bash/Python on Linux, Docker, and IaC with GCP, along with strong communication skills; experience with web crawlers and large-scale data workflows is a plus. The role offers a fast-growing, entrepreneurial environment with an asynchronous culture and opportunities to impact products used by millions, including accessibility-focused initiatives.
Required Qualifications
- BS/MS/PhD in Computer Science or a related field
- 5+ years of industry software development experience
- Proficiency with bash and Python scripting in Linux environments
- Proficiency in Docker and Infrastructure-as-Code concepts
- Professional experience with at least one major cloud provider (GCP)
- Experience with web crawlers or large-scale data processing workflows is a plus
- Ability to handle multiple tasks and adapt to changing priorities
- Strong communication skills, both written and verbal