Speechify logo
Speechify30 months ago

Software Engineer, Data Infrastructure & Acquisition - Columbia, SC, USA

$140,000–$200,000 year

On-site · Remote, Oregon, United States

Type
Full Time
Level
Mid Level
Education
Doctorate Or Professional Degree
Company size
Unknown
Industry
Media Services

Job Summary

Software Engineer for Speechify's Data Infrastructure & Acquisition on the AI team. Lead data collection and ingestion to support model training at scale, build and extend cloud ingestion pipelines (currently on GCP) with Terraform, and collaborate with AI scientists to optimize cost, throughput, and data quality. Work across teams to define the dataset roadmap powering next-generation products and datasets. Ideal candidates have a BS/MS/PhD in CS (or related field) and 5+ years of software development experience, strong Bash/Python scripting, Docker, Infrastructure-as-Code (Terraform or similar), and experience with cloud providers, data crawlers, and large-scale data processing. The role emphasizes scrappy data sourcing, scalable pipelines, and contributing to a fast-growing, asynchronous culture in a fully distributed environment.

Required Qualifications

  • BS/MS/PhD in Computer Science or a related field
  • 5+ years of industry experience in software development
  • Proficiency with bash/Python scripting in Linux environments
  • Proficiency in Docker and Infrastructure-as-Code concepts and professional experience with at least one major Cloud Provider (GCP)
  • Experience with web crawlers, large-scale data processing workflows is a plus
Sorce

Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.

Hiring someone like this?

Get your role in front of qualified candidates on Sorce.

Get started

$140k – $200k / yr

Software Engineer, Data Infrastructure & Acquisition - Columbia, SC, USA · Speechify

Apply on Sorce