Speechify logo
Speechify30 months ago

Software Engineer, Data Infrastructure & Acquisition - Pittsburgh, PA, USA

$140,000–$200,000 year

On-site · Remote, Oregon, United States

Type
Full Time
Level
Mid Level
Education
Doctorate Or Professional Degree
Company size
Unknown
Industry
Media Services

Job Summary

Software Engineer for Speechify's Data Infrastructure & Acquisition team focused on building and scaling data collection for model training. Responsible for identifying new audio data sources, bringing them into the ingestion pipeline, operating and extending cloud infrastructure (GCP, Terraform), and collaborating with Scientists to optimize cost, throughput, and data quality. Works with AI Team and Leadership to define the dataset roadmap powering Speechify’s next-generation products, with emphasis on high-quality, petabyte-scale data at low cost and tight integration of infrastructure, engineering, and research. Ideal candidate has a CS-related BS/MS/PhD, 5+ years of software development experience, strong scripting in Bash/Python on Linux, Docker proficiency, IaC experience, and familiarity with large-scale data processing and web crawlers.

Required Qualifications

  • BS/MS/PhD in Computer Science or related field
  • 5+ years of industry experience in software development
  • Proficiency with bash/Python scripting in Linux environments
  • Proficiency in Docker
  • Experience with Infrastructure-as-Code concepts (Terraform)
  • Professional experience with at least one major Cloud Provider (GCP)
Sorce

Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.

Hiring someone like this?

Get your role in front of qualified candidates on Sorce.

Get started

$140k – $200k / yr

Software Engineer, Data Infrastructure & Acquisition - Pittsburgh, PA, USA · Speechify

Apply on Sorce