Software Engineer, ML Data Infrastructure
On-site · New York City, New York, United States or Toronto, Ontario, Canada
Job Summary
Ideogram is hiring a Software Engineer, ML Data Infrastructure, to build robust data infrastructure powering petabyte-scale, multi-modal training pipelines. You will tackle complex distributed systems challenges, optimize data processing workflows, and partner with research scientists to translate data requirements into production-grade systems. The role emphasizes ownership, collaboration, and delivering scalable backend solutions in Python on a stack that includes Kubernetes, GCP services (Bigtable, BigQuery, Spanner, Pub/Sub), Docker, and Terraform, in support of AI design experiences used by millions of users. The team values fast iteration, curiosity, mentorship, and a culture of belonging. The role is in person in New York, NY (US) or Toronto, ON (CA).
Required Qualifications
- 2-5 years of experience developing and shipping large-scale distributed systems
- Strong fundamentals in data structures and algorithms
- Experience with databases and data storage architectures
- Hands-on experience with large-scale data processing systems
- Ability to drive projects from 0 to 1
- Proactive ownership and a bias toward action