Senior Software Engineer - Data Platform
On-site · Pune, Maharashtra, India
Job Summary
This Senior Software Engineer - Data Platform role focuses on batch processing. You will lead the design and operation of large-scale data platforms built on lakehouse architectures to power analytics, ML, and business decision-making. Responsibilities include diagnosing issues in complex distributed systems, architecting scalable solutions, building high-performance, fault-tolerant data pipelines, driving automated CI/CD and observability, mentoring junior engineers, and participating in design reviews. Key skills include Apache Spark (PySpark/Scala), SQL, AWS, batch data pipelines, data modeling, partitioning, performance tuning, workflow orchestration (Airflow), and programming in Python, Java, or Scala. Experience with Iceberg, Delta Lake, or Hudi, streaming systems (Kafka/Flink), data governance, and infrastructure-as-code is preferred. Leadership and collaboration within a data platform context are expected.
Required Qualifications
- Bachelor’s/Master’s degree in Computer Science or a related field
- 2–6 years of experience in data engineering / backend engineering
- Strong experience with Apache Spark (PySpark/Scala)
- Experience with SQL and large-scale data processing
- Experience with cloud platforms (AWS preferred)
- Experience building batch data pipelines at scale
- Solid understanding of data modeling, partitioning, and performance tuning
- Experience with workflow orchestration tools (Airflow or equivalent)
- Strong programming skills (Python / Java / Scala)