Senior Data Engineer
Remote · Warsaw, Mazovia, Poland
Job Summary
The Senior Data Engineer will design and build cloud-native data platforms from greenfield to production, migrate on-premises systems to the cloud, and drive AI-ready data infrastructure. The role collaborates with Machine Learning, Data Science, and Product teams as a technical contributor and thought leader. Responsibilities include leading near-real-time ingestion pipelines, defining platform standards (Data Lake/Lakehouse, medallion architecture, data contracts), optimising Spark/PySpark scripts, implementing CI/CD and code quality practices, and enabling self-service tooling and microservices. Focus areas include event-driven architectures, agentic AI workflows, data quality, observability, and scalable data pipelines. Strong Python and SQL, Spark/PySpark, Databricks or Snowflake experience, multi-cloud exposure (Azure/AWS/GCP), Kafka/Spark Streaming, Airflow, Terraform, and English proficiency are required.
Required Qualifications
- 5+ years of professional experience in Data Engineering
- Strong Python and SQL skills for pipeline development and optimisation
- Proficiency in Apache Spark / PySpark, including query optimisation and performance tuning
- Hands-on experience with Databricks (preferred) or Snowflake
- Experience with at least one major cloud provider: Azure (preferred), AWS, or GCP
- Experience with stream processing technologies (Kafka, Spark Structured Streaming)
- Solid understanding of ETL/ELT patterns, data modelling (dimensional, Data Vault), and data warehousing
- Experience with orchestration tools (Apache Airflow, Azure Data Factory, or equivalent)
- Knowledge of Infrastructure as Code (Terraform or equivalent)
- Understanding of production-grade system requirements: reliability, scalability, observability, and performance
- Upper-Intermediate English level
- Familiarity with RAG pipeline design and LLM integration patterns
- Knowledge of data governance frameworks and tools (Unity Catalog, Apache Atlas, or similar)
- Experience with dbt for data transformation and modelling
- Familiarity with MLflow, Feature Stores, or ML platform integration
- Self-driven and proactive in identifying improvements
- Comfortable working in a fast-paced, innovative environment
- Strong problem-solving mindset with attention to detail
- Open to experimenting with emerging technologies and approaches
- Remote work allowed