Data Engineer - GenAI
On-site · Brussels, Brussels Capital, Belgium
Job Summary
Data Engineer in GenAI practice responsible for end-to-end design and deployment of GenAI-ready data pipelines and AI platforms. Build batch and streaming pipelines processing unstructured data (text, images, audio) to support LLM fine-tuning and retrieval-augmented generation. Hands-on development on cloud platforms (Databricks, Azure, AWS) and expertise in data transformation (dbt), orchestration (Airflow/Prefect/Dagster), and data governance for enterprise-scale AI solutions. Collaborate with ML engineers and GenAI specialists to design data models and data-as-a-service layers, advocate for clean, maintainable Python code, and leverage Docker/Kubernetes for deploying AI-driven microservices. Requires 3-5 years of data engineering experience, Master’s degree in a quantitative field, and strong Python/Spark/dbt skills. Based in Brussels, Belgium.
Required Qualifications
- Master’s degree in a quantitative field (CS/Engineering/Math or related)
- 3-5 years of data engineering experience (ideally in consulting)
- Strong programming in Python
- Experience with Spark (PySpark, SparkSQL) for large-scale processing
- Proven experience with dbt for data modeling
- Experience with cloud platforms (Databricks, Azure, AWS)
- Familiarity with Airflow, Prefect, or Dagster
- Knowledge of data governance and scalable data architectures
- Experience with vector data management and GenAI readiness
- English proficiency; Dutch or French a plus
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.