Infrastructure Engineer
Remote · Germany
Job Summary
This Infrastructure Engineer role focuses on owning and operating bare-metal GPU server fleets, the NVIDIA software stack, and on-prem and air-gapped deployments. Responsibilities include designing and provisioning GPU server hardware, managing OS and firmware, configuring networking and storage, integrating with on-prem inference serving, and performing on-site build-outs and handovers. The role requires hands-on experience with GPU stacks, Linux, PXE/iPXE/MAAS, and the Kubernetes substrate, plus the ability to work in Germany (remote-first with occasional Berlin events). Nice-to-haves include German language skills, NVIDIA DGX experience, InfiniBand/RDMA, and inference optimization. The team collaborates with SRE, ML, and MLOps on on-prem inference serving, model deployment, and performance tuning.
Required Qualifications
- 5+ years in bare-metal, HPC/GPU, data-center, or systems infrastructure engineering
- Strong bare-metal Linux skills (RHEL/Rocky/Ubuntu)
- Experience with the NVIDIA GPU stack (drivers, CUDA, GPU Operator, MIG, DCGM)
- Experience with on-prem and air-gapped environments
- Willingness to travel to customer sites for builds and deployments