STACK Infrastructure
Posted 1 month ago

Infrastructure Reliability Engineer

On-site: Chicago, Illinois, United States or Portland, Oregon, United States

Full Time, Mid Level, Bachelor's Degree, Enterprise

Job Summary

The Infrastructure Reliability Engineer is a subject-matter expert in electrical systems who ensures high uptime across STACK’s data-center portfolio. The role leads deep-dive investigations and root cause analyses (RCAs) for electrical infrastructure failures (UPS, switchgear, breakers, relays, generators, grounding systems, STS behavior, VFD interactions, and power quality disturbances), evaluates system performance under fault conditions to identify systemic vulnerabilities, engages OEMs and vendors to drive long-term improvements, and supports maintenance standards and asset strategy.

The engineer also collaborates with Workforce Development to enhance site-team training, contributes to availability reporting and risk trend monitoring to meet SLAs, and informs design review and turnover processes by identifying gaps in handoffs, limitations, or commissioning practices.

The role requires 5–8 years of experience in critical infrastructure environments; strong fluency in power distribution and protection systems, waveform data and event log analysis, and root cause methodologies; cross-disciplinary collaboration skills; and a Bachelor’s degree in Engineering or equivalent experience.

Domestic travel of up to 25% is expected. Locations include Manassas, VA; Sterling, VA; Portland, OR; Chicago, IL; and Dallas-Fort Worth, TX. Candidates must be eligible to work in the United States, and extensive background screening is required.

Required Qualifications

  • Bachelor's degree in Engineering or equivalent experience
  • 5–8 years of experience in critical infrastructure environments (e.g., data centers, substations, power generation, or utility systems)
  • Strong technical fluency in mission-critical electrical systems, including power distribution architecture, UPS systems, generators, grounding methodologies, protective relays, switchgear, controls integration, and power quality analysis
  • Experience analyzing electrical failures through waveform data, event logs, relay coordination, commissioning findings, or forensic troubleshooting
  • Hands-on experience with root cause analysis and reliability methodologies (e.g., FMEA, RCM)
  • Demonstrated ability to work across disciplines (Operations, Engineering, Vendors, Construction) to resolve complex technical issues
  • Experience with commissioning (Cx) and infrastructure design review processes
