Graphcore · 1 month ago

Hardware Reliability Engineer

On-site · Tainan, Taiwan

Type: Full Time
Level: Senior Level
Education: Bachelor’s Degree
Company size: Unknown
Industry: AI Hardware

Job Summary

  • Plan and execute reliability validation across board, server, and rack levels for AI servers with liquid cooling and HVDC architectures
  • Define and run environmental, accelerated, and mechanical tests (thermal/power cycling, humidity, corrosion, shock & vibration, HALT/HASS)
  • Lead shock & vibration validation for transportation and operational conditions
  • Assess reliability risks for liquid cooling systems and HVDC components
  • Perform reliability prediction and life data analysis (Weibull, MTBF)
  • Lead cross-functional design reviews and drive risk mitigation
  • Conduct failure analysis and RCA using standard FA methodologies
  • Define and maintain reliability/test specifications (JEDEC, Telcordia GR-63, JESD22, MIL-STD-810, ISTA, ASHRAE, UL, IEC)
  • Implement ongoing reliability testing for production quality
  • Document results and support customer audits and certifications

Required Qualifications

  • Bachelor’s or Master’s degree in Mechanical, Electrical, Reliability, Materials, or related Engineering
  • 10+ years of reliability engineering experience in AI servers, datacenter systems, HPC, or complex electronics
  • Hands-on experience with environmental, shock, and vibration testing
  • Strong knowledge of reliability methodologies and statistical analysis
  • Practical experience with liquid cooling and HVDC systems
  • Proven failure analysis and RCA capability
  • Strong communication skills in English; Mandarin a plus