Distinguished Technologist - AI Model Performance Architect
$190,000–$274,000 year
On-site · Palo Alto, California, United States or Spring, Texas, United States
Job Summary
Distinguished Technologist - AI Model Performance Architect role focuses on bridging system memory architecture with AI model behavior to optimize performance through HW/SW co-design and workload-aware tuning. It requires analyzing bottlenecks across CPU/GPU/NPU, simulating workload execution, and optimizing model execution (initialization time, caching, KV reuse). The position drives HW/SW co-design for memory and scheduling, partners with ML teams to align model structure with system architecture, and develops technology roadmaps for electrical hardware engineering. It involves mentoring staff, providing design input for new product development, and communicating strategy and roadmaps to executives, industry partners, and customers. The role emphasizes cross-organizational collaboration, innovation, and leadership in long-range research and development projects.
Required Qualifications
- Four-year or Graduate Degree in Electrical Engineering, or related discipline or commensurate work experience
- Typically has 12+ years of work experience in electrical design and tools, software packages, architecture of electronic hardware, or related field
Apply with one swipe on Sorce. We auto-fill applications and apply on your behalf — no cover letters, no 40-minute forms.
Hiring someone like this?
Get your role in front of qualified candidates on Sorce.