Sr Manager, AI Systems Quality & Reliability, Annapurna AI Servers and Systems

Amazon Development Center U.S., Inc. · Austin, Texas, United States

Location: Austin
Job Type: Full-time
Posted: July 02, 2026

Job Description

AWS Annapurna Labs is seeking a Senior Manager of Quality & Reliability Engineering to lead the QnR function within the Trainium Manufacturing, Quality and Reliability organization. You will own quality and reliability outcomes for all Trainium AI server products — from component qualification through fleet performance — leading an engineering team across multiple concurrent chip and system generations. This role defines reliability strategy for liquid-cooled and air-cooled platforms at rapidly scaling volumes, builds quality systems across a multi-supplier global manufacturing base, drives fleet failure investigations to root cause, and establishes the reliability characterization capabilities required for next-generation technologies.

Key job responsibilities
- Lead and grow a QnR engineering team, hiring, developing, and retaining top reliability and quality engineering talent.
- Set technical direction for component qualification, reliability testing (HALT, HTOL, ...
