Job Description
S/he will build relationships with colocation (COLO) service providers and work with them to manage the critical electrical/mechanical systems within data centers. By responding to emergent failures, managing and mitigating risk, and tracking all daily maintenance of critical equipment, s/he will evaluate the service performance of COLO providers and drive them to improve using operational data. S/he will also work with internal teams such as procurement to influence improvements to SLA terms or strategy.
- Responsible for critical facility operation of Alibaba overseas data centers.
- Responsible for managing changes to critical systems, responding to emergent failures and tracking preventative maintenance.
- Responsible for capacity management of facility systems such as power and cooling.
- Manage availability risks of facility systems and drive their resolution.
- Drive and implement projects to improve the capacity, efficiency and reli...