AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we’re the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain — and we’re looking for talented people who want to help.
You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.
The Infrastructure Operations (Data Center) Team is the backbone of AWS, supporting the rapidly growing AWS business and customers 24/7. We are committed to maintain the physical infrastructure of AWS, ensuring the standards for operational performance in the areas of safety, security, availability, productivity, capacity, efficiency, and cost.
We are looking for a Data Center Engineering Operations (DCEO) Engineer with experience in critical facilities management, and a result-driven individual with strong technical understanding and the drive and vision to take our data center engineering operations to the next level. The DCEO Engineer is responsible for engineering operations including risk management and mitigation, planning, implementation of corrective and preventative maintenance for critical infrastructure and vendor management within our AWS Data Center environment. They are responsible for day-to-day operational excellence, maintenance of the critical infrastructure, supervising specialist vendors, acting as first responders to incidents, and becoming subject matter experts for the facility.
Key job responsibilities
The Data Center Engineering Operation Engineer is responsible for ensuring that all electrical, mechanical, and fire/life safety equipment within the data center is operating within contract parameters within facilities. Often this will be including risk management and mitigation, corrective and preventative maintenance of critical infrastructure, vendor management and metric reporting.
• Act as the site primary point of contact for internal and external stakeholders involving communication and relationship management.
• Meet daily hours of operations, on call requirements and response during rotations.
• Manage minor and major planned and unplanned site works for critical infrastructure, with a solid understanding of the works involved, risks, mitigation and seeking approvals within relevant SLA’s as required.
• Responsible for the on-site management of contractors, sub-contractors and vendors, ensuring that all work performed is in accordance with established practices, procedures & local legislation.
• Establish performance benchmarks, conduct analysis, and prepare reports on all aspects of the data center facility infrastructure operations and maintenance.
• Generate change management requests & incident management tickets for DCEO activities.
• Work with DCO managers (IT), Networking, Logistics, Safety, Security and other business leaders and operating partners to coordinate projects, manage capacity, and optimize plant safety, performance, reliability, sustainability and efficiency.
• Establish documentation relevant to technical support of business & facility operations.
• Drive & implement projects to increase current facility capacity, efficiency, sustainability & reliability.
• Assist in recruiting efforts.
• Support operating partners in the resolution of any infrastructure engineering issues
A day in the life
In day today scale, you will be involved in:
• Troubleshoot facility and rack-level events within internal Service Level Agreements (SLA).
• Perform rack installs, rack decommissioning, and facility MEP management.
• Provide operational readings and key performance indicators to make sure uptime is maintained
• Responsible for the on-site management of contractors, sub-contractors and vendors, ensuring that all work performed is in accordance with established practices, procedures & local legislation.
• Performance and oversight of maintenance and operations on all electrical, mechanical, and fire/life safety equipment within the data center.
• Work schedule changes depending on specific site needs. Shifts can be up to 12-hours and may rotate on a predefined schedule. Some locations have on-call rotations.
About the team
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
Why AWS
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why flexible work hours and arrangements are part of our culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.
Inclusive Team Culture
Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.
Mentorship and Career growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
• 4+ years of relevant work experience in maintaining a DC or Critical space facility and has ability to prioritize in complex, fast-paced environment.
• Familiar with the concepts and interaction of Service Management systems (Problem and Change).
• Ability to participate in a 24 x 7 rotating shift roster.
• Bachelor’s Degree in either Electrical Engineering, HVAC, Mechanical Engineering or relevant technical (military/trade school) degree and relevant experience in a critical environment.
• Understanding of the electrical and mechanical systems used in a data center environment, including but not limited to DRUPS, Transformers, Generators, Switchgear, UPS systems, ATS/STS units, PDUs, Chillers, AHUs and CRAC units.
• Experience in management of vendors/contractors performing construction, maintenance and upgrading works in large-scale critical environment.