The job focuses on redefining how AWS designs, builds, and operates its regions, enabling innovative cloud infrastructure and services for diverse customers. By tackling complex distributed systems challenges, team members contribute to a high-availability experience for AWS users. Collaboration and continuous improvement are key aspects of the team's culture.
You'll be responsible for
🔧
Supporting system requirements
Refining system requirements and participating in the development and delivery of operability-related features such as system health monitoring and self-healing automation.📈
Developing management tools
Creating or enhancing application and system management tools and processes to improve efficiency.📊
Monitoring system health
Monitoring the health of the fleet, automating maintenance tasks, and reporting systems as needed.