The job focuses on redefining how AWS designs, builds, and operates its regions, enabling innovative cloud infrastructure and services for diverse customers. By tackling complex distributed systems challenges, team members contribute to a high-availability experience for AWS users. Collaboration and continuous improvement are key aspects of the team's culture.
You'll be responsible for
🔧
Supporting system requirements
Refining system requirements and participating in the development and delivery of operability-related features such as system health monitoring and self-healing automation.📈
Developing management tools
Creating or enhancing application and system management tools and processes to improve efficiency.📊
Monitoring system health
Monitoring the health of the fleet, automating maintenance tasks, and reporting systems as needed.Skills you'll need
⚙️
Automation
Contributing to automation for new and current system experience, enhancing efficiency and reducing manual efforts.💻
Programming
Experience programming with at least one modern language such as Python, Ruby, Golang, Java, C++, C#, or Rust.🐧
Linux/Unix
Experience with Linux/Unix systems, particularly in high-availability, distributed systems and services.View more