The job involves supporting the development and management of various AWS services, including Compute, Database, and Storage. This position plays a crucial role in ensuring the operational excellence of AWS regions, impacting customers across diverse industries. The team values collaboration, innovation, and a supportive work environment, fostering a culture where everyone can thrive.
You'll be responsible for
📝
Defining system requirements
Defining and/or refining system requirements, participating in the development and delivery of operability-related features such as system health monitoring, diagnostics, repair, and other self-healing automation.⚙️
Developing management tools
Developing or further existing application and system management tools and processes that reduce manual efforts and increase overall efficiency.🔍
Monitoring system health
Monitoring the health of the fleet, automating system health, maintenance tasks, and reporting systems as needed.Skills you'll need
🖥️
Systems engineering
Hands-on experience in systems engineering, particularly in networking, storage systems, and operating systems.💻
Programming
Proficiency in at least one modern programming language such as Python, Ruby, Golang, Java, C++, C#, or Rust.🐧
Linux/Unix experience
Experience working with Linux/Unix systems and distributed systems at scale.