Overview
AWS has launched the European Sovereign Cloud (ESC), a significant development in Utility Computing (UC). We are seeking experienced systems engineers with a strong background in automation and operations to join the AWS Managed Operations team. You will help build, operate, and evolve operations and development teams delivering high-availability AWS services (EC2, S3, Dynamo, Lambda, Bedrock) exclusively for EU customers. For more information on ESC, refer to the AWS blog about the European Sovereign Cloud.
Your responsibilities
Oversee ongoing operations and expansion of the ESC, collaborating with global AWS teams.
Influence the evolution of AWS services and technology to improve availability, reliability, latency, performance, and efficiency.
Occasionally participate in on-call rotations to resolve incidents outside regular hours.
Review operational health of services within your team’s responsibility; identify anomalies and craft actionable bug reports.
Provide constructive feedback on change management documents and address operational backlog.
Develop and test scripts to improve workflows and automation.
Educate service teams about the European Sovereign Cloud to share knowledge and foster mutual understanding.
Collaborate with the team to deliver scalable services and maintain high-availability experiences for EU customers.
Commit to continuous learning and improvement for the reliability of software systems.
On-call and day-to-day
Expect a 24x7 on-call responsibility as part of a team to root-cause issues and maintain resilient and fault-tolerant systems.
Eligibility and Team
About the team: Utility Computing (UC) and European Sovereign Cloud (ESC) are part of AWS UC. AWS UC provides services ranging from foundational services (S3, EC2) to ongoing product innovations. Managed Operations engineers support customers requiring specialized security solutions for cloud services.
Fluency in written and spoken English.
Must be a national of an EU member state and reside in the EU to operate the ESC.
Amazon provides relocation support for successful EU relocations.
Employees will participate in an on-call rota.
Basic Qualifications
Experience in Linux OS and network troubleshooting, or networking administration and troubleshooting.
Experience in Python, Perl, or another scripting language.
Experience in systems engineering, site reliability engineering, and building/operating systems at scale.
Must be a national of an EU member state.
Ability to lead creation, revision, and improvement of standard operating procedures (SOPs) and drive operational best practices.
Preferred Qualifications
Experience with monitoring frameworks (CloudWatch, Datadog, Grafana, Elastic, or similar).
Experience mentoring junior engineers and leading cross-organizational initiatives.
Experience operating 24x7 high-availability distributed applications and optimizing fleet utilization.
Experience with Infrastructure as Code (CDK, CloudFormation, Puppet, Chef, Ansible, or similar).
Experience with CI/CD, DevOps practices, and Generative AI technologies including automated deployment, configuration management, and AI-powered automation tools.
Amazon is an equal opportunities employer. We value a diverse workforce and base decisions on experience and skills. For privacy information, please consult our Privacy Notice. Amazon is an equal opportunity employer and does not discriminate on protected statuses. If you need a workplace accommodation during the application process, please visit Amazon’s accommodations page.
Company: Amazon Development Centre Ireland LimitedJob ID: A10382213
#J-18808-Ljbffr