Site Reliability Engineer - Cloud & SaaS Operations Location: Limerick, Ireland | Hybrid The Role We are seeking an experienced Site Reliability Engineer (SRE) to join a growing Engineering team supporting a leading SaaS platform. You will ensure high availability, scalability, and performance of production, staging, and development environments, with a focus on automation, cloud operations, and production support. Key Responsibilities Build, maintain, and monitor highly available cloud infrastructure (Linux/Windows). Provide 24/7 production support and troubleshoot technical issues. Collaborate with application, DBA, and cloud teams to deliver scalable solutions. Implement automation and Infrastructure-as-Code (Terraform, Ansible, scripting). Monitor and improve system performance using tools like Prometheus, Grafana, or ELK. Ensure security best practices and disaster recovery processes are followed. Essential Skills & Experience 3+ years in SRE, DevOps, or Systems Administration roles. Hands-on experience with AWS services (EC2, S3, Lambda, VPC, IAM, etc.). Strong scripting/automation skills (PowerShell, Python, or similar). Familiarity with containerization (Docker, Kubernetes, Helm). Experience with multi-tier SaaS or microservices architectures. Good understanding of networking, load balancing, and patch management. Desirable CI/CD pipelines, Git, Azure DevOps. SQL Server administration. Experience in regulated environments or cloud security governance. AWS certification or equivalent. Apply now if you are a hands-on, proactive engineer passionate about building reliable, scalable cloud solutions. Reperio Human Capital acts as an Employment Agency and an Employment Business. Skills: SRE Cloud DevOps CI/CD