Senior Site Reliability Engineer
191746
Desired skills:
SRE, Azure, SaaS, Dublin
Reperio has partnered with a leading software company seeking an experienced Senior Site Reliability Engineer (SRE) to scale and support their Azure-hosted SaaS platform.
You'll play a key role in maintaining high system reliability, performance, and observability while collaborating closely with development, operations, and infrastructure teams to deliver a world-class customer experience.
Responsibilities:
1. Maintain high availability and reliability across Azure-based services.
2. Develop and enhance monitoring, alerting, and observability systems.
3. Automate provisioning, deployments, scaling, and incident response workflows.
4. Lead incident management, root cause analysis, and post-incident improvements.
5. Build and manage infrastructure via IaC tools (ARM, Bicep, or Terraform).
6. Optimize system performance and ensure compliance with ISO 27001, SOC 2, and GDPR requirements.
Requirements:
1. Proven experience in a SaaS or software product environment.
2. Strong expertise in Microsoft Azure infrastructure and core services.
3. Proficiency in scripting and automation (PowerShell preferred).
4. Hands-on experience with monitoring and observability tools (Azure Monitor, Grafana, Prometheus, Datadog).
5. Knowledge of containers (Docker, Kubernetes) and CI/CD pipelines.
6. Strong background in incident response and root cause analysis.
Benefits:
1. Work on a high-scale, cloud-native platform with a modern tech stack.
2. Collaborate with talented engineers in a DevOps-first culture.
3. Competitive compensation, benefits, and career development opportunities.
For more information or to apply, contact Seamus at Reperio Human Capital or apply via the provided link.
Reperio Human Capital acts as an Employment Agency and an Employment Business.
Seamus O'Rawe is recruiting for this role.