Jobs
My ads
My job alerts
Sign in
Find a job Employers
Find

Senior site reliability engineer

DOCOsoft
Site reliability engineer
Posted: 5 October
Offer description

As an Senior Azure Site Reliability Engineer (SRE), you will play a critical role in ensuring the reliability, availability, and performance of our Vew SaaS platform hosted on Microsoft Azure. You will collaborate closely with development, operations, and infrastructure teams to design, implement, and maintain highly scalable and resilient systems. Your primary focus will be on automation, monitoring, incident response, and continuous improvement to enhance the overall reliability of our services.ResponsibilitiesImplement and maintain highly available, scalable, and fault-tolerant systems on Azure.Monitor system health and performance metrics to ensure reliability and proactively address issues.Maintain a set of metrics and reporting to demonstrate the operational performance of the Incident & Problem Management processes.AutomationDevelop and maintain automation scripts and tools for provisioning, deployment, monitoring, and scaling of services.Implement Infrastructure as Code (IaC) using tools like Azure Resource Manager templates to ensure consistent and reproducible environments.Leverage AI-based automation to predict and prevent incidents before they impact customers.Monitoring And AlertingConfigure and maintain monitoring solutions to provide real-time visibility into system health and performance.Define and implement alerting strategies to detect and respond to incidents in a timely manner.Incident ResponseRespond to and resolve incidents, including root cause analysis, mitigation, and communication with stakeholders.Develop and maintain incident response playbooks to streamline response processes.Continue to develop robust Incident management processes that will enable effective management of our customers from an Incident & Problem perspective.Ensure support issues are resolved within the contractual SLA’s.Conduct post-incident reviews and implement recommendations to prevent recurrence.Security And ComplianceEnsure systems and infrastructure adhere to security best practices and compliance requirements.Implement and maintain security controls, encryption, and access management mechanisms.Continuous ImprovementIdentify areas for optimization and implement solutions to improve system reliability, performance, and efficiency.Participate in regular reviews and retrospectives to drive continuous improvement in processes and systems.Drive continuous service improvement to work towards achieving operational excellence.Maintain up-to-date knowledge of the latest technologies and best practices in application support.Engaging with Development and Quality Assurance Teams on Support issues.Key Skills/ QualificationsBachelor's degree in Computer Science, Engineering, or related field.Proven experience as a Site Reliability Engineer or similar role, preferably in a SaaS environment.Strong proficiency in Microsoft Azure services, including compute, networking, storage, and monitoring.Experience with automation tools and scripting languages such as PowerShell.Solid understanding of containerization technologies (e.g., Docker, Kubernetes) and orchestration tools.Work with DevOps team to Improve CI/CD pipeline for reliability and deployment and efficiency.Experience with Bicep/Terraform and ARM templates for Infrastructure as Code (IaC).Hands-on experience with monitoring and logging tools such as Azure Monitor, Grafana, Prometheus, or Datadog.Knowledge of security best practices, compliance standards (e.g., ISO27001, SOC 2, GDPR), and relevant regulations.Excellent problem-solving skills and the ability to troubleshoot complex technical issues.Strong communication and collaboration skills, with the ability to work effectively in a cross-functional team environment.Preferred QualificationsAzure certifications such as Azure Administrator Associate or Azure Solutions Architect Expert.Familiarity with agile methodologies and agile development practices.Knowledge of cloud-native architectures and microservices-based applications.Experience with database technologies such as Azure SQL Database, Cosmos DB, or PostgreSQL.Comfortable in a Client Facing Role as role, with the ability to join regular Service Desk Review meeting with Clients.London Insurance Market Experience an advantage.DOCOsoft is a leading software and services provider to Lloyd’s of London and the broader London insurance market. We offer our people the opportunity to impact our growing business, exciting challenges to grow, a competitive salary, company pension, health insurance, remote and flexible working, and 25 days annual leave.Equal Opportunity Employer: DOCOsoft is committed to building an inclusive and diverse team that represents a variety of backgrounds, experiences and perspectives.
#J-18808-Ljbffr

Apply
Create an E-mail Alert
Job alert activated
Saved
Save
Similar job
Site reliability engineer
Dublin
Intuition IT – Intuitive Technology Recruitment
Site reliability engineer
€60,000 - €120,000 a year
Similar job
Senior site reliability engineer
Dublin
FIS
Site reliability engineer
€60,000 - €120,000 a year
Similar job
Site reliability engineer - apple services engineering
Dublin
Apple
Site reliability engineer
€100,000 - €125,000 a year
Similar jobs
jobs County Dublin
jobs Leinster
Home > Jobs > Engineering jobs > Site reliability engineer jobs > Site reliability engineer jobs in County Dublin > Senior Site Reliability Engineer

About Jobijoba

  • Company Reviews

Search for jobs

  • Jobs by Job Title
  • Jobs by Industry
  • Jobs by Company
  • Jobs by Location

Contact / Partnership

  • Contact
  • Publish your job offers on Jobijoba

Legal notice - Terms of Service - Privacy Policy - Manage my cookies - Accessibility: Not compliant

© 2025 Jobijoba - All Rights Reserved

Apply
Create an E-mail Alert
Job alert activated
Saved
Save