OverviewSenior Site Reliability Engineer role at DOCOsoft. You will help ensure the reliability, availability, and performance of our Vew SaaS platform hosted on Microsoft Azure. You will collaborate with development, operations, and infrastructure teams to design, implement, and maintain scalable and resilient systems with a focus on automation, monitoring, incident response, and continuous improvement.ResponsibilitiesImplement and maintain highly available, scalable, and fault-tolerant systems on Azure.Monitor system health and performance metrics to ensure reliability and proactively address issues.Maintain a set of metrics and reporting to demonstrate the operational performance of the Incident & Problem Management processes.Develop and maintain automation scripts and tools for provisioning, deployment, monitoring, and scaling of services.Implement Infrastructure as Code (IaC) using tools like Azure Resource Manager templates to ensure consistent and reproducible environments.Leverage AI-based automation to predict and prevent incidents before they impact customers.Configure and maintain monitoring solutions to provide real-time visibility into system health and performance.Define and implement alerting strategies to detect and respond to incidents in a timely manner.Respond to and resolve incidents, including root cause analysis, mitigation, and communication with stakeholders.Develop and maintain incident response playbooks to streamline response processes.Continue to develop robust incident management processes that enable effective management of customers from an Incident & Problem perspective.Ensure support issues are resolved within contractual SLAs.Conduct post-incident reviews and implement recommendations to prevent recurrence.Ensure systems and infrastructure adhere to security best practices and compliance requirements.Implement and maintain security controls, encryption, and access management mechanisms.Identify areas for optimization and implement solutions to improve system reliability, performance, and efficiency.Participate in regular reviews and retrospectives to drive continuous improvement in processes and systems.Drive continuous service improvement to achieve operational excellence.Maintain up-to-date knowledge of the latest technologies and best practices in application support.Engage with Development and Quality Assurance Teams on support issues.Collaborate with DevOps to improve CI/CD pipelines for reliability, deployment, and efficiency.Key Skills / QualificationsBachelor's degree in Computer Science, Engineering, or related field.Proven experience as a Site Reliability Engineer or similar role, preferably in a SaaS environment.Strong proficiency in Microsoft Azure services, including compute, networking, storage, and monitoring.Experience with automation tools and scripting languages such as PowerShell.Solid understanding of containerization technologies (e.g., Docker, Kubernetes) and orchestration tools.Experience with CI/CD improvement for reliability and deployment efficiency.Experience with Bicep/Terraform and ARM templates for Infrastructure as Code (IaC).Hands-on experience with monitoring and logging tools such as Azure Monitor, Grafana, Prometheus, or Datadog.Knowledge of security best practices, compliance standards (e.g., ISO27001, SOC 2, GDPR), and relevant regulations.Excellent problem-solving skills and the ability to troubleshoot complex technical issues.Strong communication and collaboration skills for working in a cross-functional team.Azure certifications such as Azure Administrator Associate or Azure Solutions Architect Expert are a nice to have.About DOCOsoftDOCOsoft is a leading software and services provider to Lloyd’s of London and the broader London insurance market. It was founded in 2008 and has since grown to become one of the leading insurance software specialists in the London Insurance Market. We are a growing team with offices in London, Dublin, Tokyo, Portugal and Poland. We offer a range of benefits including flexible working, health insurance, pension, remote options, and competitive salary.Equal Opportunity EmployerDOCOsoft is committed to building an inclusive and diverse team that represents a variety of backgrounds, experiences and perspectives. We welcome applications from all suitably qualified candidates, and do not discriminate on the grounds of race, religion, gender, marital or family status, age, disability, sexual orientation, or any other basis protected by applicable law. If reasonable accommodations are required during any stage of the recruitment process, please let us know.Location: Dublin, Ireland
#J-18808-Ljbffr