DevOps SRE Engineer
Number of Vacancies: 1
Full Time
Location: Dublin, Ireland
Work Type: Hybrid
Note: Candidates must have a valid "Right to Work" and be available in Ireland.
· Irish or EU Citizen
· Stamp 4
· Dependant Visa with Permission to Work
As an SRE Engineer, you will serve as the steward of production readiness for Gateway products and their integrations with other platforms. In this role, you will collaborate closely with development teams to design, implement, and support services, ensuring operational resilience, driving automation, and maintaining compliance across all systems.
Key Responsibilities:
· Lifecycle Ownership: Engage in and improve the entire service lifecycle—from design and deployment to operations and continuous improvement.
· Operational Readiness: Ensure system availability, capacity, performance, monitoring, and self-healing capabilities are embedded throughout delivery.
· Incident Management: Practice sustainable incident response, lead blameless postmortems, and optimize Mean Time to Recovery (MTTR).
· Automation & CI/CD:
· Develop and maintain automation pipelines for certificate renewal, traffic routing, alerting, and compliance reporting using tools such as Ansible, Venafi, and XLR templates.
· Support CI/CD pipelines for software promotion and operational gating.
· Reliability Engineering: Scale systems sustainably through automation and advocate for changes that improve reliability and velocity.
· Compliance & Risk Management: Drive initiatives for Safety & Soundness, PCI compliance, threat/toil reduction, and ITSM defect resolution.
· Monitoring & Observability: Implement robust logging, monitoring, and alerting standards to ensure system health and proactive issue detection. This requires hands-on experience configuring monitoring and alerting in Dynatrace and Splunk.
· Collaboration: Work with global teams across multiple time zones and mentor junior engineers.
· Continuous Improvement: Provide feedback loops to development teams on resiliency gaps and operational enhancements.
· Rotational On-Call & Flexibility:
· Participate in rotational on-call support for critical production systems.
· Demonstrate flexibility to take on additional responsibilities and ad-hoc duties as needed to support team and organizational goals.
Required Skills & Qualifications:
· 5+ years of experience in Project, Site Reliability Engineering, or DevOps roles.
Technical Expertise (Mandatory):
· Strong understanding of NGINX configuration, reverse proxying, load balancing, caching, and security hardening.
· Expertise in managing gRPC event-driven architectures.
· Proficiency in DevOps tools: Chef, Jenkins, Groovy, shell scripting, Bitbucket, Git, Ansible, XLR.
· Experience with AWS infrastructure, secure access practices, and cloud-native deployments.
Security & Compliance:
· Awareness of certificate lifecycle management, mutual TLS, the SSL/TLS handshake, SSH keys, and encryption standards.
· Familiarity with ITSM processes, compliance frameworks, and incident management.
Networking & Systems:
· Knowledge of client-server relationships, network layers (L1–L7), load balancers (F5 BIG-IP), and application firewalls.
· Ability to analyze stack traces, TCP dumps, heap/thread dumps, and perform