DevOps / Site Reliability Engineer (Event-Driven Systems)
Location:
Dublin – Hybrid
Type:
Permanent
The Company
A well-established technology organisation building large-scale, high-availability platforms used across multiple markets.
Having a solid engineering team, low attrition, and a genuinely positive working environment.
You can expect a collaborative culture, modern tooling, and clear long-term career progression, supported by leadership that values quality, ownership, and people.
The Role
This is a senior hands-on engineering role focused on reliability, observability, and event-driven platforms.
You will work closely with production systems, helping teams understand system behaviour, improve resilience, and operate complex distributed services at scale.
This role suits someone who enjoys solving real production problems and working within a strong, supportive team environment.
What You'll Be Doing
Improving observability using modern monitoring, logging, and tracing tools
Designing and maintaining CI/CD pipelines for safe, automated deployments
Supporting production systems through incident response and root cause analysis
Operating and optimising Apache Kafka in high-throughput environments
Working with event-driven architectures, including CQRS and event sourcing
Applying strong security and compliance practices across platforms
Working primarily in Azure-based environments with some AWS exposure
What We're Looking For
Strong hands-on experience with
Apache Kafka
(essential)
Background in event-driven systems and frameworks such as
Axon
Experience with cloud platforms, Linux, CI/CD, and DevOps tooling
Comfortable working in production-critical environments
4–5+ years' experience in platform, DevOps, SRE, or backend engineering roles
#J-*****-Ljbffr