Senior Systems Reliability Engineer (SRE)
Employment Type: Full-time, Permanent
Location: Ireland (Remote)
About the company
A global fintech organisation operating mission‑critical trading technology is expanding its engineering presence in Ireland. The team builds and supports high-performance, low-latency systems used across financial markets, with a strong engineering culture and focus on reliability, fairness and technical excellence.
They are now expanding their engineering presence in Ireland and hiring their next Senior Systems Reliability Engineer to support mission‑critical trading systems.
The Role
As a Senior SRE, you’ll join a highly technical, engineering‑driven environment responsible for the reliability, performance, and operational excellence of a large-scale, bare-metal trading platform. This is a hybrid role combining systems engineering, observability, automation, and real‑time operational support.
You’ll work across the full stack – (Linux, networking, applications, hardware) and play a key role in building a follow‑the‑sun support model with teams in the U.S. and Europe.
What You’ll Do
Own the technical operations of trading systems running on bare‑metal infrastructure
Monitor, troubleshoot, and resolve issues across OS, network, hardware, and application layers
Build and improve automation, tooling, and configuration management (Ansible or similar)
Develop and maintain observability dashboards, alerts, and telemetry pipelines
Participate in deployments, start‑up/shutdown procedures, and change management
Contribute to engineering projects such as OS tuning, kernel‑level optimisation, and performance improvements
Collaborate with platform, development, and market operations teams
Document processes, mentor teammates, and promote operational best practices
What You Will Bring
Must‑Have Technical Skills
Strong Linux experience (comfortable with system processes, logs, services, troubleshooting)
Hands‑on scripting with Python or Bash
Experience with Ansible or similar configuration management tools
Solid understanding of networking fundamentals: TCP/IP, routing, multicast
Experience supporting large, distributed, or high‑availability systems
Must have technical skills in observability; Prometheus, Grafana, Splunk, Graylog, Telemetry, alerting systems (e.g. Alertmanager), log pipelines
Experience with Bare‑metal deployments
KDB experience
Familiarity with Arista/Cisco switches, Corvil, Solarflare/Mellanox NICs
Understanding of trading systems
Who You Are
This matters as much as the tech.
You are someone who:
Works well independently and in distributed teams
Communicates clearly and calmly
Is collaborative, low‑ego, and easy to work with
Can follow processes while still thinking critically
Learns quickly and enjoys understanding complex systems
Why Join?
Highly technical environment with deep engineering challenges
Exposure to kernel tuning, networking, automation, and performance optimisation
Flexible working arrangements with opportunity for travel
#J-18808-Ljbffr