Senior Software Engineer (DevOps) – Observability Platform
About Revolut
People deserve more from their money. More visibility, more control, and more freedom. Since 2015, Revolut has been on a mission to deliver just that. Our powerhouse of products (including spending, saving, investing, exchanging, travelling, and more) helps our 65+ million customers get more from their money every day.
About The Role
Our Technology team builds the systems and experiences that keep Revolut moving. From the infrastructure behind our innovative app to the features used by millions of people around the world, they bring sharp thinking, speed, and a focus on meaningful impact to everything they do. We are looking for a Senior Observability Platform Engineer to create a platform that continuously monitors thousands of applications, databases, and other systems. You’ll deliver a product solution that engineering teams will rely on for crucial information about logging, metrics, and tracing.
What You'll Be Doing
Designing, implementing, and assembling scalable and resilient observability solutions across logs, metrics, and traces, leveraging existing market solutions or building from scratch
Building robust APIs and data pipelines to ingest, process, and expose observability data to product teams
Collaborating closely with product teams to understand their observability needs and integrate solutions that empower them to monitor, alert, and debug their components effectively
Optimising the observability infrastructure for performance, accuracy, cost‑effectiveness, and an exceptional user experience
Developing and maintaining tooling to automate the onboarding/sunsetting of components to the observability platform and streamline data collection
Contributing to the strategic roadmap of the observability platform, identifying and implementing new features and improvements
What You'll Need
7+ years of experience as a software engineer, with 3+ years focused on building and maintaining observability platforms or highly distributed systems
Familiarity with monitoring, alerting, and incident response best practices
Expertise in designing and implementing APIs and data pipelines for high‑throughput, real‑time data ingestion
A practical understanding of distributed systems and their unique observability challenges
Hands‑on experience with core observability tools, such as Prometheus, Grafana, Loki, the ELK stack (Elasticsearch, Logstash, Kibana), Jaeger, and OpenTelemetry
Experience with containerisation and orchestration technologies (Docker, Kubernetes) and infrastructure as code tools (e.g., Ansible, Terraform)
Proficiency in Python as your primary engineering language
Nice to Have
Previous experience in a DevOps, SRE, or developer experience role
Experience with multiple cloud platforms (AWS, GCP, Azure) and their native observability services
Contributions to open‑source observability projects
A track record of prototyping and sketching new solutions to complex problems
Commitment to Diversity and Inclusion
We launched our inaugural D&I Framework in 2021 to help us thrive and grow every day. We encourage applications from people with diverse backgrounds and experiences, building a multicultural and inclusive workplace that creates innovative products and services for our customers.