OverviewAt Klaviyo, we value the unique backgrounds, experiences and perspectives each Klaviyo brings to our workplace. We believe everyone deserves a fair shot at success and encourage candid consideration even if you’re a close match. This role is Senior Site Reliability Engineer – Site Reliability Engineering (Dublin).As a senior Site Reliability Engineer, you’ll ensure Klaviyo’s critical platforms are reliable, scalable, and sustainable while enabling rapid product development. We treat reliability as a core product feature and use software engineering to solve complex systems and operational challenges. Our work spans security, infrastructure, and software development, requiring us to understand systems and engineering. We build complex, foundational solutions that must be extremely reliable, secure, and performant at global scale. Our charter is to build and operate foundational services and infrastructure, define clear reliability objectives, reduce operational toil through automation, and continuously improve systems based on real production learnings. The work is highly visible and directly impacts how Klaviyos build software and how customers experience Klaviyo every day.How You’ll Make An Impact
ResponsibilitiesBuild and operate foundational, security-critical services with a strong emphasis on availability, scalability, latency, and fault toleranceApply software engineering principles to automate infrastructure, reduce operational toil, and improve system reliability at scaleDesign, implement, and evolve systems using SRE best practicesDefine and refine SLIs, SLOs, and error budgets to guide engineering decisionsImprove observability, alerting, and incident response to reduce mean time to detection and recoveryParticipate in on-call rotations with a focus on sustainable operations and automatic remediationsPerform quantitative analysis to understand system behavior, capacity constraints, and scaling limitsIdentify systemic risks and reliability bottlenecks and drive long-term, preventative solutionsCollaborate closely with product, platform, and security engineers to influence architecture early and ship reliable systemsMentor and pair with other engineers, helping raise the bar for reliability, operational maturity, and engineering excellence
Who You AreYou are a cloud-native, platform-focused SRE who uses software to build and operate reliable production systems at scale.You write and maintain production-quality code (e.g. Python, Go, or similar) to build internal platforms, automate operations, and improve system reliabilityYou have built, deployed, and operated distributed, cloud-native systems and understand failure modes such as partial outages, dependency failures, resource saturation, and cascading impactYou have experience operating containerized workloads and platforms (e.g. Kubernetes) in production, including deployment strategies, scaling behavior, and service networkingYou are comfortable participating in on-call rotations and diagnosing production issuesYou have designed and operated observability systems and know how to build actionable alerts that reflect real user and service impactYou apply SRE concepts such as SLIs, SLOs, error budgets, and burn-rate–based alerting to guide engineering decisions and operational responseYou have hands-on experience with infrastructure as code and declarative configuration (e.g. Terraform, Kubernetes manifests, policy-as-code)You have performed capacity planning, load testing, and performance analysis for distributed services and platformsYou routinely contribute to post-incident reviews and drive concrete, code-focused follow-up actions that prevent recurrenceYou are comfortable reviewing and contributing to technical designs, platform APIs, operational runbooks, and system documentationYou’ve already experimented with AI in work or personal projects, and you’re excited to dive in and learn fast. You’re hungry to responsibly explore new AI tools and workflows, finding ways to make your work smarter and more efficient.
Nice to haveExperience supporting security-critical platforms or building internal security toolingFamiliarity with identity, access management, secrets management, or policy enforcement systemsExperience operating systems at scale in cloud environments (AWS preferred)Background in resilience testing, fault injection, or chaos engineeringA strong comprehension of algorithms and data structures at scale
Tech StackKlaviyo’s platform is primarily built with Python and React and runs on AWS. Engineers join us from a wide range of technical backgrounds and are supported in learning our stack.Python / Django / FastAPIMySQL / Redis / MemcachedRabbitMQ / Celery / Apache Kafka / Apache PulsarAWS / Terraform / Kubernetes
Location & Work ModelThis role is based in Dublin, Ireland and follows a hybrid working model. Klaviyo supports work authorization and relocation for this position.We are committed to building inclusive teams and encourage applications from candidates of all backgrounds.
Additional InformationKlaviyo is growing fast and we have openings for all skill levels across all of our teams. Learn more about our engineering culture at https://klaviyo.tech. We use Covey as part of our hiring and/or promotional process. For jobs or candidates in NYC, certain features may qualify it as an AEDT. We began using Covey Scout for Inbound on April 3, 2025. Our salary range reflects the cost of labour in the country where the job post is advertised. The base salary offered for this position is determined by several factors, including the applicant’s job-related skills, relevant experience, education or training, and work location. In addition to base salary, our total compensation package may include participation in the company’s annual cash bonus plan, variable compensation (OTE) for sales and customer success roles, equity, sign-on payments, and a comprehensive range of health, welfare, and wellbeing benefits based on eligibility. Your recruiter can provide more details about the specific salary/OTE range for your preferred location during the hiring process. Base Pay Range In Local Currency: €92,000—€138,000 EUR
#J-18808-Ljbffr