Job Description
We are seeking an experienced Site Reliability Engineer to join our team. The ideal candidate will have a passion for automation, problem-solving, and ensuring the reliability and performance of complex systems.
About This Role
* This role is responsible for ensuring the reliability and performance of our infrastructure through detection, analysis, and prevention of issues.
The successful candidate will work closely with software engineers to advise on best practices for resilient code and review changes before deployment.
-----------------------------------
Requirements
SRE Experience: 1-3 years
Distributed Systems Architecture: Understanding of distributed system architecture; exposure to common design patterns, reliability, and scaling.
Familiarity with Infrastructure Design: Basic understanding of infrastructure design: Familiarity with the operational trade-offs of network, storage, and RPC serving designs.
Languages & Tools: Proficiency in at least one programming language (Python or Go). Familiarity with Docker & Kubernetes a plus!
Skills We're Looking For: