About the Role
We are seeking a Senior Site Reliability Engineer to join our team.
Key Responsibilities
* Act as a senior member of the SRE team, supporting activities including the backlog and workload of the team, scoping requirements, peer review of code, providing feedback to the rest of the team.
* Represent the team in management and stakeholder meetings.
* Ensure best practices are kept, and suggest improvements to our development processes where you see gaps.
Technical Skills and Qualifications
* 5+ years experience in an engineering role responsible for supporting a scaled SaaS platform running on Linux in a cloud environment.
* Experience working with high-performance systems, and solving complex engineering problems at scale (our platform processes ~100 Billion messages per year).
* Understanding of distributed systems design – including asynchronous tasks, event driven architecture, scheduling, caching and queue processing.
* The capability to carry out performance tuning from the API to Application to Database layer of the platform.
Benefits
* Strong communication skills and ability to explain complex technical solutions simply to others.
* Strong understanding of PHP, GoLang, MySQL, Opentelemetry, Prometheus.
* Experience with Cloud and DevOps technologies (AWS, Terraform, CI/CD etc.).
* Experience with specific technologies in our stack: Clickhouse, Kafka, Pulsar, Python.