Job Opening
Site Reliability Engineer - SRE
A highly skilled Site Reliability Engineer is required to collaborate with the team in designing and developing a comprehensive approach to tooling, monitoring, control, self-service reporting, and analysis.
The ideal candidate will have expertise in monitoring solutions, scripting languages such as PowerShell and Python, and experience with cloud platforms like Azure, AWS, or GCP.
You will be responsible for architecting and developing monitoring solutions for alert response and troubleshooting, leading technical projects, and working hands-on with scripting, tooling, and automation for continuous operations.
What You Get
* 100% remote work - 100% of the time
* Excellent salary
* Unrivalled benefits
* Opportunity to develop and try new things
* WFH allowance
Skills & Responsibilities
* 3 years in a technical role: DevOps, Software Engineering, or Systems Engineering
* Experience designing, installing & configuring monitoring solutions for 24x7 environments
* Strong cloud experience, ideally Azure (or AWS, GCP)
* Scripting: PowerShell/Python
* Monitoring tools: OpenTracing, OpenTelemetry
* APM: Elastic, Datadog, New Relic
* Understanding of networking
* Architect & develop monitoring solutions for alert response & troubleshooting
* Technical lead, hands-on scripting, tooling & automation for continuous operations
* Triage incidents & document steps to resolve