Role Overview:
This systems development position is responsible for designing and maintaining scalable software applications and cloud-based services.
This role requires a strong understanding of engineering best practices and the ability to work in an agile team environment to deliver features and improvements iteratively.
Key responsibilities include designing and building scalable backend systems, writing clean and efficient code, developing automation tools, enhancing observability, and supporting integration with AI-driven automation and observability platforms.
The ideal candidate will have 5+ years of software engineering experience in Python, Go, or Java, along with strong skills in API development, CI/CD pipeline improvement, and cross-functional collaboration.
Technical Requirements:
* Experience with cloud-native technologies and container orchestration.
* Strong knowledge of Linux/Unix systems and strong troubleshooting skills.
* Familiarity with monitoring and observability tools such as Prometheus, Grafana, Datadog, or Splunk.
* Understanding of key Internet protocols (TCP/IP, DNS, HTTP/S, TLS).
* Exposure to SRE practices including SLIs, SLOs, incident response, and postmortems.
Desired Skills:
* Experience working in compliance-sensitive environments.
* Exposure to chaos engineering, fault injection, or performance/load testing tools.
* Knowledge of AI/ML platforms and intelligent observability systems.