Job Opportunity:
We are seeking a skilled professional to fill the position of Cloud Technical Site Reliability Engineer. This role will play a crucial part in our company's digital transformation journey, enabling us to provide a premier service to our customers.
As a Cloud Technical Site Reliability Engineer, you will be responsible for collaborating with cross-functional teams to implement and deploy new features and enhancements. Your focus will be on ensuring reliability and performance standards are met. You will also automate repetitive tasks and processes to improve efficiency and reduce manual workloads. Additionally, you will monitor systems proactively, identifying and resolving any performance bottlenecks, availability issues or monitoring gaps.
Key responsibilities include owning the Problem Management process within your team, performing proactive problem management and conducting post-incident analyses to identify root causes and implement preventive measures to avoid future incidents. You will also support the Change Management and UAM process as required. Furthermore, you will create and maintain documentation such as operational procedures and Technical Operations Manuals.
Requirements:
* Strong knowledge of Linux/Unix systems and command line tools
* Proficiency in scripting languages such as Python and Shell
* Experience with configuration management tools like Ansible or Chef
* Experience with code management and deployment tools such as Bitbucket and Jenkins or native tooling
* Good understanding of networking and communication systems (TCP/IP, HTTP, DNS, etc.)
* Knowledge of containerization technologies (Docker, Kubernetes) and orchestration tools
Benefits:
* Work from home flexibility
* 23 days annual leave
* Excellent pension contributions
* Substantial health insurance contribution
* Employee assistance program
* WebDoctor
* Financial wellbeing coaches