Job Summary
Site reliability engineering (SRE) combines software and systems expertise to build and run large-scale, massively distributed, fault-tolerant systems.
* Lead a team of software and systems professionals on projects for end users.
* Own end-to-end availability and performance of key services and develop automation to prevent problem recurrence.
* Design, write, and deliver software to enhance the availability, scalability, latency, and efficiency of services.
Requirements
* Bachelor's degree in Computer Science or a related field, or equivalent practical experience.
* 5 years of programming experience in one or more languages.
* 3 years of experience with Unix/Linux systems internals and administration, or with networking (e.g., TCP/IP, routing, network topologies, SDN).
* 3 years of people-management experience.
Key Skills
* Software development.
* Systems administration.
* Team leadership.
* Problem-solving.
What You Will Do
This role will involve:
* Developing and maintaining software applications.
* Collaborating with cross-functional teams to identify and resolve technical issues.
* Leading and mentoring junior engineers.
About Us
We are committed to providing a supportive and inclusive work environment that fosters creativity, innovation, and growth.