Our goal is to ensure the reliability, scalability, and performance of globally distributed, mission-critical systems.
Key Responsibilities
* Design and implement automation solutions to streamline system maintenance, monitoring, and operational workflows.
* Manage large-scale production outages, lead incident response, and drive efficiency improvements.
* Develop tools and software to automate repetitive tasks, reduce manual intervention, and enhance system reliability.
Required Skills and Qualifications
* Experience using AI and large language models (LLMs) to improve operational efficiency.
* Understanding of standard networking protocols and components.
* Proficiency in Java, JEE, REST, Swift/Objective-C, database schema design, and data access technologies.
Benefits and Opportunities
As a key member of our team, you will have opportunities for growth and development, as well as a chance to make a meaningful impact on the company's success.
Diversity, Equity, and Inclusion
We value diversity, equity, and inclusion, and strive to create an environment where everyone can thrive.