About this Role
This is a leadership position where you will lead a team of engineers on projects for users, focusing on uptime and availability. You will be responsible for designing, writing, and delivering software to improve the scalability, latency, and efficiency of Google's services.
Key Responsibilities
* Lead a team of Software/Systems Engineers on projects for users and be responsible for uptime.
* Own end-to-end availability and performance of key services and build automation to prevent problem recurrence.
* Manage on-call rotations across continents, using a follow-the-sun model.
About Us
We are an equal opportunity employer committed to building a workforce that is representative of our users. We provide a culture of belonging and an inclusive environment where everyone can thrive.
About Site Reliability Engineering (SRE)
SRE combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. Our focus is on ensuring that our services have reliability, uptime appropriate to users' needs, and a fast rate of improvement.
Responsibilities
1. Design, write, and deliver software to improve the availability, scalability, latency, and efficiency of Google's services.
2. Lead a team of Software/Systems Engineers on projects for users and be responsible for uptime.
3. Manage on-call rotations across continents, using a follow-the-sun model.
Required Skills and Qualifications
* Bachelor's degree in Computer Science or a related field, or equivalent practical experience.
* 8 years of experience with data structures or algorithms.
* 5 years of experience with software development in one or more programming languages.
* 3 years of people management experience, and experience designing, analyzing, and troubleshooting distributed systems.
Preferred Qualifications
* Experience working in computing, distributed systems, storage, or networking.
* Expertise in designing, analyzing, and troubleshooting large-scale distributed systems.
* Ability to debug, optimize code, and automate routine tasks.