Software Engineering Manager II, Site Reliability Engineering
At Google, we're building a team of software engineering managers to lead our site reliability engineering efforts.
About the Job
Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. Our SRE teams ensure that Google's services have reliability, uptime appropriate to users' needs and a fast rate of improvement.
Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. As an SRE engineer, you'll have the opportunity to manage complex challenges of scale, use your expertise in coding, algorithms, complexity analysis and large-scale system design.
We encourage a culture of intellectual curiosity, problem solving and openness in our organization. We bring together people with diverse backgrounds, experiences and perspectives, and promote collaboration, self-direction and mentorship.
Responsibilities
* Lead a team of Software/Systems Engineers on projects for users and be directly responsible for uptime.
* Own end-to-end availability and performance of key services and build automation to prevent problem recurrence. Automate response to all non-exceptional service conditions.
* Lead by example, mentor the team and establish credibility through quality technical execution.
* Manage on-call rotations across continents, using a follow-the-sun model.
* Design, write and deliver software to improve the availability, scalability, latency and efficiency of Google's services.
About You
You will be responsible for products globally, providing technical leadership to key projects and empowering and developing teams to do the same.
We are looking for individuals who have a Bachelor's degree in Computer Science or related field, or equivalent practical experience. You should have at least 8 years of experience with data structures or algorithms, and 5 years of experience with software development in one or more programming languages.
In addition, you should have 3 years of people management experience, and experience designing, analyzing and troubleshooting distributed systems. Experience working in computing, distributed systems, storage, or networking is also desirable.
What We Offer
We offer a range of benefits to support our employees, including competitive pay, comprehensive health insurance, retirement planning tools and paid time off.
Google is proud to be an equal opportunity employer. We are committed to building a workforce that is representative of the users we serve, creating a culture of belonging, and providing an equal employment opportunity regardless of race, creed, color, religion, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition (including breastfeeding), expecting or parents-to-be, criminal histories consistent with legal requirements, or any other basis protected by law.