Meta is seeking a Production Systems Engineer to join our Hardware Design and Release to Production (HDRTP) team in Dublin, Ireland. Our servers and data centers are the foundation upon which our rapidly scaling infrastructure operates efficiently to deliver Meta's services globally. The HDRTP team is responsible for the end-to-end Hardware Lifecycle of all Meta servers, from exploration and development to production health. HDRTP Engineers work closely with Production Engineering teams, Enterprise Networking, Hardware Designers, Networking Teams, Manufacturers, Vendors, Datacenter Operation teams and New Product Introduction teams to ensure the smooth operation of systems across the planet.We encounter problems from the very smallest of scales (errors occurring at the microscopic scale, within single registers of a CPU) up to the very largest - deploying solutions to Meta's millions of devices globally. We look for people with proven experience of finding solutions to complex issues, embracing ambiguity and driving impact, who want to tackle the hardest problems in the domain.Typically we will hire engineers from backgrounds such as Site Reliability Engineer (SRE), Software Engineer, Systems Engineer, Systems Development Engineer, DevOps Engineer, Systems Administrator, or similar.
ResponsibilitiesBuild and develop tooling solutions to automate business critical processes in service of managing the health of the Meta production hardware fleetTroubleshoot, diagnose and root cause system failures, working with key partners to identify and deliver solutionsProactively identify opportunities to fix or enhance tooling, hardware and processesBuild subject matter expertise in one or more of the specialist areas covered by the RTP (Release To Production) team in DublinScientific approach to troubleshooting, root-cause analysis and investigation
Minimum QualificationsAn engineering degree is typical, or related technical discipline, or equivalent work experience4+ years experience coding in a higher-level language (Python, PHP, Java, Go, Rust, C++)Experience building, maintaining and debugging production services or platforms - usually (but not necessarily) in a linux/unix environmentKnowledge of server architecture and components across Compute/Storage/AI Systems/Networking
Preferred Qualifications4+ years experience coding in a higher-level language (Python, PHP, Java, Go, Rust, C++)Experience managing and debugging hardware platforms in a cloud environmentDemonstrated ability to drive projects to successful business outcomes