Site Reliability Engineer II
Microsoft
United States, Washington, Redmond
Nov 26, 2024
Overview
Come build and maintain the world's computer as a member of the Microsoft Capacity Infrastructure Services team in Azure Core. The team ensures new servers are brought online (capacity buildout) to enable Azure customers to leverage the latest offerings, see the illusion of infinite capacity, and grow the Azure business efficiently at hyperscale.
As a Site Reliability Engineer II, you'll work with a breadth of partners across Microsoft including developers in service teams, hardware engineers, network engineers, datacenter technicians, supply chain managers, and business leaders to rapidly debug and resolve issues delaying this carefully orchestrated buildout sequence. You'll drive continuous improvements with these teams to prevent repeats and address common classes of issues across the Azure software stack through design reviews and problem management.
Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Responsibilities
You will participate in onboarding, code/design reviews, and regular meetings with the engineering teams that develop and manage products and services.
You will independently develop code or scripts that automate the performance of repetitive and easily scalable operations processes.
You will design, develop, and maintain telemetry pipelines and monitoring tools that detail operations metrics.
You will analyze data and use data to drive improvements with engineering teams.
You will respond to incidents during regular on-call rotations.