As Team Xbox, we are on a mission to bring the joy and community of gaming to everyone on the planet. We deliver on that vision by putting players at the center, enabling you to play the games you want, with the people you want, anywhere you want.

Xbox Player Services (XPS) is at the heart of our ambition to reach billions of players across the globe, ensuring that every player feels included and engaged across Xbox. We do this through our commitments to amplifying the voices of our players, building trusted relationships with all our players, and by delivering foundational services and critical operations for Microsoft Gaming.

As part of XPS, the Xbox Services & Operations (XSO) team mission is to build and maintain services enabling increased customer engagement plus player growth across the Microsoft Gaming eco-system. A significant part of this responsibility involves adopting and/or creating Cloud Solutions. We leverage Azure Cloud platform capabilities; along with investing in the engineering of complementary Observability and Manageability solutions to run our Gaming business. We use this approach to apply consistent and scalable solutions across existing Xbox services, aiding in broad adoption across our Gaming business partners and consulting with acquisitions to assure their migrations to Azure Cloud are executed successfully with high degrees of Operational Excellence. The end objective being assurance that all systems within our Microsoft Gaming portfolio are operating on modern cloud capabilities thus eliminating the burdens and risks associated with dedicated infrastructure and/or legacy solutions. This role requires developing strong working relationships across business partners in Gaming, aiding in creating unified cloud adoption plans and ultimately assisting with consultation or engineering execution of the migrations.

The cloud engineering team in XPS is responsible for running infrastructure and platform engineering experiences at scale for Xbox services. We embrace the discipline of infrastructure as code to build secure and scalable platforms and services in collaboration with our partners across Gaming. We use the power of cloud to expand our experiences globally and solve business complexities. If you have a passion for building scalable, observable, automated systems and services look no further and give us a try.

We operate in an SRE model in which engineers get to experience the full engineering lifecycle from feature design through operation and support. We follow an agile, iterative, and quality driven approach in which we prioritize providing the best value to our customers and our business, finishing what we started, leveraging each engineer’s strengths, and learning and growing in each iteration. We prioritize reliability in everything we build and follow blameless engineering culture.

We’re looking for a talented Site Reliability Engineer who has a combination skillset of systems engineering and software development. If you have a history of designing, supporting, and owning services at internet scale as well as excellent communication and collaboration skills, then we would like to talk to you.


  • Collaborate with service development teams across Gaming to develop infrastructure and platform solutions which meet our global players’ high expectations of always available and reliable services

  • Contribute to a set of best patterns and practices for deploying cloud-based infrastructure as code in a secure, reliable and efficient manner

  • Provide subject matter expertise in troubleshooting issues impacting the performance, security, efficiency and reliability of cloud based services

  • Work in cross group teams on integration and migration solutions involving multi-cloud and hybrid on prem services



  • 10+ years demonstrated experience in designing and developing cloud based, internet scale services/solutions.

  • 5 years' experience of cloud native technologies and architectures


  • BA/BS or advanced degree in computer science preferred but not required

  • Experience with multiple cloud providers – Azure, AWS, GCP, etc

  • Solid understanding of traffic management and networking concepts

  • Excellent communication skills, including the ability to write concise and accurate technical documentation, communicate technical ideas to non-technical audiences

  • Demonstrated ability to impact/influence engineering and project teams, and to provide technical leadership in the decision making process

  • Understand Operational and Application security principles and patterns

  • High aptitude for recognizing and acting upon areas of opportunity to replace manual work with automation

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements.