Site Reliability Engineer 2
Hyderabad, Telangana, India
5m ago

Security boundaries have changed, and identity is the new perimeter. With such a shift, there is a strong emphasis on carefully analyzing, and in many cases, hitting Refresh on the components that comprise the enterprise identity and access management platforms as we know them.

Microsoft is seeking an identity focused service engineer to join the Digital Security & Risk Engineering (DSRE) organization as we undergo the journey to ensure that the right identities get access to the right resources at the right time, protected each step of the way.

With your experience in Active Directory (AD), Azure (including Azure AD), federation, strong authentication, and overall service health, you will help steer our own identity-centric digital transformation and support hundreds of thousands of employees as they seek to empower every person on the planet to accomplish more.

Identity Site Reliability Engineering is a hybrid, production-oriented discipline combining deep software engineering and systems skills.

  • As experts in reliability engineering, SREs are typically responsible for improving service availability, latency, scalability, performance, and efficiency;
  • they remove toil from service management, and take ownership for changing production state, monitoring, incident response, and capacity planning.

    SREs take a holistic approach across client and server : they understand the dependencies both between and within services and drive continuous improvement across them.

    They care about the user experience and improve business outcomes as the business unit itself defines them. SRE as a discipline improves things via engineering first and foremost, rather than being a primarily reactive function.


  • Provide technical engineering for a cross-functional, highly visible, operations team supporting the Identity platform for Microsoft’s corporate network.
  • Identify opportunities and drive the implementation of automation to improve service health, manageability, reliability and telemetry.
  • Own, triage, investigate and resolve service issues with an emphasis on broad communications, learning & teaching throughout the process
  • Read, write, configure, design, and script end-to-end service telemetry, alerting and self-healing capabilities for platforms.
  • Author functional and technical documentation. Communicate on a deeply technical level with product engineering, project management and operations teams to improve and optimize products, improve infrastructure, and evolve services.
  • Remain current on new technologies, methods and procedures including, but not limited to, coding practices such as Test Driven Development, Continuous Integration, and Continuous Deployment.
  • Qualifications

  • Strong knowledge on PowerShell / Scripting Knowledge, Windows Services infrastructure to automate day to day activities.
  • Programming knowledge of .net and C#.
  • Intermediate knowledge of SQL database.
  • Expert-level knowledge on identity technologies : Azure Active Directory, Active Directory, Federation Services, Replication
  • Working / understanding of IAM (identity access management) Tools, Like MIM
  • Experience in supporting engineering / supporting Multi-Factor Authentication
  • Extensive troubleshooting / debug experience
  • BS / BA in Computer Science or equivalent work experience is preferred
  • Ability to understand and use the underlying platforms, infrastructure, or frameworks for a given service / system / product.
  • Ability to understand, and reason about, interactions between complex systems.
  • Solid understanding of the specific ways in which distributed systems fail, as opposed to code monoliths.
  • Solid understanding of monitoring, alerting, and observability philosophies and best practices.
  • Ability to analyze, decompose, design, and integrate systems or systems architectures.
  • Ability to write tools, automation, and enhancements in arbitrary subcomponents of services / products to deliver and manage software in production (both around services, and within them).
  • Ability to context-switch and effectively balance many competing priorities within many different scopes : quarters, weeks, hours.
  • Ability to advocate for a well-defined customer experience, and bias towards implementing SLOs and measurements to achieve this.
  • Highly adaptable to changing circumstances; able to put the right amount of structure on ambiguous and fluid situations, perhaps with contradictory information.
  • Collaborating first and foremost; ability to exercise cross-team co-operation and influence.
  • Ability to have empathy for any customer or partner and keep a higher-level perspective.
  • Broadly speaking, an SRE team instantiates the feedback loop of continual improvement for a product.
  • Report this job

    Thank you for reporting this job!

    Your feedback will help us improve the quality of our services.

    My Email
    By clicking on "Continue", I give neuvoo consent to process my data and to send me email alerts, as detailed in neuvoo's Privacy Policy . I may withdraw my consent or unsubscribe at any time.
    Application form