SRE Engineers - Web Performance
Gurugram, IN
1d ago


Being an effective Site Reliability Engineering (SRE) professional is as much about how you think as about your technical skills. The SRE role requires a mix of development and operations skills that combine software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems.

As a part of the SRE team, you will manage the complex challenges of scale that are unique to the client, while using your expertise in coding, systems, the complexity of operating systems, and large-scale system design.

SRE's culture of diversity, intellectual curiosity, problem-solving, and openness is key to its success. We bring together people with a wide variety of backgrounds, experiences, and perspectives.

Role and Responsibilities

  • Build deep knowledge of the business and understand the end-to-end customer journey.
  • Partner with stakeholders to improve design, visibility, availability, scalability, and performance of services.
  • Efficiently automate manual processes, deep dive into incidents, and facilitate blameless postmortems.
  • Improve alert management, decision making, analysis, and various optimization techniques by measuring data using standardized telemetry.
  • Support planned changes through deployment and post-deployment monitoring, and create new dashboards or alerts as needed to monitor recent changes in the system.
  • Evaluate open-source and vendor products, create proofs of concept (POCs), and migrate applications to the external cloud.
  • Adhere to crucial company controls necessary to meet internal or external audit requirements.
  • Exhibit inspirational leadership and build a talented, cohesive, result-oriented, and healthy team environment.
  • Build value-proposition presentations, case studies, and accelerators.

Skills and Experience

  • 6+ years of experience in software development, technical operations, and running large-scale applications.
  • 3+ years of experience in a multi-Scrum environment managed by a Scrum Master.
  • 3+ years of work experience with Chrome DevTools, Google Lighthouse, and RUM / synthetic monitoring using Blue Triangle.
  • 3+ years of experience working in Service Engineering, Support, or Operations.
  • Hands-on experience in supporting tasks related to web performance assessments for international websites.
  • Expertise in supporting international e-commerce applications.
  • Very good understanding of the IT Infrastructure Library (ITIL) framework and various IT Service Management (ITSM) tools available in the marketplace.
  • Good experience with project management tools such as Jira.
  • Strong working knowledge of SRE processes.
  • Expertise in providing engineering solutions for gathering or publishing data across distributed architectures for automation, monitoring, intelligent alerting, and self-healing.
  • Must have a deep appreciation of IT tools, techniques, systems, and solutions.
  • Excellent communication skills along with experience in driving triage calls with stakeholders.
  • Should have creative problem-solving skills for cross-functional issues amid changing priorities.
  • Should be flexible and resourceful in swiftly managing changing operational goals and demands.
  • Passionate about operational excellence and governance.
  • Good experience in managing escalations and taking complete responsibility and ownership of all critical issues through to logical closure.