Software Engineer III- SRE
Walmart Global Tech India
Chennai, India
1d ago

Your Responsibility

  • Collaborate closely with development and business teams.
  • Partner with extended site reliability teams and service assurance teams to address upstream and downstream events.
  • Proficient at handling and resolving Incidents and Events.
  • Drive Problem resolution and build user stories.
  • Debug defects as well as develop dashboards using modern monitoring tools (e.g. Dynatrace, Splunk) to enable reduction in detection time.
  • Effectively lead a bridge and escalate as part of a larger incident management process.
  • Function as member of a DevOps Team following the agile practice to provide design inputs and operational standard methodologies.
  • Provides monitoring of key application performance and capacity constraints to mitigate potential incidents before they impact the customer.
  • Conduct data mining / analysis activities to provide meaningful insights to support issue identification, resolution, etc.
  • Monitor and measure accuracy of inbound data feeds, data conditioning processes and work with Engineering leaders to identify and drive resolution of quality gaps.
  • Agility in working operations, development & data analytics
  • What do we expect from you?

  • Bachelor’s Degree in computer science, computer science engineering or related technical experience.
  • Experience with identifying application / infrastructure risks and mitigation strategy and the ability to work with a team to ensure risks are mitigated.
  • Experience with debugging techniques for root cause analysis of issues.
  • ITIL® working knowledge : Event, Incident, Release, Problem and Knowledge Management.
  • Experience in one or more of the following : programming languages, networking, Linux / Windows, middleware, databases, cloud technologies.
  • Proficiency in Shell / C# / Python scripting,
  • Solid understanding of distributed architectures, Java frameworks and Cloud technologies like Azure, GCP and Kubernetes
  • Effectively communicate to business and leadership on restoration
  • Demonstrate the ability to collaborate and contribute to established goals.
  • Influence team members with creative changes and improvements by challenging status quo and demonstrating risk taking.
  • Basic programming experience (SQL, any software programming language) is required
  • Strong Communications, organisation & multitasking skills
  • Initiative to learn new things
  • Superior organisation, communication, interpersonal and leadership skills
  • Must be a proven performer and team player that enjoy challenging assignments in a high-energy, fast growing
  • Must be a self-starter who can work well with minimal guidance and in fluid environment
  • Must be excited by challenges surrounding the development of massively scalable & distributed system
  • Agility and ability to adapt quickly to changing requirements and scope and priorities
  • Preferred

  • Experience in collaboration with multiple teams across geography
  • Experience with presenting to and influencing at all levels within a large, cross-functional organisation.
  • Experience in supply chain domain
  • Experience of working in massively large-scale data
  • Experience of building products that are powered by data and insights
  • Experience in debugging and troubleshooting large scale and cross system platform
  • Report this job

    Thank you for reporting this job!

    Your feedback will help us improve the quality of our services.

    My Email
    By clicking on "Continue", I give neuvoo consent to process my data and to send me email alerts, as detailed in neuvoo's Privacy Policy . I may withdraw my consent or unsubscribe at any time.
    Application form