Member of Technical Staff
Bangalore, IN

Mission :

Ensure that VMware Cloud on AWS operates at high reliability and performance at scale with minimum human touch for our customers.

The VMC on AWS Site Reliability Engineering team is looking for quality software developers with a diverse set of experiences and skill sets to build and run the exciting new VMware Cloud on AWS services.

As an SDDCaaS SRE, you will provide service insight, response, and service management to maintain high service reliability with low touch through extensible services / platforms, standardized processes, data insights, and product input.

Responsibilities for Service Health, Optics, Service Management, Orchestration, and Remediation & Troubleshooting

By joining our diverse team, you will be responsible for the VMC on AWS service and all aspects of it in production, including the user experience.

This includes designing and developing solutions to improve service optics, availability, performance, and security. You will build services that enrich monitoring and automation through data analytics and applied tooling (ML, clustering, anomaly detection, AI, etc.).

Through Service Response (incident management, problem management, and participation in the globally staffed Service Watch), you will use metrics and optics systems to ensure performance, scalability, and stability.

You will ensure proper metrics are implemented to measure service health and drive error budgeting. Through the partnerships you foster with the development teams, you will support new features, services, and releases, and become an authority in our services.

You will focus on building solutions to better operate our services at scale: auto-remediation, reducing manual intervention during production incidents, service metrics, optics, monitoring, process automation, data integrity, and service turn-up / turn-down.

Requirements and Preferred skills :

  • Experience developing automation and scripts in Python.
  • Experience administering, operating, troubleshooting, and monitoring cloud infrastructure.
  • Experience managing and troubleshooting vSphere, NSX, and other VMware products.
  • Good troubleshooting skills in applications, storage, networking, and SaaS services.
  • Willingness to be part of a 24x7 service watch rotation that uses a follow-the-sun model.

Minimum Qualifications :

  • BS in Computer Science or a related technical field, or equivalent industry experience.
  • 5+ years of experience as a DevOps engineer, operations engineer, or SRE (development for large online services).

Preferred qualifications :

  • Experience with container orchestrators (Kubernetes, Docker Native Orchestration, Mesos, Docker Swarm).
  • Experience with Linux administration.

Category : Engineering and Technology

Subcategory : Software Engineering

Experience : Manager and Professional

Full Time / Part Time : Full Time

Posted Date : 2021-06-23
