Staff System Engineer - DevOps
Walmart Labs
Bangalore, Karnataka, IN
3d ago
source : Instahyre

Responsibilities :

  • Lead trouble shooting and triaging issues related to data pipeline failures or slowness, built using Map Reduce, hive or Spark to ensure SLA adherence.
  • These may be batch as well as streaming processes.
  • Resolve issues related to several data platform components and open source database solutions like Presto / Druid or cloud native components.
  • Resolve issues related to commercial tools and products, which are an integral part of platform.
  • Be a gate keeper to ensure sanity of production systems Builds tools to continuously monitor and alert platform components & data pipelines.
  • Continually improve CI / CD tools, processes and procedures.
  • Participate in ongoing design, implementation, and maintenance of systems and tools across our data platform.
  • Write and maintain infrastructure documentation.
  • Own production incidents / issues and provide level 2 response to infrastructure incidents and alerts.
  • Work with third-party vendors to resolve infrastructure issues.
  • Identify right open source tools to improve infrastructure by performing research.
  • POC / Pilot and / or interacting with various open source forums.
  • Deploy and monitor products on Cloud platforms.
  • Mentor junior team members.
  • Promote and support company policies, procedures, mission, values, and standards of ethics and integrity.
  • Requirements :

  • Bachelor's / Master's Degree with 3+ yrs of experience in Computer Science or related streams.
  • 6+ years of experience of managing data platform, ETL pipelines
  • Strong understanding of internals of at least 1 distributed processing framework like Map reduce, Hive, or Spark.
  • Strong understanding of MPP Databases, previous experience of administrating MPP DB will be added advantage.
  • Strong understanding of continuous integration, deployment and operations concepts.
  • Experience with code repositories (Git) and continuous integration tools (Jenkins, Maven).
  • Software provisioning and deployment automation tools (Ansible).
  • Excellent knowledge of Linux system.
  • Experience with configuration management.
  • Experience with a UNIX shell scripting languages such as Bash, Ruby, Perl or Python.
  • System administration exposure.
  • Experience in building scalable / highly available distributed systems in production a plus.
  • Ability to work with distributed teams in a collaborative and productive manner.
  • A self-motivated learner and builder with strong customer focus and obsession with quality.
  • Apply
    Add to favorites
    Remove from favorites
    Apply
    My Email
    By clicking on "Continue", I give neuvoo consent to process my data and to send me email alerts, as detailed in neuvoo's Privacy Policy . I may withdraw my consent or unsubscribe at any time.
    Continue
    Application form