Site Reliability Engineer

Job Description:

Cloudera is looking for an accomplished Site Reliability Engineer (SRE) to play a key role in advancing Cloudera’s product offerings in the cloud.

In this role, you will be at the intersection of two white-hot areas in today’s technical landscape: Cloud and Big Data.

Over the past few years, Cloudera has experienced tremendous growth, making us the leading contributor to the Hadoop ecosystem and a leading provider of enterprise solutions for Big Data.

The purpose of this team is to accelerate Cloudera’s next stage of growth by enabling our customers to unlock the full potential of the cloud and Hadoop.

On this team, you will be immersed in many exciting, innovative technologies and projects that will be critical to our customers’ data management needs in the cloud.

Key Responsibilities

Innovate and automate improvements to our Cloud Operations.

Identify and promote best practices and patterns for the setup, configuration, and management of databases, servers, networking, and storage systems.

Continuously review and enhance processes and operating procedures needed to maintain the most cost-effective enterprise-grade cloud infrastructure.

Participate in an on-call rotation alongside the engineers who build our production backends.

Track our cloud customer SLAs and be on call to ensure full conformance with our customer commitments.

Create and maintain complete and accurate documentation for operational audits, including security and compliance audits.

Requirements


3+ years industry experience in a DevOps, Site Reliability Engineering or Software Engineering role.

Experience programming with Python, Go, Java or similar languages.

Experience supporting production SaaS and meeting key metrics such as reliability and high availability.

Experience with performance analysis, troubleshooting, tuning, and capacity planning.

Experience with automating deployment of software to production instances and owning software releases.

Experience participating in an on-call rotation to help ensure services stay up and running.

Strong Linux and systems experience.

Experience with cloud technologies such as Amazon Web Services (AWS), Microsoft Azure, or Google Cloud Platform.

Experience with Terraform for cloud automation.

Bonus Skills

Experience with Spinnaker, Jenkins, or other orchestration and CI/CD tools.

Experience with Kubernetes, Docker, or related containerization technologies.

Experience with database systems including Postgres and MySQL.

Good knowledge of two of these languages: Python, shell, Java, Go, Groovy.

Experience deploying or managing large scale distributed Linux environments.
