Site Reliability Engineer JD
Engineering discipline with Software and System engineering skills to analyse, resolve and propose solutions for the large scare enterprise infrastructure.
Ensure uptime of the environment and proactively manage the infrastructure by managing the services and reaching out to the users at fast pace.
SRE role demands running better production systems by creating engineering solutions to address problem in the environment.
Focus on areas of automation by reducing manual work in the infrastructure and triage system integration problem to build a reliable solution.
SRE will needs to work on root cause analysis for problem and provide possible solutions. Proactively identify potential outages, keeping incremental process improvements that is key for the availability of infrastructure.
Success criteria for the role include problem solving, collaborating, divergent ideas for problems solving, ownership of issues are key for success.
As part of this role candidate is expected to review the existing trends in issues (priority incidents), analyse the reason and propose a solution to mitigate the issue.
Work in RCAs for the exisitng problems and provide technical insights on why the issue occured and what should be remediation
Shift Timings - we operate in 2 shifts 7 a.m. to 3 : 30 p.m. IST or 1 : 00 p.m. to 9 : 30 p.m. IST
Also expect candidates to be available for oncall support during offshift hours for any ECCs.
Any graduate with a min of 9 years’ experience
Working experience as SRE is desired