Site Reliability Engineer
CareStack™ - Dental Practice Management
Trivandrum, Kerala, India
3m ago
source : Linkedin

Company Overview

CareStack is a complete cloud-based dental software solution for scheduling, clinical, billing, patient engagement, and reporting needs of dental offices of any size - whether it's a single location or a large multi-site DSO with hundreds of locations.

The company was founded in 2015 and the commercial launch was done in early 2018. Since then, more than 1000 offices have chosen CareStack as their single source of truth.

This is the fastest growth till date in the dental practice management software market, dominated by 100 year old distribution companies.

More about CareStack

  • Rated by independent B2B software reviews and research analysts as the most modern, innovative and customer experience focused company in the space with the fastest growth in the segment.
  • Important strategic go to market partnerships with dental industry leaders like Delta Dental, Darby Dental and several others.
  • Venture backed with over $60M raised from leading financial and strategic investors.
  • HQ'd in Orlando, FL with offices in Minnesota, Bangalore, Trivandrum and Cochin.
  • Role Overview

    CareStack seeks to hire Site Reliability Engineer (SRE) to be part of the monitoring team and we count on our SREs who focusses on the health of our production system.

    You will be proactive in developing tools to monitor the system at all levels, anticipate issues and implement solutions before they impact the users.

    When production issues outages, slowness, processing delays, errors and failures do occur, you will take ownership to resolve them quickly.

    Key responsibilities :

    Own the Production Environment Monitoring

    Run the production environment by monitoring availability and taking a holistic view of the system health. Improve reliability and quality of the production infra.

    Gather & analyze metrics to assist in fault finding and getting it resolved. Provide primary operational support and engineering for the Product.

    Troubleshooting Support Escalation

    Site reliability engineers may have to spend a considerable amount of time fixing cases related to support escalation. They should fully know critical issues to route support escalation incidents to concerned teams.

    Critical support escalation cases, however, go down as site reliability engineering operations mature.

    On-call Process Optimization

    Site reliability engineers’ job will involve the implementation of strategies that increases system reliability and performance through on-call rotation and process optimization.

    You will also have to add automation for improved collaborative response in real-time, besides updating documentation, runbook tools, and modules to ready team for incidents.

    Documenting Knowledge

    As Site reliability engineers take part in on-call duties, technical production support they gain a sustainable historical knowledge and to ensure seamless flow of information between teams, you will have to document the knowledge gained.

    Team Player

    You should proactively communicate to resolve dependencies within and outside the team. You should build the ability to preempt conflicts within the team and resolve them if required.

    As a knowledge giver, you should foster a culture of learning and sharing knowledge within and across squads. You should engage in conversations that are objective and data driven.

    Strive and push your team in this direction. Understand organizations culture code and streamline conversations and activities that will further instil this code.

    Mentor and coach new additions to your team.

    Value / Mission Alignment

    Be a champion for CareStack within the product team. Help drive workplace and cultural norms within your business unit that align with CareStack company values.

    As an SRE you will :

  • Be on a PagerDuty rotation to respond to the product’s availability incidents.
  • Use your on-call shifts to prevent incidents from ever happening.
  • Develop metrics, monitoring, and alerting to observe the health of the production system.
  • Make monitoring and alerting alerts on symptoms and not on outages.
  • Document every action so your findings turn into repeatable actions and then into automation.
  • Debug production issues across services and levels of the stack.
  • This role may be for you if you

  • Have an insatiable itch to join and the courage to scale an early-stage technology company.
  • Have 3+ years of extensive experience in managing complex, high-volume applications in a production environment.
  • Have an enthusiastic, go-for-it attitude. When you see something broken, you can’t help but fix it.
  • Have Hands on experience in analyzing hardware and software logs and metrics.
  • Have an urge to collaborate and communicate asynchronously.
  • Have an urge for delivery quickly and iterating fast.
  • Are possessing a data-driven approach to problem solving and communications.
  • Can balance urgency with sound decision making and careful execution.
  • Can balance multiple assignments in a fast-paced environment.
  • Possess coding experience beyond simple scripts.
  • This role may not be for you if you

  • Don’t have the itch to be creative and not a fan of continuous improvement in self, process, and the system.
  • Don’t have the fire in you to be a key contributor in the process of building highly available, scalable, and reliable systems.
  • Don’t have the trait to be transactional in discussions, accept your mistakes and appreciate better ideas from your peers or reportee.
  • Haven’t developed the habit of doing objective conversations that are data driven.
  • Report this job

    Thank you for reporting this job!

    Your feedback will help us improve the quality of our services.

    My Email
    By clicking on "Continue", I give neuvoo consent to process my data and to send me email alerts, as detailed in neuvoo's Privacy Policy . I may withdraw my consent or unsubscribe at any time.
    Application form