CAREERS

Senior Site Reliability Engineer at Ritual.co
Toronto, CA

About the Role

The Senior Site Reliability Engineer is a very hands-on role. You will be writing code, and building tools and dashboards, and reporting/monitoring metrics. As the team grows the role will focus on developing tooling, process and methodology to empower the entire engineering org to move faster and safer.  Furthermore, you will be a key evangelist for maintaining high degrees of reliability within the platform.

 

Here is a list of responsibility for the role:

  • Manage production development and improve deployment stability, global availability.
  • Design and implement observability infrastructure for production system.
  • Implement and monitor production metrics and dashboards.
  • Advise and implement production network & securities rules and best practices.
  • Deliver purpose-built solutions to meet clear milestones.
  • Drive excellence for reliability through building tooling to solve reliability challenges, designing efficient and standardized process, relentless automation, engineering reliability back into applications and maximizing performance.
  • Participate in on-call rotations and provide inputs to your team and partners to sustain SLAs.
  • Architect, review, develop and deliver applications to improve availability, scalability, performance and efficiency of our services.

Must Haves

  • Experience with cloud platforms
  • Experience with Containers and orchestration
  • Extensive experience in Dev Ops, SRE, or Other Production operations roles
  • Strong cloud, network and systems Architecture Design Skills.
  • Expertise with database server design
  • Experience with incident management and response
  • Advanced expertise with at least programming language (preference python, go, or Java would be great) Polyglot preferred.

Nice To Haves

  • Experience building microservices.
  • Experience with Kubernetes (Istio, ambassador, GKE)
  • Experience with large MySQL clusters (Galera, Percona, MariaDB)
  • Experience with metrics, monitoring, alerting and dashboard tools (FluentD, telegraf, InfluxDB, ELK, Grafana, DataDog, StackDriver, PagerDuty etc)
  • Experience with IaC and SCM tools (teraform, ansible, packer)
  • Experience with CICD, pipelines and tools (Jenkins)

What We Offer

  • Opportunity to work on an amazing consumer-facing product that our customers love.
  • Competitive compensation package and equity in the business.
  • Healthcare coverage and a generous vacation policy.
  • You will get a daily credit in-app credit towards lunches and coffees.
  • Your choice of the development environment to make you most productive.
  • A pantry full of snacks.
  • Fun company-sponsored events and off sites

What We Offer

  • Opportunity to work on an amazing consumer-facing product that our customers love.
  • Competitive compensation package and equity in the business.
  • Healthcare coverage and a generous vacation policy.
  • You will get a daily credit in-app credit towards lunches and coffees.
  • Your choice of the development environment to make you most productive.
  • A pantry full of snacks.
  • Fun company-sponsored events and off sites