Overview

Customer.io is looking for a collaborative Site Reliability Engineer (SRE) who loves solving interesting puzzles and is excited to help us build out a scalable, reliable platform that our customers love. The successful candidate should also be able work independently when needed and be able to lead other SREs.

Job Position: Senior Site Reliability Engineer

Job Location: Remote

Job Description

  • We are seeking product-minded, empowered individuals who work collaboratively with their peers on interesting problems, and get those solutions into the hands of customers quickly. We value diversity, attracting the best people in the world to serve as colleagues. Our flexibility and freedom to work from anywhere in the world enables you to craft a work environment in which you can do your best work.

Job Responsibilities

  1. Design, build, and maintain core infrastructure pieces that allow Customer.io scaling to support real-time processing and delivery of billions of messages
  2. Plan the growth of Customer.io’s infrastructure
  3. Automate the deployment process to make it as boring as possible
  4. Be on our on-call rotation to respond to Customer.io availability incidents and provide support for technical support engineers with customer incidents
  5. Ensure we have adequate observability of our infrastructure and applications.
  6. Debug production issues across services and levels of the stack.
  7. Take an active role in a friendly and supportive team that encourages you and the entire company to grow as individuals, professionals, and teams
  8. Learn, practice, and share with your coworkers through code review, pair programming, team collaboration, and training to help improve our collective knowledge and best practices together

Job Requirements

  1. Preferably 7+ years of experience as a site reliability engineer and/or software engineer
  2. Experience in managing and working with RDB systems (MySQL)
  3. A solid understanding of problems of scalability and experience deploying and managing distributed applications on cloud infrastructure
  4. Proven experience building cloud infrastructure via code using Terraform and automating operational toil
  5. Deep knowledge of UNIX environments and the ability to apply modern collaborative development practices.
  6. Go experience is a nice to have, but most of our engineers have succeeded while picking it up on the job
  7. Experience working with Google Cloud Platform is nice to have
  8. A collaborative mindset backed by excellent communication skills and a desire to help us make great decisions in an empathetic and respectful way
  9. Ability to work independently in your timezone and make progress on tasks and projects without needing frequent guidance
  10. Ability to work with all Engineering teams and lead technical discussions and solve problems
  11. Inclination to proactively find problems and solve them
  12. Ability to coach junior SREs and elevate them.
  13. Ability to take on long-term projects and see them through completion
  14. A security-first mindset
  15. A self-starter who values synchronous and asynchronous work
  16. Based in APAC time zones

How to Apply
Interested and qualified candidates should:
Click here to apply online

Tagged as: Information Technology