Site Reliability Engineer II (BCDR)

Boston | Norwalk | Remote | Rochester | Toronto

Datto, the world’s leading provider of IT solutions delivered through managed service providers, is looking for a Site Reliability Engineer to join a growing team. Datto is a creative company at its core and is an exciting and dynamic workplace. We're 100% focused on our managed service provider partners and believe that with the right technology, managed service providers can change how businesses around the world operate. Datto provides data protection, business continuity, networking, business management, and file backup and sync products that empower and protect the clients of our 17,000+ partners. We're headquartered in Norwalk, Connecticut and have 22 offices worldwide. Learn more at datto.com.

As Reliability Engineering Community at Datto our goals are to:

  • Enable our product engineering teams to do their best work
  • Be the conduit between software engineering and other engineering groups such as infrastructure, platform, security, support and database engineering
  • Advocate for Reliability Engineering and its benefits across Datto
  • Provide our product engineering teams with tools, knowledge and services to collect data and observe their respective applications
  • Continuously research, prototype and evolve our technology, services and develop our next-best-practices 

We jump in when needed to quickly help resolve service impacting issues. We continuously learn and collaboratively improve our incident management and detection. We provide valuable knowledge and guidance during incident retrospectives/post mortems and educate individual teams on data collection and analysis in order to improve the long-term reliability of our technology.

The Site Reliability Engineer will ensure that our Datto Business Continuity Product Technology Ecosystem is observable, durable, scalable, secure and reliable.

Your job function and responsibilities include:

  • Develop, deploy, and maintain the appropriate systems, services, and tooling in Datto’s production environment that provides constant feedback to stakeholders
  • Implement best practices promoting service availability/reliability and fault tolerance
  • Help us simplify and evolve the appropriate observation technology and practices 
  • Support our efforts to continuously and dynamically scale the Datto BCDR Product Technology Ecosystem and reduce human intervention as needed by automating any repetitive operational activities
  • Collaborate with the Product and Software Development teams to determine the products reliability strategy, including the establishment of SL{X} - Objectives (SLOs), Indicators (SLIs), Management(SLM) and Agreements(SLAs)
  • Assist in collecting the right data so that SLI’s can be measured, monitored, the right teams can be alerted and dashboards can be built for investigation and reporting purposes
  • Guide the software engineering teams through our Reliability Engineering review and ensure that service reliability best practices are a core tenet of all new software design and development
  • Collaborate and partner with the Datto SRE community as well as external SRE practitioners to ensure overall consistency, cross product and platform reliability as well as continuously learn and share knowledge
  • Be a contributor to our SRE Community and our Employee Resource Groups
  • Troubleshoot complex issues effectively; continually develop next-best practices, improve and evolve processes and reliability based on post-mortem analysis
  • Participate in our operational activities like hiring new team members, be a buddy to a new team member, share your knowledge during Engineering demos, support our Customer Engagement and Problem Management teams
  • Communicate with Users, Support, and Engineering teams in the event of an incident

Your Experience:

  • System, Software, Infrastructure and Platform engineering including but not limited to automation, release management, performance analysis, capacity planning
  • Systematic approach to troubleshooting
  • Ability to create observation infrastructure, tooling and processes that supports logging, metrics capture, statistics, event based monitoring and tracing within cloud-based ecosystems
  • Experience with platform or infrastructure management tooling
  • Well rounded communication skills, self-motivated & willing to learn continuously
  • Ability to work independently and as part of a remote/hybrid team

At Datto, we believe our employees are our greatest asset and offer all full-time employees a wide-ranging benefits package, including:

  • Comprehensive health-care benefits
  • Free lunch every Friday
  • Flexible working hours
  • Flexible paid time off
  • Paid parental leave
  • Free food, drinks, and fresh organic fruit
  • Charity match program
  • Education reimbursement
  • Employee Resource Groups
  • And more!

By submitting an application, you acknowledge we will process your data to consider you for the position you apply for and for other open positions within our company for which you may be suited. We collect and store your data following our Recruiting Privacy Practices.

Datto is an equal opportunity employer.

Site Reliability Engineer II (BCDR)

Demographic Questions

Individuals seeking employment at Datto are considered without regards to race, color, religion, national origin, age, sex, marital status, ancestry, physical or mental disability, veteran status, gender identity, or sexual orientation. In order to track the effectiveness of our recruiting efforts and ensure we consider the needs of all our employees, please consider answering the following questions.

Completion is entirely voluntary. Whatever your decision, it will not be considered in the hiring process or thereafter and any information that you do provide will be recorded and maintained in a confidential file.

Your responses to any of the following questions will be anonymized and only used to improve Datto’s diversity and inclusion initiatives. These responses will not be used / reviewed in connection with your application for employment.

I identify my gender as:

I identify as transgender:

I consider myself a member of the LGBTQ+ community

I identify my sexual orientation as:

I identify my ethnicity as:

Veteran status:

I have a physical disability:

loadingspinner

Sorry, your application was not successfully submitted

Hurray! Your application was successfully submitted

Back to Careers