Senior Site Reliability Engineer

Bedford, MA
We are seeking an experienced Senior SaaS Operations Engineer who can build out and support our web infrastructure (servers, databases, networks, monitoring, reporting) as well as our processes and procedures (continuous integration, backups, disaster recovery). The successful candidate will drive the design, development and execution of our transition to an AWS cloud infrastructure that supports the internally developed suite of software applications as well as internal systems.
The impact you’ll have:
The Senior Operations Engineer’s responsibilities include, but are not limited to, the following:
  • Build, operate, scale, and troubleshoot the SaaS infrastructure.
  • Work with other teams to make sure that the infrastructure and the applications that depend on it work together seamlessly.
  • Write tools for systems and infrastructure automation.
  • Collect and report on operational metrics for SLA reporting and capacity planning.
  • Monitor the health of our production infrastructure.
  • Conduct patching and server security maintenance, including vulnerability testing/management.
What we’re looking for:
Successful candidates will thrive in a fast-paced environment and demonstrate a record of achievement:
  • 8-10 years of website operations experience in a startup environment
  • Linux sys admin experience on Amazon Web Services (EC2, RDS, CloudFront, CloudWatch, DynamoDB)
  • Has built systems and infrastructure supporting web applications in a production environment (Terraform, Puppet)
  • Has experience operating a 24/7 production web application
  • Experience with MySQL, Java/Tomcat/Apache, Splunk and open-source tools for systems monitoring / management
  • Experience coding for task automation – shell/Perl/Python
