Americas, Any

Sr. Site Reliability Engineer


Aha! is a different kind of high-growth SaaS company. We are the world's #1 roadmap software and help people achieve their best. Over 5,000 enterprises and 500,000 product, innovation, and engineering leaders trust our software to build lovable products and be happy doing it. We are self-funded, highly profitable, always distributed, and have no sales team. Being an always-remote company means we hire intrinsically motivated people who love to learn, support their teammates, and want to work from where they are happiest.

Our team

Aha! engineering is a mid-sized, fully remote team that is highly productive. We are centered around North American time zones so we can collaborate during the workday.
  • We move quickly: We ship code multiple times a day. We believe in getting new features in front of customers and iteratively improving as we learn what works and what does not.
  • We collaborate: We each bring unique experiences and skills to the table. Working together to share that knowledge benefits the entire team and helps us produce the best results for our customers.
  • We value product over process: We want the team to have the time and focus to solve complex challenges. We aim to minimize the overhead introduced by heavyweight processes and excessive meetings.
  • We enjoy: We like what we do. And we want you to love your job too. Learn more about The Responsive Method, our company values, and the generous benefits we offer.

Our technology

Our web application is a single-instance, multi-tenant Ruby on Rails monolith supported by Postgres (database), Redis (background jobs), and memcached (Rails caching). We also run a Node.js webserver to support collaborative editing and real-time updates. Our application is hosted on Amazon Web Services and architected with ECS for reproducibility and scalability.

We embrace new technologies that help us deliver a lovable product, but we also remain cognizant of the maintenance overhead that a new library or platform brings. We solve the problems in front of us rather than prematurely optimizing to address issues that may never materialize.

We do most of our planning and collaboration in Aha! Roadmaps and built Aha! Develop so that software engineers and their teams could take advantage of those same rich features. We use Slack and Zoom for video calls. (Email? Rarely.)

Your experience

We believe that being a kind person who elevates the rest of the team is just as valuable as writing great code. You have strong problem-solving skills and experience working on important functionality for a cloud-based product. You are humble, eager to learn, and always willing to help others learn as well. You want to work with people who enjoy picking up a problem and solving it, regardless of the technologies and techniques involved.

You have helped build and operate a cloud-based SaaS product at considerable scale, and want to do it again. You have plenty of experience building infrastructure using terraform. You are calm under pressure, and respond methodically to anomalies or outages. You love the services you manage to be fast and efficient.

Most of our features involve writing significant Ruby on Rails code, so experience working in a Rails codebase is a big plus.

Your work at Aha!

Site Reliability Engineers at Aha! ensure the platform remains stable, reliable, and secure for the world's biggest and most innovative companies. You will implement significant operations architectural features, contribute to supporting product developers, and consult with product developers whenever there is a concern about performance or scalability. Day to day, this will look like:
  • Setting and monitor SLOs for the organization, working with product and engineering teams to ensure they are met
  • Building and maintaining monitoring, observability, and autoscaling solutions for our own services, as well as those we purchase from AWS
  • Writing and maintaining production runbooks and operations documentation
  • Providing on-call operational support for production on a rotation
  • Assisting platform engineers in building new infrastructure services, and consulting with application engineers to help build fast, reliable features

If the Sr. Site Reliability Engineer role sounds appealing, we would love to hear from you. (A real human reviews every application.)

Grow with us

Everyone deserves to reach their fullest potential. We know that when we do work that matters with people we care about in a high-growth environment, we feel engaged and alive. And our goal is to help you do just that. We offer all the benefits you would expect and more, including profit sharing. The specific benefits listed below are reflective of what we offer U.S.-based hires. We also do our best to extend identical benefits to international teammates.
  • Generous salary with annual profit sharing for all
  • Medical, dental, and vision plans — for many teammates, we cover 100 percent of the premiums
  • Up to 200 hours of paid time off a year to spend however you want
  • 30 to 90 days of paid parental leave and five to 10 days of paid care and bereavement leave
  • Up to $1,000 annually for third-party education, along with paid time off to immerse yourself in learning
  • Aha! contributes a percentage of your total compensation each year towards your retirement
Browse other jobs