Staff Software Engineer, Site Reliability
Job Type
Full Time
Salary
$159k - $239k
Skills
React
Summary
Mission/Vision: Babylist is a trusted platform for growing families, aiming to help parents make decisions with confidence and build happy families.
Key Responsibilities: Manage AWS infrastructure, improve CI systems, support developers in troubleshooting, and establish monitoring best practices.
Growth Opportunities: Work with cutting-edge technologies in a high-traffic environment, contribute to system optimization, and drive continuous improvement in site reliability.
Description
Who We Are
Babylist is the trusted platform for millions of growing families. For over a decade, Babylist has been the technology solution for expecting parents and the community that supports them, expanding from baby registry into a full-service platform that helps parents make decisions with confidence, stay connected, and build happy and healthy families. Every year Babylist helps over 9M people make purchases through its registry, app, ecommerce shop, and comprehensive product guides. The Babylist ecosystem now includes Babylist Health, which provides access to products and services including insurance-covered breast pumps, Expectful, a new voice in health and wellness for pre-pregnancy through postpartum care, and The Push, a branded content studio that works with the biggest companies in the baby space. With over 59M monthly pageviews and 1.2M TikTok followers, Babylist is a generational brand leading the $88 billion baby product industry. To learn about Babylist’s registry options, editorial content, and more, visit
Our Ways of Working
We have team members located across the United States spanning multiple time zones. This means we put in extra effort to make sure we connect and collaborate in ways that make sense for us. We know how valuable the flexibility of remote work is for our employees.
We know that personal connection is the foundation for the great work we do together. In order to build those relationships with team members in other cities, we meet with coworkers in person two times a year at a full company offsite and a departmental offsite. These offsites are expected of employees and are great opportunities to meet the people you work with every day and to do some of the work that is much more difficult to do virtually.
Our Tech Stack
React
Ruby on Rails
AWS
Sidekiq
MySQL
Redis
Native iOS and Android
What the Role Is
Babylist is looking for both a Staff and Senior Software Engineer, Site Reliability to join our Platform team. In this position, you will play a vital role in ensuring our systems and services' stability, scalability, and reliability. You will work closely with all Babylist Engineering teams to support shared infrastructure and developer tools. Your expertise in site reliability engineering, AWS cloud infrastructure, and modern DevOps practices will be instrumental in optimizing our systems and driving continuous improvement.
Who You Are
6+ years of experience as a Site Reliability Engineer or similar role, demonstrating a strong background in maintaining highly available and scalable systems
Experience supporting high-traffic consumer-facing websites, understanding the unique challenges and considerations in maintaining such systems
Proficiency with Terraform is a must, as you will be a member of the team responsible for managing and building our AWS infrastructure using Infrastructure as Code (IaC) practices
You possess strong experience working with AWS cloud-based infrastructure and services, ensuring their reliability, performance, and security
Proficiency with Docker and Kubernetes is essential, as you will contribute to the design, deployment, and management of containerized applications in our environment
You have a solid understanding of cloud-native systems design, including CDNs, load balancers, cloud networking, DNS, caching, and distributed systems
Troubleshooting and debugging are second nature to you, allowing you to quickly identify and resolve issues across various environments
Experience designing and supporting CI systems such as CircleCI, Jenkins, or GitHub Actions
You are familiar with monitoring and alerting best practices, utilizing tools like Datadog, Cronitor, Sentry, and PagerDuty to ensure proactive identification and resolution of issues
Proven experience in on-call management best practices, including effective incident response, escalation procedures, and post-incident reviews to drive continuous improvement and ensure system reliability
You have excellent verbal and written communication skills, and the ability to collaborate effectively with cross-functional teams
How You Will Make An Impact
Manage and build our AWS infrastructure using Infrastructure as Code (IaC) tools like Terraform. You will ensure that our EKS clusters and databases are running up-to-date versions, optimizing performance and reliability
Improve the speed and reliability of our Continuous Integration (CI) systems to support the entire Engineering Team, enabling faster and more efficient development and deployment processes
Provide support to developers in troubleshooting issues across local development, staging, and production environments
Establish, communicate, and support best practices for monitoring and alerting. This will involve setting up effective monitoring systems and defining actionable alerts for proactive incident management
Why You Will Love Working At Babylist
We invest in the infrastructure you’ll need to be supported and successful: tools, opportunities to connect with colleagues, and a stipend to help you set up your office
We build products that have a positive impact on millions of people’s lives
We work at a sustainable pace which means work/life balance is a real thing here
We believe technology and data can solve hard problems
We believe in exceptional management
We are an antiracist organization and doing the work to support differences of all kinds
We offer competitive pay and meaningful opportunities for career advancement
We have great benefits like company paid medical, dental, and vision, a generous paid parental leave policy, and 401k with company match
We care about employee wellbeing with perks for physical, mental and emotional health, parenting, childcare, and financial planning
Babylist takes a market-based approach to pay, and pay may vary depending on your location. Your actual base salary will depend on factors such as your skills, qualifications, experience, and work location.
The estimated pay range for this role is $159,360.00 - $239,040.00
In addition, Babylist offers equity, bonus, and benefits, including company paid health, dental and vision insurance, 401(k) matching, flexible spending account, and paid leave (including PTO and parental leave) in accordance with our applicable plans and policies.
_IMPORTANT NOTICE: Our company takes the security and privacy of job applicants very seriously. We will never ask for payment, bank details, or personal financial information as part of the application process. All of our legitimate job postings can be found on our official career site. Please be cautious of job offers that come from non-company email addresses (@babylist.com), instant messaging platforms or unsolicited calls.
_
#bi-remote