Engineering Manager - Site Reliability
In this role, you will work on the Site Reliability team as both a manager and a contributor. The team’s scope encompasses everything from datacenter operations all the way to developer tooling. Site Reliability Engineering’s mission is to help Sprout Social run a reliable, available and fault-tolerant service for our customers. If our product teams build the cars, then Site Reliability Engineering builds the roads, the gas stations, and the engines that help those cars go. We are a mixed-discipline team of tool builders, programmers, operators, and problem solvers who keep Sprout running with speed, efficiency, and reliability.
Part of this will be supporting and developing other members of the team - conducting regular check-ins, providing career and professional support, coaching and advising, conducting performance reviews, etc. While you should be excited about building the infrastructure that powers Sprout, we want you to be equally excited about the trajectory of your team and your part in developing its members into stellar site reliability engineers.
Here’s a small snapshot of what you might be doing on the Site Reliability Engineering team:
- In addition to engineering responsibilities, you will have managerial responsibility for supporting the careers of fellow site reliability engineers
- Bring vision to the team in the form of new technology recommendations, ideas and approaches
- Drive your team to build in Sprout’s best interests by being cognizant of business requirements, market needs, customer pain points, etc.
- Support and enhance the Sprout Social culture
- Build system monitoring tools in Python and Go so our engineering teams can respond to customer production issues quickly
- Automate server and resource provisioning with Chef and Terraform so our customers can count on a platform that is repeatable, consistent, and reliable
- Help us store billions of social messages in MySQL and Cassandra, and scale our storage as our data volume and velocity continue to grow
- Shape the future of our work with containers in Docker and Kubernetes, as we build more services and functionality to serve our customers
- Deliver our software to our customers continuously by improving our Jenkins deployment pipelines
- Build fault-tolerant, highly available, and zero-downtime infrastructure in AWS
- Help triage production issues as part of our sparse on-call support schedule, and drive our page rate towards zero
Desired Skills and Experience
Need to have:
- Minimum of 5 years of operational software development experience at scale
- Minimum of 2 years of management experience in a software environment, preferably at a high-growth SaaS company
- Demonstrable leadership, project and logistics management, and people mentoring skills
- Previous experience in fostering and growing a strong engineering culture in an agile and DevOps environment
- Smart, a very quick learner, and someone who gets things done
- Demonstrated experience working with large cloud deployments
- Very clear written and verbal communicator
Nice to have:
- Experience with multiple cloud providers, including Rackspace and AWS
- Strong applied knowledge of concurrency, I/O and distributed systems
- Heavy SQL and NoSQL database experience
- Experience designing large-scale automation systems
About Sprout Social
Sprout Social powers open communication between individuals, brands and communities through elegant, sophisticated software. We are relentless about solving hard problems for our customers and committed to both customer and team success.
Team Sprout is a group of very talented, smart and passionate people with broad interests and backgrounds. We’re proud to regularly be recognized for software, product and company culture achievements. Our team’s shared belief in Sprout’s mission promotes a culture of openness, empowerment and fun. We have built a benefits program to match the strength of our team. This program includes:
- Insurance and benefit options that are built for both individuals and families, including generous company contributions
- Progressive benefit programs, like our parental leave program and free Divvy bike memberships
- High-quality and well-maintained equipment - your computer will never prevent you from doing your best
- Beautiful, convenient and state-of-the-art offices in Chicago’s Loop
- Solid programs in some of the staples: transportation, disability and life insurance
- Wellness initiatives to support both the health and the financial well-being of our team
- Breakfast and lunch options onsite, and more healthy snack options and beverages than you can imagine
- Growing corporate social responsibility program that is driven by the involvement and passion of our team members