Oh Snap!
This job is no longer active - but you can still view the details below.

Site Reliability Engineer - DevOps

| Chicago

Description

 

ThinkTime, a start-up company backed by parent company Productive Edge, is looking for a Site Reliability Engineer. Our SRE is responsible for making thinktime.com highly reliable, fault-tolerant, maintainable, and scalable through monitoring, automation and identifying improvements needed in the system. The Site Reliability Engineer will participate in and improve the overall life cycle of development, from inception and design through implementation and production deployment, and ongoing operation.  They will do this through a combination of design consulting, conducting reviews, identifying additional tools and processes needed, capacity planning, and retrospective reviews.

Ideally, you will have a development background that shifted away from day-to-day development, has a passion for improving reliability, maintainability, application performance, and is an excellent troubleshooter.  As a Site Reliability Engineer, you must also have experience with the NewRelic APM or similar platforms and be able to guide us in adopting and implementing the best tools and practices.  

 

Responsibilities:

  • Proactively monitor and review application performance.  
  • Create monitoring dashboards and alerts using NewRelic, Solarwinds DPA for database monitoring, and log entries for log aggregation
  • Be partly responsible for responding to production incidents that are escalated to the Engineering team and helping identify solutions and improvements to prevent issues from occurring in the future
  • Design and implement monitoring and alerting solutions to identify possible issues before they impact customers
  • Work with the production support team to adopt monitoring tools and processes.
  • Identify improvements to make the system more fault tolerant and scalable, and work with the development team to implement those improvements
  • Participate in design reviews and make recommendations to improve the reliability and maintainability of the system
  • Help triage and respond to incidents escalated to the Engineering team, including emergencies,  escalating to the development team as needed
  • Automate operations including infrastructure changes and releases by enhancing our existing Ansible & Jenkins-based solution.  (Recommend automation improvements and additional tools as needed)
  • Participate in root cause analysis reviews to discuss the root cause of production issues, and identify improvements to avoid in the future
  • Test the resiliency of the system via tools such as Chaos Monkey
  • Ensure software has good logging and diagnostics
  • Create and maintain operational run books
  • Contribute to the overall product roadmap
  • Experience with load testing tools a plus!
  • Work on feature requests, defects and other development tasks, in particular, those related to monitoring, reliability, and scalability

 

Needed Skills:

  • Experience with NewRelic or similar infrastructure monitoring and APM solutions.
  • Linux Administration experience
  • Windows Administration experience
  • Experience automating operations, including releases and infrastructure, changes  Experience with Ansible or similar
  • Experience working with relational databases, including an understanding of relational table designs, and SQL experience
  • Experience with ElasticSearch or other NoSQL databases
  • Experience with Redis or other distributed caches
  • Experience with containers and Kubernetes
  • Software Development experience (.NET preferred but not required)
Read Full Job Description

Technology we use

  • Engineering
  • Product
  • Sales & Marketing
    • .NETLanguages
    • C#Languages
    • JavaLanguages
    • JavascriptLanguages
    • PerlLanguages
    • PythonLanguages
    • RLanguages
    • SqlLanguages
    • SwiftLanguages
    • jQueryLibraries
    • jQuery UILibraries
    • ReactLibraries
    • angularLibraries
    • VueLibraries
    • AngularJSFrameworks
    • ASP.NETFrameworks
    • Ember.jsFrameworks
    • HadoopFrameworks
    • Node.jsFrameworks
    • Ruby on RailsFrameworks
    • SpringFrameworks
    • Microsoft SQL ServerDatabases
    • MongoDBDatabases
    • MySQLDatabases
    • OracleDatabases
    • Google AnalyticsAnalytics
    • IllustratorDesign
    • InVisionDesign
    • SketchDesign
    • Visual StudioDesign
    • ConfluenceManagement
    • JIRAManagement
    • Microsoft ProjectManagement
    • Constant ContactEmail
    • MailChimpEmail

Location

PE is in trendy River North with great bars & restaurants nearby. Plus, the office is easy to get to with various train & bus stops being close!

An Insider's view of Productive Edge

How would you describe the company’s work-life balance?

The days of 9-5 are over, in a good way. We’ve been successful because we get things done. Our expectations are our employees are simple, and that is to collaborate and get things done. Adios punch card!

Tory

Digital Marketing

What does career growth look like on your team?

From our interns up through our managing partners, we believe everyone has the opportunity to learn from each other. We run flat where our directors and partners are the front-line working with our team to ensure they are heard and have a clear growth plan in place. Yes, the door is always open.

Tim

Delivery Director

What are some things you learned at the company?

I've learned the importance of teamwork and communication. At PE, all projects are team-based with a lot of things in motion during each step of the process: from the sales team to the delivery teams to the finance and leadership teams. Having that camaraderie and communication makes a smooth process that works internally as well as for clients.

Kate

Finance Manager

What are Productive Edge Perks + Benefits

Productive Edge Benefits Overview

Work Hard, Play Hard
Celebrate often is our motto. Celebrate the success of others and the work accomplished. From quarterly outings to weekly events, to just hanging around in the office after work, we celebrate with those around us for making our company great.

Community Appreciation
We love to share our passion for technology with others. Learn more about the ways we're empowering our Chicago community and how you can get involved.

Health Insurance & Wellness Benefits
Flexible Spending Account (FSA)
Disability Insurance
Dental Benefits
Vision Benefits
Health Insurance Benefits
Life Insurance
Pet Insurance
Retirement & Stock Options Benefits
401(K)
Performance Bonus
Child Care & Parental Leave Benefits
Flexible Work Schedule
Family Medical Leave
Company sponsored family events
Acme co. sponsors family oriented events Annually.
Vacation & Time Off Benefits
Generous PTO
Paid Volunteer Time
Paid Holidays
Paid Sick Days
Perks & Discounts
Beer on Tap
Casual Dress
Commuter Benefits
Company Outings
Game Room
Stocked Kitchen
Some Meals Provided
Happy Hours
Professional Development Benefits
Job Training & Conferences
Lunch and learns
Cross functional training encouraged
Promote from within
Mentorship program
More Jobs at Productive Edge16 open jobs
All Jobs
Data + Analytics
Dev + Engineer
Product
Project Mgmt
Sales
Project Mgmt
new
Chicago
Developer
new
Chicago
Developer
new
Chicago
Developer
new
Chicago
Developer
new
Chicago
Data + Analytics
new
Chicago
Developer
new
Chicago
Sales
new
Chicago
Product
new
Chicago
Project Mgmt
new
Chicago
Developer
new
Chicago
Developer
new
Chicago
Developer
new
Chicago
Project Mgmt
new
Chicago
Developer
new
Chicago