Site Reliability Engineer

Sorry, this job was removed at 10:50 p.m. (CST) on Wednesday, October 18, 2017
Find out who's hiring in Chicago.
See all Developer + Engineer jobs in Chicago
Apply
By clicking Apply Now you agree to share your profile information with the hiring company.
Overview

At Relativity, we make great software that helps users organize data, discover the truth, and act on it. Our product is used by more than 13,000 organizations around the world – in the cloud, on-premises, or both – to manage large volumes of data.

Here you can own your career in a community of values-driven people who help our customers around the world solve complex data challenges. If this sounds like the place for you, check out the details of this position below.

The Site Reliability Engineer is responsible for activities related with monitoring, operating and improving resiliency of a large distributed enterprise cloud solution. This role is also responsible with making changes directly or providing feedback and suggestions to other Engineering teams on how to make the overall system more performant and more reliable.

 Responsibilities

The Site Reliability Engineer is responsible for delivering results for the Product Development department by:

  • Maintaining a highly-distributed system in a public cloud with distributed database, compute and storage systems
  • Contributing to a Lean (Kanban), or hybrid team to solve the operational challenges
  • Deploy changes into testing and production environments
  • Provide feedback at Change Advisory Board (CAB) meetings regarding upcoming changes and changes that have been implemented
  • Provide feedback to Engineering teams regarding areas of the software that require more monitoring\alerting capabilities as well as can be engineered to be more resilient
  • Track changes to the system in the Change Management Database
  • Following practices and procedures that adhere with industry best practices for operating a large-scale infrastructure and software system
  • Collaborate with software development teams to understand new features being delivered to the cloud solution and gain an understanding of how to monitor\operate
  • Continuously improve monitoring and alerting capabilities of the system as well as make changes to make the application and infrastructure more resilient
  • Support Problem and Incident managers by providing information regarding trends of reoccurring issues within the application and cloud infrastructure

 

In addition to the above responsibilities, the Site Reliability Engineer is expected to display professionalism in the following ways:

  • Maintain an attitude of commitment through outward display of willingness
  • Practice positive interactions - lean on encouragement in place of judgment
  • Impress responsibility on others by displaying ownership in tasks
  • Act in the interest of the overall team and our customers
  • Understand the needs of our customers
Qualifications
  • Experience working in an Operations Center
  • Experience supporting public cloud based infrastructure
  • At least one year of experience with Windows Server, Linux, IIS and SQL Server experience; designing and deploying systems from the ground up, with knowledge and experience deploying and provisioning storage and networking
  • Experience with storage knowledge required
  • Cloud Services – Knowledge around MS Azure and other cloud offerings is a plus
  • OS / Software – Microsoft Windows Server, Linux, Internet Information Services, MS SQL Server, and typical back-office product knowledge
  • Automation – Powershell, Chef, Python experience to help with automating repeatable tasks
  • VMware – vCenter, ESXi, vCloud Automation Center
  • Storage – General knowledge of iSCSI vs Fiber Channel, NAS, SAN, DAS, local
  • Networking – General networking knowledge, VLANs, routing, VMware based switching, and firewall concepts
  • Ability to maintain a calm demeanor when things are going wrong to troubleshoot issues effectively
  • A big picture mentality around solutions architecture and a Keep It Simple philosophy
  • AWS, Azure, or VMware certification a plus
  • Excellent communication and inter-personal skills, including the ability to communicate difficult technical concepts in a straight-forward, simple, manner

Minimum Qualifications:

  • Bachelor’s Degree or equivalent in Computer Science or related disciplines
  • 2 + years of experience with scripting and automation languages (Powershell, Chef, Ruby, Python, etc.)
  • 2 + years of supporting customer facing web delivered software
  • 1 + years of cloud experience
  • Experience with SQL Server and No SQL Systems (Elastic, Mongo, Cassandra, etc.)
About Us

Our software has more than 150,000 active users in more than 40 countries from organizations including the U.S. Department of Justice, more than 70 Fortune 100 companies, and more than 195 of the Am Law 200. We have grown significantly over the last several years and continue striving to build software that helps solve our customers’ toughest e-discovery and unstructured data challenges.

If you’re ready to grow with us, we’d love to hear from you.

#LI-KV1 

If you’re ready to grow with us, we’d love to hear from you.

Read Full Job Description
Apply Now
By clicking Apply Now you agree to share your profile information with the hiring company.

Location

We’re a community of passionate, life-long learners tackling challenging problems. We care about each other and about our community.

Similar Jobs

Apply Now
By clicking Apply Now you agree to share your profile information with the hiring company.
Learn more about RelativityFind similar jobs