Senior Site Reliability Engineer at iManage (Chicago, IL)
Position Overview
We are looking for a Senior SRE who is interested in building something from the ground up with our new and exciting cloud platform. In this role you will contribute as part of a newly formed SRE team. You will participate in architectural and design discussions, help avoid and reduce toil, and provide a scalable, reliable platform for the success of our customers and organization.
Key Responsibilities
- Participate in Agile Sprints and associated ceremonies
- Drive innovation and platform evolution
- Scale cloud infrastructure to support our growing ecosystem based on Docker and Mesos
- Provide reliable, predictable deployment and maintenance of distributed systems
- Adhere to security best practices
- Write and design automation, monitoring, diagnostics and debug tooling
- Participate in production support and on-call rotations
- Conduct incident management and contribute to the associated retrospectives and postmortems as needed
Requirements
- 3+ years in an SRE role
- Working knowledge of SCM tools such as Ansible, Puppet, Chef, or Salt - Salt and/or Ansible preferred
- Experience with IaC (Infrastructure as Code) concepts and tooling - Terraform preferred
- Solid understanding of working with Git and Gitflow
- Knowledge of Docker engine and ecosystem
- Can troubleshoot and debug container issues at any level, including container networking
- Understanding of Docker networking, including different network plugins and frameworks such as Calico
- Experience with Mesos / Marathon ecosystem - Kubernetes a plus
- Strong knowledge and understanding of microservices-based architectures
- Good understanding of networking, including L2 and L3 concepts
- Strong background in administering and maintaining Linux-based systems
- Strong scripting skills, including the ability to write scripts from scratch using Python and/or Bash
- Can identify and mitigate reliability risks
- Excellent communication and troubleshooting skills
- Experience with Continuous Integration and Continuous Delivery models, including Blue/Green and Canary release models, is a plus
- Desired skills: experience working with HashiCorp Vault, Consul, and Terraform; provisioning experience with Mesos or Kubernetes clusters; and knowledge of network architecture, VMware, KVM, and OpenStack
About iManage
iManage transforms how professionals in legal, accounting and financial services get work done by combining the power of artificial intelligence with market-leading document and email management. iManage automates routine cognitive tasks, provides powerful insights and streamlines how professionals work while maintaining the highest level of security and governance over critical client and corporate data. Over one million professionals at over 3,000 organizations in over 65 countries – including more than 2,000 law firms and 500 corporate legal departments – rely on iManage to deliver great client work.
Learn more at: www.imanage.com