Site Reliability Engineer
What We'll Bring
At TransUnion, we have a welcoming and energetic environment that encourages collaboration and innovation. We are consistently exploring new technologies and tools to be agile. This environment gives our people the opportunity to hone current skills and build new capabilities, while discovering their genius.
Come be a part of our team – you’ll work with great people, pioneering products and cutting-edge technology.
The Global Transformation Engineering Operations organization is looking for a talented Site Reliability Engineering Advisor who wants to have a widespread impact across the organization. This individual will work with multiple teams supporting the U.S. Markets Core Online technology applications, focusing initially on on-prem infrastructure delivery and evolving toward managing infrastructure as code as our cloud migration progresses.
What You'll Bring
Broad knowledge of middleware technologies such as Tomcat, JBoss EAP, and WebSphere, and of industry trends, with subject-matter specialization in web and application server infrastructure hosting and administration.
Experience with upgrading middleware infrastructure components such as Tomcat, JBoss, WebSphere, Apache, IBM HTTP Server, and Nginx.
Proven expertise in Ansible/Ansible Tower for platform automation across multiple technology stacks. Knowledge of cloud IaC tools such as CloudFormation, Terraform, and Packer.
Proven experience with Bitbucket
Proven experience with Jenkins
Experience with instrumentation using monitoring tools such as AppDynamics and Wily Introscope
Experience in log management tools such as Splunk
Experience working in an Agile (SAFe/Kanban) environment and with agile tools (e.g., Rally)
Must be self-motivated and self-directed, detail-oriented, capable of managing complexity, and have a curious mind that continuously strives for operational improvement through automation and self-healing processes
Works well in a dynamic, fast-moving and collaborative work environment with a positive attitude and solid work ethic
Demonstrated ability to develop a deep understanding of the business domains supported, with a keen interest and drive to continuously learn new technologies and competencies
Ability to coach and guide other engineers through complex technical matters, including the ability to work with and lead virtual teams
What We'd Prefer to See:
Experience with containerization (EKS or Docker/Kubernetes)
Scripting technologies including Bash, Perl, Python, or similar tools
Experience with AWS cloud platform including AWS Cloud Certifications
Impact You'll Make
Provide technical guidance and direction to Core Online Site Reliability Engineering (SRE) team and mature SRE practices
Lead and collaborate to create and deliver innovative infrastructure solutions based on technical requirements, product roadmap and anticipated feature releases
Partner with Core Online Application Development teams to design, develop and execute cloud infrastructure migration
Install new / upgrade existing application servers and configure the middleware technology components in accordance with TransUnion standards and project/operational requirements
Improve system reliability through optimization and automation, leveraging tools such as Ansible
Perform system and application monitoring, verifying integrity and availability of all hardware, server resources, systems and key processes
Protect and secure environments and improve the security posture of supported services, including platform/environment patching, in keeping with information security standards
Drive solutions to problems and critical support issues for Core Online applications, address issues and incidents expeditiously, and perform problem and performance analysis to identify root causes and mitigating actions as warranted