Guild Mortgage Logo

Guild Mortgage

Senior Site Reliability Engineer

Posted 10 Days Ago
Remote
Hiring Remotely in United States
95K-136K Annually
Senior level
Remote
Hiring Remotely in United States
95K-136K Annually
Senior level
The Senior Site Reliability Engineer executes reliability strategies, designs and maintains infrastructure, improves monitoring and deployment processes, collaborates with teams for system reliability and performance optimization.
The summary above was generated by AI

Guild Mortgage Company, closing loans and opening doors since 1960. As a mortgage banking firm we are dedicated to serving the homeowner/buyer. Our goal is to provide affordable home financing for our customers, utilizing the best terms available while providing a level of professionalism and service unsurpassed in the lending industry.

Position Summary

The Senior Site Reliability Engineer is responsible for executing the organizational reliability strategy and participating in resiliency design reviews to ensure the reliability, scalability, and performance of our company's software systems and applications meet organizational service level objectives (SLOs) and error budgets. The role is responsible for designing, implementing, and maintaining the infrastructure and tools necessary to support our platforms, as well as improving our monitoring, automation, and deployment processes. This role involves strategic planning, technical leadership, and collaboration with various stakeholders including Guild’s Product Delivery, Data Services, DevOps, DataOps, and Infrastructure teams to support organizational goals.

Compensation

This role is an exempt position with a targeted salary range of $94,882 to $136,096 annually.

Compensation at Guild is influenced by a wide array of factors including but not limited to local and federal minimum wage requirements, education, level of experience, and applicant’s geographical location.

Essential Functions

  • Participate in resiliency design reviews and lead complex problem-solving efforts.
  • Design, implement, and maintain monitoring systems to track the performance, availability, and reliability of services.
  • Respond to incidents promptly, investigate root causes, and coordinate efforts to mitigate and resolve them.
  • Analyze performance data, and plan for scalability and capacity requirements.
  • Identify and optimize performance bottlenecks, both at the infrastructure and application levels.
  • Automate repetitive tasks and processes to improve efficiency and reduce manual intervention.
  • Implement and enforce change management practices to ensure safe and controlled changes to the production environment.
  • Design and implement fault-tolerant systems and practices to minimize downtime and ensure service availability.
  • Collaborate with the GRC team on developing and maintaining disaster recovery plans and procedures relevant to the software supported to minimize the impact of catastrophic failures.
  • Work with the Incident Management and other teams to conduct a thorough analysis of incidents, document postmortem reports, and implement improvements based on lessons learned.
  • Work closely with development, operations, and other teams to foster a culture of reliability, and provide feedback on system design and architecture for improved reliability.

Qualifications

  • Bachelors Degree directly related to the position or equivalent, preferred.
  • A combination of education and experience may be considered in lieu of the Bachelor’s degree.
  • Minimum five years experience.
  • Collaborate with stakeholders to define RPO / RTO for Guild’s system footprint.
  • Expert in Cloud-based redundancy, high availability, and reliability strategies.
  • Expert in reliability, scalability, and performance optimization.
  • Expert at maintaining Linux / Unix and Windows systems administration, provisioning, configuration, monitoring, and troubleshooting Web Servers in a 7x24 customer facing environment.
  • Strong Linux and Windows Administration & scripting.
  • Solid Database Administration skills (MySQL, MariaDB, RDS, Sql Server, and Azure Storage services).
  • Deep knowledge of current methodologies in high performance operations and scalable multi-site implementations.
  • Proven Experience with large-scale software implementation (high transaction volume, high-availability concepts).
  • Deep knowledge of software deployment, versioning (GIT) and release management processes.
  • Experienced with infrastructure design, implementation, and support.
  • Proficient at automated provisioning, automated configuration management, and containerization solutions and tools.
  • Experienced in cloud-based hosting solutions (AWS, Azure, GCP).
  • Experienced with Cloud server environments (AWS, Google Cloud, or Azure).
  • Experienced in Agile software development best practices utilizing Continuous Integration & Delivery Pipelines as well as agile tools such as Jira.
  • Excellent written and verbal communication skills.
  • Proficient in communicating to both technical and management levels.
  • Ability to interact with external customers and staff members.
  • Highly adaptable.
  • Ability to work in a fast paced, constantly expanding environment.
  • Excellent verbal and written communication skills required.
  • Highly organized and detail-oriented; ability to work in a fast-paced, metrics-driven environment required.
  • Proficiency in Microsoft Office Suite, Word, Excel, Wiki, collaborative cloud-based programs, and third-party software applications required.
  • Commitment to company values.
  • Customer Service - Proactive attention to each person.
  • Integrity - Do and say what's right.
  • Respect - Treat others with dignity.
  • Collaboration - Listen and work together.
  • Learning - Seek knowledge and strive for improvement.
  • Excellence – Deliver the unexpected.

Supervision 

Job Scope:  Responsible for understanding the department/functional area objectives and goals and how own job contributes to achievement of these goals; may recommend changes and enhancements based on analysis and evaluation of circumstances.

Complexity:  Problems encountered are often complex and may involve significant resource coordination and availability, evaluating and resolving discrepancies with data, analyses, processes, etc. using own expertise and judgment.

Impact:  Decisions and actions primarily impact own work with moderate impact on peers in their area; contributes as team member rather than leader.

Interaction/Supervision:  Works under broad direction with considerable latitude for independent actions; guided by professional standards, desired outcomes and unit/project/program specifications.

Requirements 

  • Work is primarily sedentary; mobility in an office setting.
  • Ability to operate standard office equipment and keyboards.
  • Regularly required to accurately perceive, distinguish and interpret information received visually and through audio; e.g., words, numbers and other data broadcasted aloud/viewed on a screen, as well as print and other media.
  • Office environment – moderate noise, no substantial exposure to adverse environmental conditions.
  • Travel 5% or less.
  • Learn new tasks, remember processes, maintain focus, complete tasks independently, and make timely decisions in the context of a workflow.
  • Work is primarily performed during the business week, Monday - Friday; occasional night or weekend may be necessary.

Guild offers a pleasant work environment, competitive compensation and excellent benefits package; including medical, dental, vision, life insurance, AD&D, LTD and 401(k) with employer match. 

Guild Mortgage Company is an Equal Opportunity Employer.

REQ#: SENIO018160

Equal Opportunity Employer
This employer is required to notify all applicants of their rights pursuant to federal employment laws. For further information, please review the Know Your Rights notice from the Department of Labor.

Similar Jobs

Yesterday
Remote or Hybrid
96K-163K Annually
Senior level
96K-163K Annually
Senior level
Blockchain • Fintech • Payments • Consulting • Cryptocurrency • Cybersecurity • Quantum Computing
Lead reliability, scalability, and production operations for a greenfield enterprise application. Influence design for production readiness, own incident response, define SLIs/SLOs, build observability and automation, enhance CI/CD, and improve developer experience across infrastructure and application stacks.
Top Skills: AWSChatgptClaudeCopilotDockerElasticsearchGithub ActionsGoGrafanaKubernetesOpensearchOpsgeniePrometheusSpring Boot
11 Days Ago
Remote or Hybrid
United States
175K-200K Annually
Senior level
175K-200K Annually
Senior level
eCommerce • Fintech • Payments • Software
The role involves ensuring software reliability and performance, managing incidents, developing infrastructure automation, and mentoring junior engineers within a platform team.
Top Skills: AWSCloudFormationDatadogKubernetesOpentelemetryRubyRuby On RailsTerraform
14 Days Ago
Remote or Hybrid
CO, USA
110K-145K Annually
Senior level
110K-145K Annually
Senior level
Information Technology • Insurance • Software
The Sr. Site Reliability Engineer at Vertafore will own the reliability and performance of production services, design incident response protocols, and enhance system observability while applying software engineering practices.
Top Skills: .NetAWSC#Ci/CdJavaKubernetesLinuxPythonReactWindows

What you need to know about the Chicago Tech Scene

With vibrant neighborhoods, great food and more affordable housing than either coast, Chicago might be the most liveable major tech hub. It is the birthplace of modern commodities and futures trading, a national hub for logistics and commerce, and home to the American Medical Association and the American Bar Association. This diverse blend of industry influences has helped Chicago emerge as a major player in verticals like fintech, biotechnology, legal tech, e-commerce and logistics technology. It’s also a major hiring center for tech companies on both coasts.

Key Facts About Chicago Tech

  • Number of Tech Workers: 245,800; 5.2% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: McDonald’s, John Deere, Boeing, Morningstar
  • Key Industries: Artificial intelligence, biotechnology, fintech, software, logistics technology
  • Funding Landscape: $2.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Pritzker Group Venture Capital, Arch Venture Partners, MATH Venture Partners, Jump Capital, Hyde Park Venture Partners
  • Research Centers and Universities: Northwestern University, University of Chicago, University of Illinois Urbana-Champaign, Illinois Institute of Technology, Argonne National Laboratory, Fermi National Accelerator Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account