Top Reliability Engineer Jobs in Chicago, IL

Reposted 5 Days Ago
In-Office
Chicago, IL
130K-170K Annually
Senior level
Artificial Intelligence • Cloud • Information Technology • Mobile • Software • Consulting
The role involves designing and implementing OpenTelemetry solutions, optimizing telemetry infrastructure, establishing SRE practices, and managing observability across cloud platforms.
Top Skills: Argocd, AWS, Azure, Bash, CloudFormation, Docker, GCP, Github Actions, Gitlab Ci, Go, Java, Jenkins, Node.js, Opentelemetry, Powershell, Pulumi, Python, Rust, Terraform
Reposted 5 Days Ago
In-Office
Chicago, IL
106K-145K Annually
Senior level
Artificial Intelligence • Automotive • Internet of Things • Software
The Senior Site Reliability Engineer will manage system health, automate solutions, resolve incidents, and collaborate across teams to enhance performance and reliability.
Top Skills: APIs, Arm, Azure, Azure Cli, Bicep, Cloud Infrastructure, DevOps, Git, Powershell, Terraform, Virtualization
24 Days Ago
Easy Apply
Remote
Chicago, IL
186K-219K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Own reliability, automation, and DevOps for Coinbase's corporate IAM platform: on-call/incident response, CI/CD and IaC pipelines, identity lifecycle tooling, observability and disaster recovery, documentation, and cross-team IAM advisement to ensure secure, scalable access for a global workforce.
Top Skills: Abac, Auth0, AWS, Azure, C#, Ci/Cd, Container Orchestration, Duo, Entraid, GCP, Generative Ai, Git, Go, Iac, Java, Mfa, Okta, Ping, Python, Rbac, Ruby, Sso, Terraform
24 Days Ago
Easy Apply
Remote
Chicago, IL
186K-219K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Senior SRE on the IT Operations team owning reliability, monitoring, and incident response for AI infrastructure. Build automation, CI/CD and Kubernetes tooling, improve observability and documentation, and develop internal full-stack tools using Go or Python. Partner with Infrastructure, Security, and Compliance to scale secure, resilient AI deployment pipelines.
Top Skills: Ansible, AWS, Bash, Chef, Ci/Cd, Docker, Ec2, Git, Go, Kubernetes, Linux, Puppet, Python, Ruby, Salt, Terraform
24 Days Ago
Easy Apply
Remote
Chicago, IL
218K-257K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Own reliability, monitoring, and incident response for AI infrastructure; build automation and CI/CD tooling; manage Kubernetes/Docker production workloads; partner with infrastructure, security, and compliance; improve observability and documentation; develop internal full‑stack tooling in Go or Python.
Top Skills: Ansible, AWS, Bash, Chef, Ci/Cd, Docker, Ec2, Git, Go, Kubernetes, Linux, Log Aggregation, Network Security, Puppet, Python, Ruby, Salt, Terraform
7 Days Ago
In-Office or Remote
Chicago, IL
Mid level
Hardware
Lead technical services engineer guiding and training engineers, designing IT architecture, troubleshooting network security and third-party control integrations, coordinating projects, providing customer training and field support, and managing personnel and resources.
Top Skills: 802.1X, Amx, Crestron, Excel, Microsoft Outlook, Microsoft Powerpoint, Microsoft Word, Radius, Security Certificate Management
Reposted 7 Days Ago
In-Office or Remote
Chicago, IL
130K-165K Annually
Senior level
Information Technology • Insurance • Professional Services • Software
The Senior Site Reliability Engineer will enhance system reliability through infrastructure automation, support core applications, and optimize performance, collaborating with development teams on deployment processes.
Top Skills: AWS, Cdktf, CircleCI, Cloudfront, Docker, Elasticsearch, Github Actions, Jenkins, Lambda, Rds, Redis, Ruby On Rails, S3, Terraform, Typescript
Reposted 7 Days Ago
In-Office or Remote
Chicago, IL
160K-179K Annually
Senior level
Fintech • Payments
The Senior Staff SRE leads reliability engineering initiatives, drives operational excellence, mentors staff, and influences architecture to enhance system reliability and performance.
Top Skills: Ai/Ml, AWS, Azure, Docker, Elk Stack, GCP, Grafana, Kubernetes, MySQL, NoSQL, Postgres, Splunk
8 Days Ago
Hybrid
Chicago, IL
165K-288K Annually
Senior level
Artificial Intelligence • Cloud • Fintech • Information Technology • Analytics • Financial Services • Cybersecurity
Lead adoption and standardization of SRE practices across the enterprise. Establish SRE governance, define reliability metrics (SLOs/SLIs), build a Community of Practice, run training/forums, enable automation and tooling, partner with platform teams on observability, chaos engineering, and self-healing, and drive cross-functional alignment for resilience and incident management.
Top Skills: Automation, Azure Monitor, Chaos Engineering, Ci/Cd, Cloud-Native, DevOps, Dynatrace, Hybrid Architectures, Incident Management, Observability, Platform Engineering, Prometheus, Release Engineering, Self-Healing, Splunk, Sre
Reposted 9 Days Ago
Hybrid
Chicago, IL
245K-270K Annually
Senior level
Information Technology • Consulting
As a Senior Staff Site Reliability Engineer, you will lead the SRE team, advocate best practices, ensure resilience in cloud architecture, and mentor team members.
Top Skills: Argocd, CircleCI, Google Cloud Platform, Kubernetes, Pulumi, Terraform, Typescript
Reposted 14 Hours Ago
Remote
Chicago, IL
150K-200K Annually
Senior level
Cloud • Information Technology
As a Sr. Site Reliability Engineer, you'll ensure service reliability, build automation, and collaborate on infrastructure improvements while mentoring others.
Top Skills: Ansible, Catchpoint, Docker, Elk, Go, Grafana, Hashicorp Vault, Jenkins, Kubernetes, Linux, Prometheus, Python, Terraform
Reposted 14 Hours Ago
Remote
Chicago, IL
113K-176K Annually
Senior level
Other • Social Impact
As a Senior Site Reliability Engineer, you will manage and improve Wikimedia's infrastructure, handle operational tasks, automate processes, and provide mentorship while participating in a 24/7 on-call rotation.
Top Skills: Ansible, Bash, Debian, Go, Grafana, Hhvm, Kubernetes, Memcached, PHP, Prometheus, Puppet, Python, Redis, Ruby
Reposted 14 Hours Ago
Remote
Chicago, IL
115K-135K Annually
Mid level
Aerospace • Manufacturing
As a Site Reliability Engineer, you'll build and manage observability platforms for satellite communications, define SLOs/SLIs, and collaborate on incident response and deployment automation.
Top Skills: Argocd, AWS, Elk, GCP, Go, Grafana, Istio, Jaeger, Kubernetes, Linkerd, Loki, Opentelemetry, Prometheus, Python, Tempo, Terraform
Reposted 14 Hours Ago
Remote
Chicago, IL
Senior level
Artificial Intelligence • Fintech • Software • Financial Services
The SRE will own reliability for a cloud-native platform, optimizing performance, availability, and observability, while mentoring engineering teams.
Top Skills: AWS, Clickhouse, Go, Kafka, Kubernetes, Pulumi, Python, Terraform
Reposted 14 Hours Ago
Remote
Chicago, IL
173K-321K Annually
Senior level
Cloud • Security • Software • Cybersecurity
Design and maintain reliable infrastructure solutions for a cloud data protection platform. Ensure application scalability and support through CI/CD and monitoring tools while collaborating in a global team.
Top Skills: Appinsights, Aws Cloudformation, Azure Api Management, Azure Arm Templates, Azure Cosmos Db, Azure Devops, Azure Entra Id, Azure Functions, Azure Monitor, Azure Storage Services, Bash, Bitbucket, Elastic Stack, Git, Go, Microsoft Tfs, Powershell, Python, Serverless Framework, Terraform
Reposted 14 Hours Ago
Remote
Chicago, IL
82K-229K Annually
Senior level
Cloud • Software
Design, implement, and support Kubernetes and compute platforms in a private cloud. Oversee architecture and standardization across hardware, OS, and cloud orchestration.
Top Skills: Ansible, Bash, Ci/Cd, Helm, Kubernetes, Linux, Openstack, Python, Terraform, Ubuntu
Reposted 14 Hours Ago
Remote
Chicago, IL
150K-200K Annually
Mid level
Software
As a Senior Site Reliability Engineer at Regrello, you'll shape the developer platform, collaborate with customers, and ensure the reliability and security of infrastructure and applications.
Top Skills: AWS, Azure, CircleCI, GCP, Github Actions, Gitlab Ci, Go, Kubernetes, Terraform
Reposted Yesterday
Remote
Chicago, IL
Mid level
Blockchain • Software
Build, operate, and scale production Kubernetes infrastructure using GitOps and declarative IaC. Design CI/CD workflows, observability, and secure-by-default systems. Troubleshoot networking/storage, participate in on-call rotations, automate operational workflows, and drive postmortems and reliability improvements.
Top Skills: Arbitrum, Argocd, Argocd Applicationsets, AWS, Azure, Bash, Cloudwatch, Codebuild, GCP, Github Actions, Gitops, Go, Grafana, K9S, Kubernetes, Linux, Loki, Mimir, Prometheus, Prysm, Python, Terraform, Yaml, Zerodev
Reposted 11 Days Ago
In-Office or Remote
Chicago, IL
165K-225K Annually
Senior level
Artificial Intelligence • Cloud • Information Technology • Software
Build and operate production-grade AI infrastructure using Kubernetes, ensuring high availability, reliability, and performance. Develop custom operators and implement automation for efficient operations and monitoring.
Top Skills: Ansible, Bash, Elk Stack, Enterprise Storage Systems, Grafana, High-Performance Networking, Kubernetes, Linux, Nvidia Gpu Technologies, Prometheus, Python, Terraform
24 Days Ago
Remote or Hybrid
Chicago, IL
Senior level
Software
Lead reliability activities for photonic integrated circuits (PICs): evaluate failure modes, coordinate accelerated stress tests, develop lifetime models from aging data, and drive failure mode analyses across design, development, and production teams.
Reposted Yesterday
Remote
Chicago, IL
Senior level
Automotive
Design and implement scalable cloud infrastructure, monitor performance, automate processes, ensure security and compliance, and lead a DevOps team.
Top Skills: AWS, Bash, Ci/Cd, Docker, Elk Stack, GCP, Grafana, Kubernetes, Prometheus, Python, Terraform
2 Days Ago
Remote
Chicago, IL
Senior level
Software • Web3
Lead reliability practices across teams: embed early in projects, define SLIs/SLOs, build multi-cloud paved roads with Terraform, run on-call, drive org-wide incident maturity and tooling.
Top Skills: AWS, Azure, GCP, Ruby On Rails, Terraform, Typescript, Webcontainers
2 Days Ago
Remote
Chicago, IL
124K-171K Annually
Senior level
Healthtech • Pharmaceutical • Manufacturing
Support and maintain production Core Speech systems: deploy, monitor, alert, perform capacity planning, respond to on-call incidents, and drive system performance and architecture improvements.
Top Skills: Ansible, Aws Cloudfront, Aws Documentdb, Aws Ec2, Aws Efs, Aws Eks, Aws Rds, Aws S3, Containerd, Docker, Elasticsearch, Filebeat, Git, Gitlab, Go, Gocd, Grafana, Java, Jython, Kibana, Kubernetes, Logstash, MongoDB, Postgres, Python, Redis, Shell, Solr, Terraform
Reposted 2 Days Ago
Remote
Chicago, IL
150K-200K Annually
Senior level
Artificial Intelligence • Blockchain • Information Technology • Consulting
Lead design and build of production-grade Azure infrastructure using Terraform, ensuring scalable, secure, and repeatable deployments. Provide technical leadership, platform enhancements, observability and incident response improvements, and Tier 2 infrastructure support while collaborating with engineering, security, and product teams to meet enterprise readiness and feature parity goals.
Top Skills: Argo, Azure, Go, Grafana, Kubernetes, Prometheus, Python, Spacelift, Terraform
Reposted 2 Days Ago
In-Office or Remote
Chicago, IL
200K Annually
Senior level
Cloud • Software
The Senior Site Reliability / Gitops Engineer will drive automation and collaboration within the IS team, enhancing Canonical's IT operations and services while managing infrastructure as code and cloud technologies.
Top Skills: Cloud Computing, Docker, Elasticsearch, Gitops, Grafana, Iac, Kubernetes, Linux, Prometheus, Python