Top Remote Reliability Engineer Jobs in Chicago, IL

Reposted 5 Days AgoSaved
Remote
USA
Senior level
Senior level
Information Technology • Cryptocurrency
The Site Reliability Engineer will lead technical initiatives, architect solutions, troubleshoot issues, mentor team members, and improve observability practices.
Top Skills: ArgocdBashElk StackGCPGoGrafanaHelmKubernetesPrometheusPythonTerraform
6 Days AgoSaved
Remote
United States
96K-192K Annually
Senior level
96K-192K Annually
Senior level
Blockchain • Financial Services • Cryptocurrency • Web3
Design, build, and operate scalable, observable infrastructure for AI agent workflows. Build platform services, APIs, and SDKs; manage cloud, Kubernetes, and model-serving compute; implement IaC, CI/CD, monitoring, incident response, security controls, and runbooks; collaborate with AI and data teams to productionize agent prototypes.
Top Skills: AWSBashCi/CdDockerKubernetesPythonTerraform
Reposted 6 Days AgoSaved
Remote or Hybrid
United States
154K-199K Annually
Senior level
154K-199K Annually
Senior level
3D Printing • Aerospace • Hardware • Robotics • Software
Lead the reliability and scalability of BRINC's production systems, building secure cloud infrastructure and improving incident response. Collaborate with teams for optimal system performance.
Top Skills: AWSInfrastructure As CodeJavaScriptNode.jsPython
Reposted 6 Days AgoSaved
Remote
USA
113K-175K Annually
Senior level
113K-175K Annually
Senior level
Information Technology • Internet of Things • Software • Virtual Reality
Lead reliability, availability, and resiliency strategies for large-scale systems, drive operational excellence, and provide technical mentorship across engineering teams.
Top Skills: AWSCi/CdJavaMongoDBRabbitMQZookeeper
Reposted 7 Days AgoSaved
Remote
United States
140K-197K Annually
Expert/Leader
140K-197K Annually
Expert/Leader
Artificial Intelligence • Cloud • Information Technology • Software • Big Data Analytics
As Staff SRE for Project Volcano, you'll own reliability, architect infrastructure, scale data services, and set SRE practices while mentoring teams.
Top Skills: ArgocdDatadogGrafanaHelmKubernetesPostgresPrometheusRedisTerraformTerragrunt
7 Days AgoSaved
Remote
United States
152K-253K Annually
Mid level
152K-253K Annually
Mid level
Cloud • Security • Software • Cybersecurity
Join the GOV/Sovereign Cloud SRE team to maintain and improve reliability for the Veeam Data Cloud. Responsibilities include incident response, SLIs/SLOs, observability (monitoring, alerting, dashboards), runbooks and documentation, IaC and CI/CD work in compliance-restricted environments, and participation in on-call rotations. Collaborate with engineering, security, and compliance teams to implement high availability and automation.
Top Skills: ArgocdAzureAzure DevopsAzure GovernmentC#Elk StackGithub ActionsGitlab CiGoGrafanaJavaJavaScriptKubernetesOpentelemetryPrometheusPulumiTerraformTerragruntTypescript
Reposted 7 Days AgoSaved
Remote
United States
Mid level
Mid level
Healthtech • Software
Maintain reliability, performance, and scalability of cloud-hosted services and databases. Implement SRE best practices, define SLIs/SLOs, respond to incidents, build monitoring and automation, perform DBA tasks (backups, restores, tuning), support CI/CD and DB migrations, and document runbooks and procedures.
Top Skills: Amazon RdsAzure Sql DatabaseBashEcs FargateFlywayGitlabJenkinsKubernetesLiquibaseOctopus DeployOraclePostgresPowershellPythonRedisSolarwinds DpaSQL Server
Reposted 16 Days AgoSaved
In-Office or Remote
Chicago, IL, USA
130K-165K Annually
Senior level
130K-165K Annually
Senior level
Information Technology • Insurance • Professional Services • Software
The Senior Site Reliability Engineer will enhance system reliability through infrastructure automation, support core applications, and optimize performance, collaborating with development teams on deployment processes.
Top Skills: AWSCdktfCircleCICloudfrontDockerElasticsearchGithub ActionsJenkinsLambdaRdsRedisRuby On RailsS3TerraformTypescript
Reposted 16 Days AgoSaved
In-Office or Remote
2 Locations
160K-179K Annually
Senior level
160K-179K Annually
Senior level
Fintech • Payments
The Senior Staff SRE leads reliability engineering initiatives, drives operational excellence, mentors staff, and influences architecture to enhance system reliability and performance.
Top Skills: Ai/MlAWSAzureDockerElk StackGCPGrafanaKubernetesMySQLNoSQLPostgresSplunk
Reposted 7 Days AgoSaved
Remote
United States of America
89K-184K Annually
Entry level
89K-184K Annually
Entry level
AdTech • Digital Media • Information Technology • Other
As a Software Engineer in the Tooling and Reliability Platforms team, you'll develop AI services, manage incident tools, and utilize Infrastructure as Code for high-availability systems. You'll focus on integrating AI workflows and improving operational resilience for Yahoo's brands.
Top Skills: AWSCloudFormationDockerGCPGoJavaKubernetesPythonTerraform
8 Days AgoSaved
Remote
United States
175K-200K Annually
Senior level
175K-200K Annually
Senior level
Artificial Intelligence • Healthtech • HR Tech • Software
Own the Heroku-to-GCP migration, maintain Postgres and data pipelines, optimize high‑traffic code paths, build monitoring/alerting, lead incident response and post‑mortems, reduce costs and scale proactively, and coach other infrastructure engineers.
Top Skills: AppsignalBigQueryBugsnagCannyClaude CodeFivetranGoogle Cloud PlatformHerokuHexHotwireInfrastructure-As-CodePostgresRuby On Rails
8 Days AgoSaved
Remote
USA
143K-243K Annually
Senior level
143K-243K Annually
Senior level
Healthtech • Information Technology • Telehealth
Lead the design, build, and operation of scalable observability and telemetry platforms. Implement IaC and automation, support monitoring/alerting, troubleshoot production distributed systems, participate in incident response/on-call, and mentor engineers while driving platform reliability and cross-team technical decisions.
Top Skills: AWSBashCi/CdClickhouseCloudFormationDockerElasticsearchFluentdGoGrafanaKafkaKubernetesLinuxOpensearchOpentelemetryPrometheusPythonTcpdumpTerraformVectorWireshark
New

Track Smarter, Apply Better.

Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.

Use For Free
Application Tracker Preview
Reposted 8 Days AgoSaved
Remote
USA
Senior level
Senior level
Information Technology • Software • Consulting
The Senior SRE will design and implement automated Dynatrace configurations, integrate REST APIs, and develop TypeScript tooling for platform reliability, while ensuring observability and automation practices are followed.
Top Skills: APIsAws CloudformationAws CodebuildAws LambdaDynatraceTypescript
Reposted 8 Days AgoSaved
Remote
United States
212K-265K Annually
Expert/Leader
212K-265K Annually
Expert/Leader
Real Estate • Travel • PropTech
The Engineering Manager for Storage SRE will lead a team to ensure reliable database operations, improve developer experience, and expand tooling and operational models, focusing on mission-critical systems.
Top Skills: Cloud InfrastructureDatabasesSite Reliability EngineeringStorage Systems
9 Days AgoSaved
Remote
United States
Mid level
Mid level
Blockchain • Software
Build, operate, and scale production Kubernetes infrastructure using GitOps and declarative IaC. Design CI/CD workflows, observability, and secure-by-default systems. Troubleshoot networking/storage, participate in on-call rotations, automate operational workflows, and drive postmortems and reliability improvements.
Top Skills: ArbitrumArgocdArgocd ApplicationsetsAWSAzureBashCloudwatchCodebuildGCPGithub ActionsGitopsGoGrafanaK9SKubernetesLinuxLokiMimirPrometheusPrysmPythonTerraformYamlZerodev
9 Days AgoSaved
Remote
US
123K-144K Annually
Senior level
123K-144K Annually
Senior level
Internet of Things
Maintain and evolve an EKS-based Kubernetes platform, build CI/CD pipelines (GitHub Actions, OIDC), manage IaC with Pulumi/Terraform/OpenTofu on AWS, operate observability stack, enforce security best practices, diagnose production incidents, participate in on-call rotation, and produce runbooks and documentation to improve reliability.
Top Skills: AWSAws Secrets ManagerEksExternal Secrets OperatorGithub ActionsGrafanaIamKubernetesOidcOpentofuPulumiTerraformVectorVictorialogsVictoriametrics
9 Days AgoSaved
Remote
USA
62K-144K Annually
Senior level
62K-144K Annually
Senior level
Internet of Things
Operate and evolve an EKS-based Kubernetes platform, build CI/CD pipelines, manage infrastructure as code (Pulumi/Terraform/OpenTofu) across AWS, maintain observability and security practices, respond to incidents and perform post-mortems, participate in on-call rotation, and produce runbooks and architecture documentation while collaborating with distributed engineering teams.
Top Skills: ArgocdAWSAws Secrets ManagerExternal Secrets OperatorFluxGithub ActionsGrafanaImapKeycloakKubernetes (Eks)OidcOpentofuPulumiSmtpTerraformVectorVictorialogsVictoriametrics
Reposted 9 Days AgoSaved
Remote
USA
Senior level
Senior level
Fintech • Information Technology
As a Site Reliability Engineer at Alpaca, you will ensure system reliability and performance, troubleshoot issues, and collaborate with teams to design scalable features.
Top Skills: GoGormLinuxPgxPostgresPrometheusSqlc
Reposted 9 Days AgoSaved
Remote
USA
113K-176K Annually
Senior level
113K-176K Annually
Senior level
Other • Social Impact
As a Senior Site Reliability Engineer, you will manage and improve Wikimedia's infrastructure, handle operational tasks, automate processes, and provide mentorship while participating in a 24/7 on-call rotation.
Top Skills: AnsibleBashDebianGoGrafanaHhvmKubernetesMemcachedPHPPrometheusPuppetPythonRedisRuby
Reposted 9 Days AgoSaved
Remote
USA
Senior level
Senior level
Artificial Intelligence • Fintech • Software • Financial Services
The SRE will own reliability for a cloud-native platform, optimizing performance, availability, and observability, while mentoring engineering teams.
Top Skills: AWSClickhouseGoKafkaKubernetesPulumiPythonTerraform
Reposted 9 Days AgoSaved
Remote
United States
82K-229K Annually
Senior level
82K-229K Annually
Senior level
Cloud • Software
Design, implement, and support Kubernetes and compute platforms in a private cloud. Oversee architecture and standardization across hardware, OS, and cloud orchestration.
Top Skills: AnsibleBashCi/CdHelmKubernetesLinuxOpenstackPythonTerraformUbuntu
Reposted 9 Days AgoSaved
Remote
United States
150K-200K Annually
Senior level
150K-200K Annually
Senior level
Cloud • Information Technology
As a Sr. Site Reliability Engineer, you'll ensure service reliability, build automation, and collaborate on infrastructure improvements while mentoring others.
Top Skills: AnsibleCatchpointDockerElkGoGrafanaHashicorp VaultJenkinsKubernetesLinuxPrometheusPythonTerraform
Reposted 9 Days AgoSaved
Remote
United States
173K-321K Annually
Senior level
173K-321K Annually
Senior level
Cloud • Security • Software • Cybersecurity
Design and maintain reliable infrastructure solutions for a cloud data protection platform. Ensure application scalability and support through CI/CD and monitoring tools while collaborating in a global team.
Top Skills: AppinsightsAws CloudformationAzure Api ManagementAzure Arm TemplatesAzure Cosmos DbAzure DevopsAzure Entra IdAzure FunctionsAzure MonitorAzure Storage ServicesBashBitbucketElastic StackGitGoMicrosoft TfsPowershellPythonServerless FrameworkTerraform
Reposted 9 Days AgoSaved
Remote
United States
150K-200K Annually
Mid level
150K-200K Annually
Mid level
Software
As a Senior Site Reliability Engineer at Regrello, you'll shape the developer platform, collaborate with customers, and ensure the reliability and security of infrastructure and applications.
Top Skills: AWSAzureCircleCIGCPGithub ActionsGitlab CiGoKubernetesTerraform
Reposted 10 Days AgoSaved
Remote
United States
115K-135K Annually
Mid level
115K-135K Annually
Mid level
Aerospace • Manufacturing
As a Site Reliability Engineer, you'll build and manage observability platforms for satellite communications, define SLOs/SLIs, and collaborate on incident response and deployment automation.
Top Skills: ArgocdAWSElkGCPGoGrafanaIstioJaegerKubernetesLinkerdLokiOpentelemetryPrometheusPythonTempoTerraform
All Filters
JobType
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account