Top Remote Reliability Engineer Jobs in Chicago, IL

Reposted Yesterday
In-Office or Remote
2 Locations
160K-179K Annually
Senior level
Fintech • Payments
The Senior Staff SRE leads reliability engineering initiatives, drives operational excellence, mentors staff, and influences architecture to enhance system reliability and performance.
Top Skills: AI/ML, AWS, Azure, Docker, ELK Stack, GCP, Grafana, Kubernetes, MySQL, NoSQL, Postgres, Splunk
Reposted 19 Days Ago
Easy Apply
Remote
USA
219K-245K Annually
Expert/Leader
Big Data • Healthtech • HR Tech • Machine Learning • Software • Telehealth • Big Data Analytics
The Staff Site Reliability Engineer will architect, operate, and improve the platform while ensuring security compliance and enhancing development processes.
Top Skills: AWS, Elasticsearch, Istio, Kubernetes, NATS, Node.js, Postgres, Python, React, Terraform, TypeScript
Reposted 19 Days Ago
Easy Apply
Remote
USA
186K-219K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Senior Site Reliability Engineer will build and scale identity management tools, automate operations, ensure security, and support AWS, GCP, and Azure environments.
Top Skills: Ansible, AWS, Azure, C#, Cloud Identity Providers, Docker, GCP, Go, Infrastructure as Code, Java, Kubernetes, Python, Ruby, Terraform
20 Days Ago
Remote or Hybrid
United States
170K-215K Annually
Senior level
HR Tech • Information Technology • Professional Services • Sales • Software
Own and operate production-grade Kubernetes infrastructure on AWS, build GitOps CI/CD with GitHub Actions and ArgoCD, develop AI agents and internal DevOps tooling, maintain Datadog-based observability, and manage on-call incident response while collaborating with engineering teams to improve reliability and delivery speed.
Top Skills: AI/LLM, ArgoCD, AWS, CI/CD, Datadog, GitHub Actions, GitOps, Go, Kubernetes, Python
Reposted 15 Days Ago
Easy Apply
Remote
United States
130K-150K Annually
Mid level
Marketing Tech
The Cloud Reliability Engineer develops, configures, and deploys cloud tools, enhances applications, ensures observability, and participates in on-call rotations.
Top Skills: AWS, CI/CD, Docker, GitHub Actions, Go, Google BigQuery, GCP, Kubernetes, Linux, Python, SQL, Terraform
Reposted 21 Days Ago
Easy Apply
Remote
USA
186K-219K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The role involves improving software reliability, automating processes, collaborating with teams on system optimization, and mentoring engineers to establish reliability as a core value.
Top Skills: AWS, Azure, Datadog, Docker, EC2, GCP, Go, Kibana, Kubernetes, Ruby, Terraform
7 Days Ago
In-Office or Remote
Chicago, IL, USA
173K Annually
Senior level
Logistics • Software • Transportation
Design and maintain infrastructure and software architecture, focusing on automation, observability, security, and developer productivity. Troubleshoot issues and optimize databases.
Top Skills: Go, Infrastructure as Code, JavaScript, Kubernetes, Linux, Python, Shell Script
20 Days Ago
Easy Apply
Remote
United States
Senior level
Database • Analytics
Drive reliability, availability, scalability, and performance of ClickHouse Core. Build alerts, run incident response and blameless postmortems, debug production issues, submit fixes, lead chaos engineering and on-call/escalation processes.
Top Skills: ClickHouse, ClickHouse Cloud, SQL, Shell, Python, C++, AWS, Azure, Google Cloud Platform
23 Days Ago
Remote
USA
Senior level
Database
Manage and optimize Postgres databases at scale on AWS RDS, own reliability/monitoring, execute low-downtime upgrades and migrations, troubleshoot production issues, participate in on-call rotation, and collaborate with platform and product teams.
Top Skills: AWS RDS, Barman, Go, pgBackRest, Postgres, TypeScript, WAL-G
23 Days Ago
Remote
United States
145K-180K Annually
Senior level
Legal Tech • Software
Lead automation and optimization of Filevine's data platform: performance tune MSSQL/Postgres, optimize Snowflake, provision infrastructure with Terraform/AWS, run stateful containers on Kubernetes, integrate AI/LLM and MCP for operational automation, manage CI/CD, capacity planning, documentation, and serve in 24/7 on-call rotation.
Top Skills: Microsoft SQL Server (MSSQL), PostgreSQL, Snowflake, Terraform, AWS, Docker, Kubernetes, GitLab, Octopus Deploy, Python, PowerShell, C#, Entity Framework, Dapper, DynamoDB, OpenSearch, Redis, MCP (Model Context Protocol), LLMs
Reposted 5 Hours Ago
Remote
IL, USA
100K-171K Annually
Senior level
Insurance
The role entails managing and operating the Cyber Recovery Environment, ensuring resilience against cyber attacks through system design, implementation, and maintenance of storage infrastructure.
Top Skills: Amazon S3, Automation, AWS, Azure, Cyber Recovery Infrastructure, Fibre Channel, GCP, Hitachi SAN, IaaS, iSCSI, MongoDB, NAS, NetApp NAS, NFS, S3, SAN, SMB/CIFS, Storage Solutions
6 Hours Ago
Remote
United States
Senior level
Logistics • Software • Transportation
Lead and mentor teams in DevOps and SRE, architect scalable Azure Cloud infrastructure, implement CI/CD and IaC, ensure database reliability, and drive cross-functional collaboration.
Top Skills: Azure Cloud, Azure DevOps, CI/CD, Cosmos DB, Docker, ELK, Grafana, Kubernetes, MySQL, Postgres, Prometheus, Redis, SQL Server, Terraform
Reposted 12 Hours Ago
Remote
United States
Mid level
Healthtech • Software
Maintain reliability, performance, and scalability of cloud-hosted services and databases. Implement SRE best practices, define SLIs/SLOs, respond to incidents, build monitoring and automation, perform DBA tasks (backups, restores, tuning), support CI/CD and DB migrations, and document runbooks and procedures.
Top Skills: Amazon RDS, Azure SQL Database, Bash, ECS Fargate, Flyway, GitLab, Jenkins, Kubernetes, Liquibase, Octopus Deploy, Oracle, Postgres, PowerShell, Python, Redis, SolarWinds DPA, SQL Server
Reposted 12 Hours Ago
Remote
United States
128K-165K Annually
Senior level
Insurance
Lead reliability strategy and architecture for critical systems, drive incident management and root-cause analysis, build automation and SRE tooling, influence release/change practices and compliance, and mentor junior engineers to improve operational reliability.
Top Skills: Angular, AWS, CI/CD, CloudFormation, Containerization, Java, JavaScript, Netty, Next.js, Node.js, Non-Relational Databases, Observability (Metrics, Logs, Tracing), Orchestration, ORM, React, Relational Databases, ServiceNow, Spring, Spring Boot, Tomcat
Reposted 12 Hours Ago
In-Office or Remote
United States
Senior level
Software
The role involves managing compute infrastructure for decentralized applications, requiring critical thinking, documentation skills, and experience in Kubernetes and blockchain management.
Top Skills: Blockchain, GitOps, Infrastructure-as-Code, Kubernetes, Programming Languages
Yesterday
In-Office or Remote
2 Locations
Senior level
Big Data • Cloud • Information Technology
The Site Reliability Engineer at Iron Mountain will troubleshoot escalated tickets, manage Windows Server builds, perform security patching, and collaborate with customers and vendors to resolve issues and maintain systems.
Top Skills: Cloud, Compute, Hyper-Converged Infrastructure, Linux, Microsoft Endpoint Configuration Manager, Network, Nutanix, PowerShell, Rubrik, Storage, Virtualization, Windows Server
Reposted Yesterday
Easy Apply
Remote
USA
Senior level
Artificial Intelligence • eCommerce • Retail
Lead the SRE and DevOps team, ensure infrastructure reliability, oversee cloud operations, drive automation, and collaborate cross-functionally.
Top Skills: Azure, Bash, CI/CD, Datadog, Docker, ELK Stack, Go, Grafana, Kubernetes, PowerShell, Prometheus, Python, Terraform
Reposted Yesterday
Easy Apply
Remote
United States
172K-215K Annually
Senior level
Aerospace • Big Data • Greentech • Hardware • Social Impact
Design, deploy, and operate compute services for on-premises and cloud satellite imaging platforms. Build reproducible, scalable, highly available deployments, troubleshoot distributed systems, optimize constrained environments, document and automate operations, and participate in on-call rotations to ensure reliability for customer-facing and air-gapped deployments.
Top Skills: Alloy, Ansible, Bash, CUDA, GitOps, Grafana, Helm, Jira, K3s, Kubernetes, Kustomize, OpenTelemetry, Prometheus, Proxmox, Python, RKE2, Talos, Terraform
Reposted Yesterday
Easy Apply
Remote
United States
150K-185K Annually
Mid level
Software
Join the SRE team to improve monitoring, alerting, observability, and reliability of Fireblocks' production systems. Triage incidents, run RCA, create runbooks and automation (Python, Lambda, shell, Ansible, ArgoCD), collaborate with R&D/support, and participate in on-call rotation.
Top Skills: Ansible, ArgoCD, AWS, AWS Lambda, Azure, Bash, Bitbucket, C++, Chef, Coralogix, Datadog, Docker, Gerrit, Git, GitLab, GCP, Helm, JavaScript, Kubernetes, Linux, MySQL, New Relic, Nginx, Node.js, Phabricator, Prometheus, Puppet, Python, Shell, Splunk
Reposted Yesterday
Remote
USA
110K-130K Annually
Senior level
Real Estate • Financial Services • PropTech
As a Site Reliability Engineer, you will support AWS Cloud products, optimize processes, enhance automation, and ensure system reliability and performance.
Top Skills: ArgoCD, AWS, Azure DevOps, Bash, CI/CD, CloudWatch, Docker, EKS, FluxCD, Git, Kubernetes, PowerShell, Python, SQL, Terraform
2 Days Ago
Easy Apply
Remote
US
110K-175K Annually
Senior level
Cloud • Software
In this role, you'll support large-scale applications, improve observability, mentor team members, and ensure reliability by collaborating on deployments and writing automation scripts while providing 24/7 support.
Top Skills: Ansible, AWS, Bash, Confluence, Docker, ELK Stack, GCP, GitLab CI/CD, Grafana, Jenkins, Jira, Kubernetes, Linux, MongoDB, MySQL, Nagios, OCI, Perl, Postgres, Prometheus, Puppet, Python, Terraform
Reposted 2 Days Ago
Easy Apply
Remote or Hybrid
USA
117K-147K Annually
Senior level
Cloud • Security • Software
Design, build, and maintain cloud-hosted infrastructure and CI/CD pipelines for a large identity platform. Improve deployment automation, reliability, observability, and cost optimization. Collaborate across teams, evaluate technologies, participate in planning, and join an on-call rotation to support production services.
Top Skills: CI/CD, Docker, GCP, Git, Go, Kubernetes
Reposted 2 Days Ago
Easy Apply
Remote
United States
170K-200K Annually
Senior level
Software
Lead SRE to define SRE strategy, architecture, and roadmap; design and operate containerized, compliant cloud environments; build observability, incident management, automation, and developer platform capabilities; mentor SRE team and collaborate with security, compliance, and product teams to ensure reliability at scale.
Top Skills: AWS, AWS Marketplace, Azure, Azure Marketplace, GCP, Google Cloud Marketplace, Grafana, Kubernetes, Prometheus, Terraform
25 Days Ago
Remote
United States
175K-275K Annually
Senior level
Software
Own reliability, performance, and scalability of PostgreSQL infrastructure. Implement HA, replication, observability, capacity planning, automation, and DR. Support engineering teams with migrations, query optimization, on-call incident response, runbooks, and tooling to enable safe DB operations.
Top Skills: Ansible, Aurora, AWS RDS, Chef, Datadog, DynamoDB, ElastiCache, Go, Grafana, Indexing, MVCC, Patroni, PgBouncer, Postgres, Prometheus, Python, Query Planner, Replication, Ruby, SQL, Terraform, Vacuum Tuning, WAL
12 Days Ago
In-Office or Remote
Chicago, IL, USA
230K-330K Annually
Senior level
Travel
The Senior Site Reliability Engineer will automate and optimize infrastructure on Google Cloud, improve cost efficiency, and support on-call incidents, working closely with the engineering teams.
Top Skills: Bash, Containers, Datadog, GCP, Helm, Istio, Kubernetes, Kustomize, Python, SQL