Job Title, Company or Keyword

Maximum of 25 job preferences reached.

Top Remote Reliability Engineer Jobs in Chicago, IL

Runpod

Site Reliability Engineer

Reposted 2 Hours AgoSaved

Easy Apply

Remote

USA

Easy Apply

150K-200K Annually

Senior level

150K-200K Annually

Senior level

Artificial Intelligence • Cloud • Software • Infrastructure as a Service (IaaS)

As a Site Reliability Engineer, you will ensure system stability and resilience, define reliability standards, and automate operational processes while collaborating cross-functionally to improve performance and reduce incidents.

Top Skills: BashCi/CdDockerGoGrafanaKubernetesLinuxPrometheusPython

Dropbox

Staff Site Reliability Engineer, Production Engineering

Reposted 2 Hours AgoSaved

Remote

United States

223K-302K Annually

Expert/Leader

223K-302K Annually

Expert/Leader

Artificial Intelligence • Cloud • Consumer Web • Productivity • Software • App development • Data Privacy

The role involves defining reliability strategies, leading initiatives across teams, enhancing monitoring and incident response, and mentoring engineers at Dropbox.

Top Skills: Ai TechnologiesDebuggingDistributed SystemsIncident ResponseObservabilityReliability Risk ManagementSlasSlos

Domino Data Lab

Staff Site Reliability Engineer

YesterdaySaved

Easy Apply

Remote or Hybrid

Easy Apply

200K-230K Annually

Senior level

200K-230K Annually

Senior level

Artificial Intelligence • Machine Learning

Lead development of AI-assisted reliability tooling, own incident response end-to-end, improve observability and SLO/SLI frameworks, scale single-tenant SaaS operations, mentor engineers, and reduce recurring operational toil through engineering and automation.

Top Skills: Cloud PlatformsGoKubernetesLinuxLlm/Ai ToolingLogs And TracingObservability ToolingPythonSlo/Sli Frameworks

MongoDB

Site Reliability Engineer (Senior or Staff), Atlas

Reposted 10 Days AgoSaved

Easy Apply

Remote or Hybrid

2 Locations

Easy Apply

127K-249K Annually

Senior level

127K-249K Annually

Senior level

Big Data • Cloud • Software • Database

As a Senior Site Reliability Engineer, you'll design and build complex systems, support Atlas platform operations, automate processes, and ensure high availability of services.

Top Skills: AWSAzureDnsGCPGoHTTPLinuxPythonRubyTls

Zscaler

Site Reliability Engineer-SkillBridge Intern

Reposted 3 Days AgoSaved

Easy Apply

Remote or Hybrid

USA

Easy Apply

Internship

Cloud • Information Technology • Security • Software • Cybersecurity

This internship role focuses on SRE skills, requiring collaboration and problem-solving in dynamic environments for Zscaler's Zero Trust Exchange team.

Top Skills: AnsibleAws EcsKubernetesLinuxPythonTerraform

MongoDB

Senior Site Reliability Engineer, Fleet Management

Reposted 14 Days AgoSaved

Easy Apply

Remote or Hybrid

2 Locations

Easy Apply

127K-249K Annually

Senior level

127K-249K Annually

Senior level

Big Data • Cloud • Software • Database

Develop and maintain Kubernetes runtime environments, support developers, resolve critical issues, and participate in on-call rotations for production systems.

Top Skills: AWSAzureCert-ManagerCorednsCrdsCriCsiGatekeeperGCPGoHelmKubernetesKustomizeOperatorsPythonTerraform

PointClickCare

Senior Database Reliability Engineer (SQL)

Reposted 2 Hours AgoSaved

Remote

USA

146K-162K Annually

Senior level

146K-162K Annually

Senior level

Healthtech • Software

The Database Reliability Engineer manages and maintains cloud-based database infrastructures for SaaS applications, focusing on automation, process improvement, and collaboration with engineering teams.

Top Skills: AnsibleAWSAzureAzure Data FactoryC#DatabricksGCPGitGrafanaInfluxdbMySQLPostgresPowershellPythonSQLSQL ServerTerraform

ClickHouse

Database Reliability Engineer - Core Team

Reposted 2 Hours AgoSaved

Remote

United States

75K-150K Annually

Senior level

75K-150K Annually

Senior level

Database • Analytics

As a Database Reliability Engineer at ClickHouse, you'll improve reliability, manage escalation processes, support incident response, and enhance database performance while collaborating across teams.

Top Skills: AWSAzureC++ClickhouseGoogle Cloud PlatformPythonShellSQL

Nokia

Senior HW Reliability Engineer

YesterdaySaved

Remote or Hybrid

United States

Senior level

Software

Lead reliability engineering for Silicon Photonics hardware: define and validate reliability models, perform MTBF/MTBCF predictions, analyze field data, direct verification testing and root-cause analysis, drive corrective actions, and mentor cross-functional teams to improve product reliability.

Top Skills: Derating AnalysisDfmeaMtbcfMtbfSherlockSilicon PhotonicsTelcordiaThermal DesignWindchill Qs

MongoDB

Site Reliability Engineer (Senior or Staff), Infrastructure Security

Reposted 6 Days AgoSaved

Easy Apply

Remote or Hybrid

United States

Easy Apply

127K-249K Annually

Senior level

127K-249K Annually

Senior level

Big Data • Cloud • Software • Database

The Senior Site Reliability Engineer will lead security design and implementation for cloud infrastructures, mentor teams, and automate security solutions.

Top Skills: AnsibleAWSAzureCloud Security ToolsCloudFormationGCPGoTerraform

Babylist

Staff Engineer, Site Reliability

7 Days AgoSaved

Easy Apply

Remote or Hybrid

United States

Easy Apply

227K-272K Annually

Senior level

227K-272K Annually

Senior level

eCommerce • Healthtech • Kids + Family • Retail • Social Media

Own and evolve Babylist's AWS infrastructure and developer platform using Terraform and Kubernetes. Improve CI/CD reliability, support engineers across environments, define monitoring and alerting standards, lead incident response and postmortems, and shape platform architecture to scale for millions of users.

Top Skills: AWSCdnCircleCICronitorDatadogDnsEksGithub ActionsKubernetesLoad BalancersMySQLPagerdutyRdsRedisRuby On RailsSentrySidekiqTerraform

Circle (circle.so)

Senior Site Reliability Engineer

7 Days AgoSaved

Easy Apply

Remote

United States

Easy Apply

130K-140K Annually

Senior level

130K-140K Annually

Senior level

Artificial Intelligence • Consumer Web • Digital Media • Information Technology • Social Impact • Software

Lead SRE work to keep Circle highly available and performant: respond to incidents, own monitoring/alerting/log management, manage and optimize MySQL/Postgres/ClickHouse/Redis databases, maintain server infrastructure and deployment pipelines, collaborate with engineering teams, and build internal SRE tooling and automation.

Top Skills: AWSClickhouseKubernetesLlm-Based Tools (Copilots)MySQLPostgresRedis

New

Track Smarter, Apply Better.

Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.

Use For Free

OpsMill

Product Reliability Engineer | US

2 Days AgoSaved

Remote

2 Locations

Mid level

Information Technology • Software • Database • Automation

Owner of on-prem reliability and escalations: reproduce and resolve L2/L3 issues across heterogeneous Kubernetes environments, build diagnostics and automation, improve CI and e2e test stability, establish performance baselines, harden install/upgrade flows, and write tooling in Python/Go/Rust to reduce repeat incidents.

Top Skills: BenchmarkingCiCi/CdContainersE2E TestingGoHealth ChecksHelmInstallersIntegration TestingKubernetesLoad GenerationLogsMetricsNetworkingObservabilityPackagingProfilingPythonRbacRustStorageSupport BundlesTraces

Coinbase

Senior Site Reliability Engineer, Workforce Identity

8 Days AgoSaved

Easy Apply

Remote

USA

Easy Apply

186K-219K Annually

Senior level

186K-219K Annually

Senior level

Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3

Own reliability, automation, and DevOps for Coinbase's corporate IAM platform: on-call/incident response, CI/CD and IaC pipelines, identity lifecycle tooling, observability and disaster recovery, documentation, and cross-team IAM advisement to ensure secure, scalable access for a global workforce.

Top Skills: AbacAuth0AWSAzureC#Ci/CdContainer OrchestrationDuoEntraidGCPGenerative AiGitGoIacJavaMfaOktaPingPythonRbacRubySsoTerraform

Coinbase

Senior Site Reliability Engineer, Core AI Infrastructure

8 Days AgoSaved

Easy Apply

Remote

USA

Easy Apply

186K-219K Annually

Senior level

186K-219K Annually

Senior level

Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3

Senior SRE on the IT Operations team owning reliability, monitoring, and incident response for AI infrastructure. Build automation, CI/CD and Kubernetes tooling, improve observability and documentation, and develop internal full-stack tools using Go or Python. Partner with Infrastructure, Security, and Compliance to scale secure, resilient AI deployment pipelines.

Top Skills: AnsibleAWSBashChefCi/CdDockerEc2GitGoKubernetesLinuxPuppetPythonRubySaltTerraform

Coinbase

Staff Site Reliability Engineer, Core AI Infrastructure

8 Days AgoSaved

Easy Apply

Remote

USA

Easy Apply

218K-257K Annually

Senior level

218K-257K Annually

Senior level

Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3

Own reliability, monitoring, and incident response for AI infrastructure; build automation and CI/CD tooling; manage Kubernetes/Docker production workloads; partner with infrastructure, security, and compliance; improve observability and documentation; develop internal full‑stack tooling in Go or Python.

Top Skills: AnsibleAWSBashChefCi/CdDockerEc2GitGoKubernetesLinuxLog AggregationNetwork SecurityPuppetPythonRubySaltTerraform

Zocdoc

Senior Site Reliability Engineer

Reposted 9 Days AgoSaved

Easy Apply

Remote or Hybrid

USA

Easy Apply

180K-220K Annually

Senior level

180K-220K Annually

Senior level

Healthtech • Information Technology • Software • Telehealth

The Senior Site Reliability Engineer will develop, monitor, and maintain distributed production systems, ensuring uptime for patients and providers while automating processes and supporting a large engineering team.

Top Skills: AWSDockerGCPKubernetes

Synchrony

AVP, Reliability Engineer - OnePay

14 Days AgoSaved

In-Office or Remote

2 Locations

100K-170K Annually

Senior level

100K-170K Annually

Senior level

Fintech • Financial Services

The AVP, Reliability Engineer ensures high availability and performance of OnePay applications, troubleshooting issues, enhancing monitoring capabilities, and developing automation for operational excellence in a cloud environment.

Top Skills: AnsibleBashChefDevOpsGoJavaScriptJenkinsNew RelicPowershellPuppetPythonSplunkSreTerraform

Capital One

Sr. Distinguished Engineer, Acquisitions Platform & SRE Foundations

Reposted 21 Days AgoSaved

Remote or Hybrid

Chicago, IL, USA

286K-392K Annually

Senior level

286K-392K Annually

Senior level

Fintech • Machine Learning • Payments • Software • Financial Services

The role involves leading the Card Acquisitions engineering organization, promoting engineering excellence, mentoring engineers, and delivering innovative solutions. Responsibilities include system design, hands-on coding, and developing a multi-year strategy to enhance operational efficiency and customer acquisition through advanced technologies.

Top Skills: GoJavaJavaScriptPublic Cloud TechnologiesPythonSpa FrameworksTypescript

HiBob

Senior Site Reliability Engineer - Remote EST

Reposted 12 Days AgoSaved

Remote or Hybrid

United States

190K-235K Annually

Senior level

190K-235K Annually

Senior level

HR Tech • Information Technology • Professional Services • Sales • Software

Own and operate production-grade Kubernetes infrastructure on AWS, build GitOps CI/CD with GitHub Actions and ArgoCD, develop AI agents and internal DevOps tooling, maintain Datadog-based observability, and manage on-call incident response while collaborating with engineering teams to improve reliability and delivery speed.

Top Skills: Ai/LlmArgocdAWSCi/CdDatadogGithub ActionsGitopsGoKubernetesPython

Nokia

Senior Reliability Engineer

8 Days AgoSaved

Remote or Hybrid

United States

Senior level

Software

Lead reliability activities for photonic integrated circuits (PICs): evaluate failure modes, coordinate accelerated stress tests, develop life models from aging-data, and drive failure mode analyses across design, development, and production teams.

Upstart

Senior Software Engineer, Site Reliability

Reposted 13 Days AgoSaved

Easy Apply

Remote

United States

Easy Apply

167K-231K Annually

Senior level

167K-231K Annually

Senior level

Artificial Intelligence • Fintech • Machine Learning • Social Impact • Software

Lead technical direction for software architecture and cross-team initiatives focusing on scaling consumer-facing systems and maximizing loan originations while maintaining compliance and system integrity.

Top Skills: AWSCi/CdDockerGithub ActionsInfrastructure As CodeReactRuby On Rails

Deepgram

Site Reliability Engineer - AI & ML Infrastructure (Kubernetes, AWS & Terraform)

Reposted 13 Days AgoSaved

Remote

USA

150K-220K Annually

Senior level

150K-220K Annually

Senior level

Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Conversational AI

The engineer will build and operate AI/ML infrastructure, managing services on AWS and bare metal, using tools like Kubernetes and Terraform.

Top Skills: AWSBashGoKubernetesPythonSlurmTerraform

Filevine

Senior Database Reliability Engineer

Reposted 12 Days AgoSaved

Remote

United States

145K-180K Annually

Senior level

145K-180K Annually

Senior level

Legal Tech • Software

Lead automation and optimization of Filevine's data platform: performance tune MSSQL/Postgres, optimize Snowflake, provision infrastructure with Terraform/AWS, run stateful containers on Kubernetes, integrate AI/LLM and MCP for operational automation, manage CI/CD, capacity planning, documentation, and serve in 24/7 on-call rotation.

Top Skills: AWSC#DapperDockerDynamoDBEntity FrameworkGitlabKubernetesLlmsMcp (Model Context Protocol)Microsoft Sql Server (Mssql)Octopus DeployOpensearchPostgresPowershellPythonRedisSnowflakeTerraform

Zingtree

Senior DevOps / Platform Reliability Engineer

Reposted 13 Days AgoSaved

Remote

East Coast, USA

Senior level

Software

As a Senior DevOps / Platform Reliability Engineer, you will manage CI/CD pipelines, automate infrastructure, operate Kubernetes, and enhance observability while ensuring security and compliance for enterprise systems.