Top Remote Senior Site Reliability Engineer Jobs in Chicago, IL
Security • Cybersecurity
The Staff Site Reliability Engineer will lead reliability strategy, architecture, and incident response while mentoring engineers and improving operational excellence.
Top Skills:
AWS, CI/CD, GitHub Actions, JavaScript, Python, Ruby, Terraform
Information Technology • Security • Cybersecurity
The Staff/Principal Site Reliability Engineer leads infrastructure initiatives, architects solutions for cloud and SaaS, and collaborates cross-functionally to enhance reliability and innovation.
Top Skills:
AWS, Bash, Bazel, Cuelang, Datadog, GitOps, Go, Grafana, Helm, Kubernetes, Linux, Prometheus, Python, Terraform
Information Technology • Software
The Site Reliability Engineer manages system reliability, performance, and scalability for end-user services, leading software deployments, incident management, and service quality improvements. Responsibilities include collaboration with teams, maintaining a product roadmap, and automation of processes.
Top Skills:
Agile, Aternity, DevSecOps, ITIL, PowerShell, Python
Blockchain • Software
As a Senior Engineer, SRE/DevOps, you will enhance blockchain infrastructure reliability, automate deployment, and collaborate on CI/CD practices while ensuring security and performance optimization.
Top Skills:
Ansible, AWS, Bash, CloudTrail, CloudWatch, Cosmos, Docker, ELK Stack, Ethereum, GCP, K8s, Kubernetes, Opsgenie, Pingdom, Python, Terraform
Cloud
Join Arista Networks as a Site Reliability Engineer to manage CloudVision service reliability, scalability, and stability in a FedRAMP environment, focusing on areas like architecture, security, and performance optimization.
Top Skills:
Ansible, Bash, GCP, GKE, Go, Kubernetes, Pulumi, Python
Healthtech • Information Technology • Software • Telehealth
The Senior Site Reliability Engineer will develop, monitor, and maintain distributed production systems, ensuring uptime for patients and providers while automating processes and supporting a large engineering team.
Top Skills:
AWS, Docker, GCP, Kubernetes
Artificial Intelligence • Fintech • Machine Learning • Natural Language Processing • Business Intelligence
Lead architecture and build reliability platforms, drive AIOps automation, champion SRE practices, lead incident response and postmortems, advance observability, and mentor engineers to improve system reliability and performance.
Top Skills:
AIOps, AWS, Azure, Continuous Profiling, Datadog, DNS, ELK, GCP, Go, Grafana, HTTP/S, Kubernetes, Load Balancing, OpenTelemetry, Prometheus, Python, TCP/IP
Reposted 13 days ago
Marketing Tech • Mobile • Software
Lead the Site Reliability Engineering team, ensuring platform reliability, scalability, and developer support while fostering an inclusive environment and coaching team members.
Top Skills:
Ember, Go, React
Reposted 13 days ago
Software
The Senior SRE Manager will establish an SRE team, implement best practices, manage incidents, and enhance system reliability, scaling operations effectively.
Top Skills:
Cloud Infrastructure, Distributed Systems, Observability
Reposted 19 days ago
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Senior Site Reliability Engineer will build and scale identity management tools, automate operations, ensure security, and support AWS, GCP, and Azure environments.
Top Skills:
Ansible, AWS, Azure, C#, Cloud Identity Providers, Docker, GCP, Go, Infrastructure as Code, Java, Kubernetes, Python, Ruby, Terraform
HR Tech • Information Technology • Professional Services • Sales • Software
Own and operate production-grade Kubernetes infrastructure on AWS, build GitOps CI/CD with GitHub Actions and ArgoCD, develop AI agents and internal DevOps tooling, maintain Datadog-based observability, and manage on-call incident response while collaborating with engineering teams to improve reliability and delivery speed.
Top Skills:
AI/LLM, ArgoCD, AWS, CI/CD, Datadog, GitHub Actions, GitOps, Go, Kubernetes, Python
Cloud • Security • Software
As a Staff Site Reliability Engineer, you will design, build, and maintain cloud infrastructure, improve deployment processes, and collaborate across teams.
Top Skills:
CI/CD, Docker, Go, Kubernetes
Information Technology • Other • Software • Consulting
The Site Reliability Engineer at CardioOne will enhance the reliability and performance of production systems, implement automation, and lead incident response efforts while collaborating with development teams.
Top Skills:
Ansible, AWS, Azure, Datadog, Docker, ECS, Java, Kubernetes, Python, Terraform, Terragrunt
Artificial Intelligence • Fintech • Software • Financial Services
Seeking a seasoned SRE to lead reliability for a cloud-native platform, overseeing infrastructure, CI/CD pipelines, observability, and mentoring engineers.
Top Skills:
AWS, ClickHouse, Go, Java, Kafka, Kubernetes, Pulumi, Terraform
Fitness
The Staff Site Reliability Engineer will establish SRE best practices, drive observability strategy, implement software solutions, and mentor engineers. Responsibilities include improving platform resilience, managing risks, and participating in incident response processes.
Top Skills:
Ansible, AWS, Azure, Bash, CloudFormation, GCP, Go, Kubernetes, Pulumi, Python, Terraform
Cloud • Security • Software
As a Senior Staff Site Reliability Engineer, you will design, build, and maintain cloud infrastructure, ensuring software deployment through automated CI/CD pipelines, while collaborating with teams to enhance service delivery.
Top Skills:
CI/CD, Cloud Platforms, Docker, Go, Kubernetes
Information Technology • Cryptocurrency
The Site Reliability Engineer will lead technical initiatives, architect solutions, troubleshoot issues, mentor team members, and improve observability practices.
Top Skills:
ArgoCD, Bash, ELK Stack, GCP, Go, Grafana, Helm, Kubernetes, Prometheus, Python, Terraform
Fintech
As a Site Reliability Engineer, you will enhance system reliability through scalable infrastructure, observability practices, automation, and collaboration with engineering teams.
Top Skills:
AWS, Datadog, Go, Grafana, Java, Kubernetes, Node.js, Prometheus, Pulumi, Python, Terraform
3D Printing • Artificial Intelligence • Software • Design
The role involves building reliable platforms for 3D/4D content delivery to AR/VR devices, monitoring system health, and improving operational practices in collaboration with the engineering team.
Top Skills:
AWS Fargate, CoreWeave, Grafana, Kubernetes, Prometheus, Terraform
Reposted 17 days ago
Cloud • Information Technology
The Site Reliability Engineer will support IaaS services, monitor infrastructure health, perform root cause analysis, automate processes, and collaborate with teams for service reliability.
Top Skills:
Ansible, AWS, Azure, Bash, GitLab CI, Jenkins, Kubernetes, Linux, OpenShift, Python, Terraform, VMware vSphere
Reposted 18 days ago
Analytics
The Site Reliability Engineer will ensure the reliability and performance of IaaS services, perform incident resolution, and enhance system reliability through automation while supporting mobility across hybrid infrastructures and collaborating extensively with various teams.
Top Skills:
Ansible, AWS, Azure, Bash, GitLab CI, Jenkins, Kubernetes, Linux, OpenShift, Python, Terraform, VMware vSphere
Cloud • Software
The Site Reliability Engineer (SRE) will manage reliable, scalable systems, focusing on software development, infrastructure automation, and incident response. Responsibilities include monitoring, CI/CD pipeline management, security compliance, and cost optimization while collaborating with various teams.
Top Skills:
AWS, Azure, Docker, ELK Stack, GCP, Git, Grafana, Java, Kubernetes, PHP, Prometheus, Python, Shell, Terraform
Blockchain • Fintech • Social Media • Cryptocurrency • NFT • Web3
Design, build, and operate scalable, highly available infrastructure and platform software for Zora's blockchain services (indexer, APIs, data pipelines). Automate workflows, maintain core systems, improve developer experience, participate in on-call rotation, and contribute strategic technical direction.
Top Skills:
Asyncio, Base, Bridges, Ceph, Cloudflare Pages Functions, Datadog, Docker, Ethereum, Go, IPFS, Kubernetes, MongoDB, OpenTelemetry, Optimism, Optimistic Rollups, Plasma, Polygon, Postgres, Python, RPC Nodes, Sidechains, Vercel, ZK-Rollups
Security • Software • Analytics
Design, operate, and automate scalable, secure infrastructure for Axiom Cloud. Define SLOs, plan disaster recovery and capacity, tune performance, improve deployment practices, build reliability tooling, respond to incidents, and promote monitoring and observability across teams.
Top Skills:
AWS, Docker, Kubernetes, Amazon EKS, Terraform, Pulumi, Linux, GitHub Actions, GitLab, CircleCI, LLMs, Golang, Monitoring and Observability Tools
Logistics • Software • Transportation
Design and maintain infrastructure and software architecture, focusing on automation, observability, security, and developer productivity. Troubleshoot issues and optimize databases.
Top Skills:
Go, Infrastructure as Code, JavaScript, Kubernetes, Linux, Python, Shell Script