Get the job you really want.
Maximum of 25 job preferences reached.
Top Remote Reliability Engineer Jobs in Chicago, IL
Hardware • Machine Learning • Security • Software
The Site Reliability Engineer will manage software deployment for IoT devices, improve observability, maintain dashboards, automate processes, and collaborate on incident responses.
Top Skills:
AnsibleAWSBashC/C++DatadogGrafanaGroovyJavaJavaScriptNoSQLPostgresPrometheusPythonRSigmaSQLTerraform
Blockchain • Software
As a Site Reliability Engineer at Offchain Labs, you will manage infrastructure in cloud environments, design CI/CD workflows, and enhance system reliability with a focus on blockchain technology.
Top Skills:
ArgocdAWSAzureCodebuildGCPGithub ActionsGoGrafanaKubernetesLokiPrometheusPythonTerraform
Real Estate • Financial Services • PropTech
As a Site Reliability Engineer, you will support AWS Cloud products, optimize processes, enhance automation, and ensure system reliability and performance.
Top Skills:
ArgocdAWSAzure DevopsBashCi/CdCloudwatchDockerEksFluxcdGitKubernetesPowershellPythonSQLTerraform
Artificial Intelligence • eCommerce • Retail
Lead the SRE and DevOps team, ensure infrastructure reliability, oversee cloud operations, drive automation, and collaborate cross-functionally.
Top Skills:
AzureBashCi/CdDatadogDockerElk StackGoGrafanaKubernetesPowershellPrometheusPythonTerraform
Fintech • Financial Services
The Senior Site Reliability Engineer will ensure system reliability and performance, implement automated deployment strategies, and collaborate with cross-functional teams to enhance software delivery and operational responses.
Top Skills:
AWSAzureBashGCPMonitoring ToolsObservability ToolsPowershellPython
Software
The Senior Site Reliability Engineer will ensure CoderPad's multi-cloud platform is reliable and scalable, manage infrastructure, and improve monitoring and incident response.
Top Skills:
AWSBashDatadogGCPGitlab CiGoGrafanaHerokuKubernetesLinuxNode.jsPrometheusPythonTerraform
Cloud • Software • Database
Lead and scale a Site Reliability Engineering team, ensuring system reliability and performance across cloud-native databases, while collaborating with multiple teams.
Top Skills:
Automation ToolsCloud-Native TechnologiesInfrastructure As CodeMySQLObservation And Monitoring ToolsPostgres
Other
As a Senior Site Reliability Engineer, you will manage and optimize Juul's hybrid cloud infrastructure, ensuring operational stability and performance through automation and advanced troubleshooting.
Top Skills:
AWSBashCloudFormationGCPKubernetesNutanixPowershellPythonTerraform
Cloud • Security • Software
As a Staff Site Reliability Engineer, you will design and deliver solutions for cloud-based services, establish automated CI/CD pipelines, and support infrastructure with a focus on security and resiliency while participating in an on-call rotation.
Top Skills:
Ci/CdCloud PlatformsDockerGitGoKubernetes
Computer Vision • Information Technology • Machine Learning • Natural Language Processing • Real Estate • Software
The SRE will maintain infrastructure for SaaS products on AWS, support developers, manage platform components, and handle IT tasks.
Top Skills:
AWSComputer VisionIacLarge Language ModelsNlpTerraform
Logistics • Software • Transportation
Lead DevOps, SRE, and Database teams to build scalable Azure Cloud infrastructure, implement CI/CD pipelines, and drive automation and security practices.
Top Skills:
Ai-Driven ToolingAzure CloudAzure DevopsAzure MonitorCi/CdCosmosdbDockerElkGithub CopilotGrafanaKubernetesMySQLPostgresPrometheusRedisSQL ServerTerraform
Artificial Intelligence • Information Technology • Consulting
As a Senior Site Reliability Engineer, you will enhance the reliability and performance of our inference platform, leveraging Kubernetes and Terraform while ensuring smooth scalability of systems under load.
Top Skills:
BashGrafanaKubernetesMlopsPrometheusPythonRayTerraformTritonVllm
New
Track Smarter, Apply Better.
Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.
Use For Free
Digital Media • Social Media • Software • Sports
Lead the technical architecture and execution of migration to AWS, drive developer enablement, and automate infrastructure using code-first principles.
Top Skills:
Aws EksDatadogGithub ActionsGoIstioK6KubernetesNode.jsTerraform
Fintech • Mobile • Software
The Staff Site Reliability Engineer will design and manage AWS infrastructure, optimize Kubernetes operations, automate workflows, and troubleshoot systems for improved reliability and performance.
Top Skills:
AWSCi/CdDatadogDockerEksGithub ActionsGoKafkaKubernetesNginxPrivatelinkPythonTerraformTransit GatewayVpc
Legal Tech
Join the Engineering team to enhance cloud solutions, improve service reliability, automate tasks, and support software delivery and compliance initiatives.
Top Skills:
AnsibleAWSCircleCIDockerGoHerokuJenkinsKubernetesLogentriesNew RelicPostgresPythonRuby on RailsRedisRubyTerraformTwilio Sendgrid
Reposted 11 Days AgoSaved
Easy Apply
Easy Apply
Cloud • Information Technology
The Senior Site Reliability Engineer will ensure high availability of Vultr's control plane and infrastructure, focusing on reliability, automation, and observability for distributed systems.
Top Skills:
BgpGitlab Ci/CdGrafanaKvmLibvirtMySQLOpen VswitchPHPPuppetQemuSentrySumologic
Fintech
As a Site Reliability Engineer 2, you will enhance the reliability of the Brokerage-as-a-Service platform, manage technical challenges, and lead automation efforts while participating in on-call rotations for incident resolution.
Top Skills:
Apache AirflowAWSCloudFormationConfluent CloudKubernetesOpenshiftPythonSQLTerraform
News + Entertainment
As an Ads Reliability Engineer, you will ensure the reliability and scalability of Netflix's Ad Suite by designing infrastructure, automating operations, and collaborating across teams to maintain system health and performance.
Top Skills:
AWSAzureGCPGoJavaKubernetesPythonTerraform
Edtech
The Lead Software Engineer will lead the SRE team, focusing on reliability, performance optimization, security, and mentoring developers, while improving overall platform resilience.
Top Skills:
ActivejobAnsibleAWSAws CloudwatchEc2EcsElasticsearchGitGCPGoogle Cloud StackdriverJenkinsJIRAKubernetesMemcachedMongoDBNew RelicNode.jsPostgresRedisRuby On RailsSidekiqSpinnakerTerraformTerragrunt
Software
As a Site Reliability Engineer, you'll enhance system reliability, collaborate on production readiness, define SLIs/SLOs, and improve incident response.
Top Skills:
AWSDatadogGrafanaKubernetesOpentelemetryPrometheusTypescript
Blockchain • Cryptocurrency • NFT
The Staff Software Engineer will manage security, reliability, and observability for an e-commerce platform, implementing monitoring systems and conducting security audits.
Top Skills:
BlockchainExternal Service IntegrationsGoGoogle Cloud PlatformGoogle Cloud RunPostgresRedisTerraform
Artificial Intelligence • Cloud • Software
The Senior SRE Engineer will design, build, and maintain resilient infrastructure systems, manage infrastructure-as-code, and write tooling in various languages.
Top Skills:
CGoJavaScriptKubernetesNixosRustTerraformZig
Artificial Intelligence • Cloud • Information Technology • Software
Build and operate production-grade AI infrastructure using Kubernetes, ensuring high availability, reliability, and performance. Develop custom operators and implement automation for efficient operations and monitoring.
Top Skills:
AnsibleBashElk StackEnterprise Storage SystemsGrafanaHigh-Performance NetworkingKubernetesLinuxNvidia Gpu TechnologiesPrometheusPythonTerraform
Fintech
As a Senior Site Reliability Engineer, you will enhance platform reliability, troubleshoot complex issues, support SRE workflows, and manage incident and change management policies.
Top Skills:
Apache AirflowAWSCloudFormationConfluent CloudJavaKubernetesOpenshiftPythonRest ApisRundeckSQLTerraform
Fintech • Financial Services
The role involves optimizing system reliability and scalability in cloud environments, automating operational excellence, and mentoring SRE teams. Key responsibilities include defining SLOs, managing error budgets, and developing automated solutions for Azure infrastructure.
Top Skills:
ArgocdAtlantisAzureAzure DevopsCi/CdGithub ActionsKubernetesService MeshTerraform
Top Chicago, IL Companies Hiring Remote Reliability Engineers
See AllPopular Job Searches
All Filters
Total selected ()
No Results
No Results


































