Bespoke Labs Logo

Bespoke Labs

DevOps / Site Reliability Engineer

Sorry, this job was removed at 10:12 p.m. (CST) on Friday, May 15, 2026
Remote
Hiring Remotely in USA
Remote
Hiring Remotely in USA

Similar Jobs

15 Days Ago
Remote
United States
Senior level
Senior level
AdTech • Big Data • Consumer Web • Digital Media • Marketing Tech
Lead the development of Launch Potato's cloud infrastructure, establishing SRE practices including on-call rotations and monitoring systems, while ensuring cost efficiency and reliability.
Top Skills: AWSCi/CdEcsGrafanaLambdaOpentelemetryPagerdutyTerraform
15 Days Ago
Remote
United States
Senior level
Senior level
AdTech • Big Data • Consumer Web • Digital Media • Marketing Tech
The Lead DevOps/SRE Engineer will own and evolve cloud infrastructure, build the SRE function, manage CI/CD platforms, and ensure compliance while enhancing infrastructure reliability and cost control.
Top Skills: AWSCi/CdGrafanaOpentelemetryPagerdutyTerraform
15 Days Ago
Remote
United States
Senior level
Senior level
AdTech • Big Data • Consumer Web • Digital Media • Marketing Tech
The Lead Engineer, DevOps & SRE will oversee the cloud infrastructure, build the SRE function, and manage CI/CD processes to ensure reliable operations and compliance.
Top Skills: AWSCi/CdEcsGrafanaLambdaOpentelemetryPagerdutyTerraform

About Bespoke Labs

Bespoke Labs is an AI research and data company building the datasets, benchmarks, and evaluation infrastructure that power frontier AI models. We're backed by leading investors, trusted by top AI labs, and have research accepted at venues like ICLR 2026. Our team is small, moves fast, and has an outsized impact on how the next generation of AI is built.

The Role

We're looking for a mid-level DevOps / Site Reliability Engineer to own and scale our cloud infrastructure. You'll work closely with engineering and ML teams to keep our systems reliable, observable, and fast — directly supporting the infrastructure that powers AI data pipelines at scale.

What You'll Do

  • Own cloud infrastructure on AWS — EC2, EKS, RDS, S3, IAM, VPC

  • Manage Kubernetes clusters and container orchestration end-to-end

  • Build and maintain CI/CD pipelines using GitHub Actions or similar

  • Implement monitoring, alerting, and observability stacks (Prometheus, Grafana, or DataDog)

  • Improve reliability, performance, and security of production systems

  • Automate infrastructure with Terraform or similar IaC tools

  • Debug and resolve issues across complex, distributed systems

  • Participate in design reviews and help raise the infrastructure bar

What We're Looking For

  • 3–5 years in DevOps, SRE, or infrastructure engineering

  • Strong AWS experience — EKS, EC2, RDS, S3, IAM

  • Kubernetes — deployment, scaling, troubleshooting in production

  • CI/CD pipelines — GitHub Actions, ArgoCD, or similar

  • Infrastructure as Code — Terraform, Pulumi, or CDK

  • Python or Go scripting

  • Experience working in production environments with real users

  • Comfort with ambiguity and ability to operate autonomously

Nice to Have

  • Experience supporting ML training workloads or GPU clusters

  • Familiarity with distributed computing or large-scale data pipelines

  • Prior work at an AI, ML, or data company

  • Open-source contributions or published technical writing

What We Offer

  • Competitive compensation and meaningful equity

  • Direct impact on frontier AI model training and evaluation infrastructure

  • Flexible, remote-friendly environment with low bureaucracy

  • A small, high-caliber team with deep AI research expertise

  • Health, wellness, and learning & development benefits

What you need to know about the Chicago Tech Scene

With vibrant neighborhoods, great food and more affordable housing than either coast, Chicago might be the most liveable major tech hub. It is the birthplace of modern commodities and futures trading, a national hub for logistics and commerce, and home to the American Medical Association and the American Bar Association. This diverse blend of industry influences has helped Chicago emerge as a major player in verticals like fintech, biotechnology, legal tech, e-commerce and logistics technology. It’s also a major hiring center for tech companies on both coasts.

Key Facts About Chicago Tech

  • Number of Tech Workers: 245,800; 5.2% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: McDonald’s, John Deere, Boeing, Morningstar
  • Key Industries: Artificial intelligence, biotechnology, fintech, software, logistics technology
  • Funding Landscape: $2.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Pritzker Group Venture Capital, Arch Venture Partners, MATH Venture Partners, Jump Capital, Hyde Park Venture Partners
  • Research Centers and Universities: Northwestern University, University of Chicago, University of Illinois Urbana-Champaign, Illinois Institute of Technology, Argonne National Laboratory, Fermi National Accelerator Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account