This role involves managing AWS infrastructure, debugging applications, developing internal tools, and automating deployment processes for Metabase Cloud.
Metabase is the easiest way for people to get insights from their data, from tiny startups who get up and running quickly to major corporations with tens of thousands of users. That's why people love us.
We bring data tools with the elegance and simplicity of consumer products to the crufty world of enterprise business intelligence. We provide an opinionated open source starting point for how companies should measure, analyze and share their data, which is used by tens of thousands of companies.
Tens of thousands of companies use Metabase every day to answer questions about their data. While we seek to become the de-facto self-managed open source analytics software for organizations everywhere, many customers want an ability to use Metabase without worrying about the operational details of self-hosting. That’s why we recently launched our Metabase Cloud product. We’re looking for operations engineers to help build out and run our new and quickly growing ‘Metabase Cloud’ hosted product.
You will:
- Own and operate our application stack and AWS infrastructure to orchestrate and manage our hosted customer instances of Metabase
- Debug runtime issues across the different levels of our application stack and hosting stack.
- Develop and build our internal tooling and automation to manage the lifecycle of a hosted Metabase installation, from purchase to deployment, zero-downtime upgrades, and general operational health
- Continuously improve our automated deployments and testing
We're looking for someone who:
- Is thoughtful and careful
- Compulsively automates everything and documents it
- Is able to make solid technical judgements and back them up articulately
- Has at least 5 years of experience building and operating production infrastructure, ideally on public cloud
- Strong Kubernetes and AWS experience
- Strong experience with IaC and Terraform
- Can write high quality and readable code in a modern language (e.g. Python, Go, etc.)
- Experience with modern monitoring stacks (e.g Prometheus/Grafana/Datadog)
Projects you could work on:
- Multi-region hosting
- Automate EKS cluster provisioning
- Extend our CRDs and Operators
- Improve the RDS sharding strategy for our multi-tenant platform
- Unify and improve our CI/CD platforms
- Collaborate with core application developers on changes to improve our application metrics, deployment speeds and CI integration.
- Maintain our SOC2 compliance and security posture
We're a global team (50% outside the US), fully distributed (from Thailand to California), who get things done asynchronously, with plenty of uninterrupted time, supporting each other to do the best work of our careers. We offer flexibility (define your own schedule and work from wherever you want), autonomy, and an environment that fosters growth, learning, and development. We're relentlessly user-focused and believe in building long-term value, not short-term hacks. And we raised a $30M Series B to take our approach to the next level for years to come.
For U.S. applicants: Metabase participates in the federal E-Verify program, which confirms employment authorization of newly hired U.S. based employees. E-Verify is not used as a tool to pre-screen candidates and is only initiated upon hire.
E-Verify Participation Notice (English/Spanish)
Right to Work Notice (English/Spanish)
Top Skills
AWS
Datadog
Go
Grafana
Kubernetes
Prometheus
Python
Terraform
Similar Jobs
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
The Senior Staff Machine Learning Engineer will enhance AI workloads through platform design, ensure GPU cluster reliability, and support AI developers. Responsibilities include coding, collaborating across teams, mentoring, and refining SRE practices.
Top Skills:
AnsibleGitlab CiGoHelmJ2EeJavaKubernetesLinuxPrometheusPythonSplunk
Blockchain • Software
The Senior Engineer, SRE/DevOps will ensure the reliability and security of blockchain infrastructure by automating processes and collaborating with teams.
Top Skills:
AnsibleAWSBashCloudtrailCloudwatchCosmosDockerElasticsearchElk-StackEthereumGCPK8SMySQLOpsgeniePagerdutyPingdomPythonTerraform
Big Data • Fintech • Machine Learning • Real Estate • Database
The role involves building and supporting infrastructure, integrating systems, automating processes, and resolving technical issues with a focus on CI/CD and Infrastructure as Code practices.
Top Skills:
AirflowBashBigQueryCircleCIDatadogGCPGraphQLHasuraHelmKubernetesLinuxLookerMongoDBNode.jsPostgresPythonReactReduxStackdriverTerraform
What you need to know about the Chicago Tech Scene
With vibrant neighborhoods, great food and more affordable housing than either coast, Chicago might be the most liveable major tech hub. It is the birthplace of modern commodities and futures trading, a national hub for logistics and commerce, and home to the American Medical Association and the American Bar Association. This diverse blend of industry influences has helped Chicago emerge as a major player in verticals like fintech, biotechnology, legal tech, e-commerce and logistics technology. It’s also a major hiring center for tech companies on both coasts.
Key Facts About Chicago Tech
- Number of Tech Workers: 245,800; 5.2% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: McDonald’s, John Deere, Boeing, Morningstar
- Key Industries: Artificial intelligence, biotechnology, fintech, software, logistics technology
- Funding Landscape: $2.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Pritzker Group Venture Capital, Arch Venture Partners, MATH Venture Partners, Jump Capital, Hyde Park Venture Partners
- Research Centers and Universities: Northwestern University, University of Chicago, University of Illinois Urbana-Champaign, Illinois Institute of Technology, Argonne National Laboratory, Fermi National Accelerator Laboratory