Axiom (axiom.co) Logo

Axiom (axiom.co)

Site Reliability Engineer

Posted 8 Days Ago
Remote
Hiring Remotely in United States
Mid level
Remote
Hiring Remotely in United States
Mid level
Design, operate, and automate scalable, secure infrastructure for Axiom Cloud. Define SLOs, plan disaster recovery and capacity, tune performance, improve deployment practices, build reliability tooling, respond to incidents, and promote monitoring and observability across teams.
The summary above was generated by AI
Site Reliability Engineer (SRE)

Global (UTC-3 preferred)

Axiom’s mission is to empower developers to get the best insights into their data, as fast as possible. We are a remote-first and globally distributed team building a cloud native, serverless data analytics platform. Axiom completely changes the way in which developers and organizations think about their data: they can now send unlimited data with cost-effective storage and lightning-fast querying.

As a Site Reliability Engineer at Axiom, you will be pivotal in upholding our promise of superior reliability and performance to our customers. Collaborating with backend engineers and product teams, you will emphasize creating and operating scalable and reliable systems. Axiom's emphasis on SREs revolves around automating, measuring, and continuously improving the reliability and efficiency of our systems.

Your primary responsibilities:

  • Engineer and maintain a robust, secure, and scalable infrastructure for Axiom Cloud.

  • Collaborate with engineering teams to define and refine service level objectives.

  • Contribute to disaster recovery planning, capacity engineering, performance analysis, and system tuning.

  • Foster best practices for code deployments, aiding in the education of the broader development team.

  • Roll out tooling and solutions that improve system reliability and reduce manual toil.

  • Address and remediate service incidents and contribute to postmortems and root cause analyses.

  • Foster a culture of monitoring, alerting, and observability across the organization.

You are an ideal candidate if:

  • You have over two years of experience in a reliability-focused engineering environment.

  • You are passionate about system reliability, latency, performance, and efficiency.

  • You're familiar with AWS tools and technologies.

  • You have hands-on experience with Docker, Kubernetes, and Amazon EKS.

  • You understand infrastructure-as-code tools such as Terraform/Pulumi.

  • You possess strong networking knowledge and are adept with Linux systems.

  • Familiarity with CI platforms like GitHub Actions, GitLab, CircleCI or others.

  • You can efficiently use LLMs.

  • Experience with monitoring, alerting, and observability tools.

Bonus skills and experiences:

  • Proven track record of maintaining production systems at scale.

  • A software engineering background with expertise in Golang.

We provide:
  • Flexibility to work from wherever suits you best. For this role, we are considering individuals based in the timezone range UTC-5 (EST) to UTC +2.

  • Budget to build your home office set-up.

  • Monthly budget to support mental and physical wellness.

  • A focus day each week with no meetings, Slack or Zoom. Uninterrupted time to focus on work.

  • Uncapped vacation to unplug and rejuvenate.

  • Generous and flexible family leave for everyone.

Top Skills

Aws,Docker,Kubernetes,Amazon Eks,Terraform,Pulumi,Linux,Github Actions,Gitlab,Circleci,Llms,Golang,Monitoring And Observability Tools

Similar Jobs

Yesterday
Remote or Hybrid
United States
148K-185K Annually
Senior level
148K-185K Annually
Senior level
Digital Media • Gaming • Information Technology • Software • Sports • Esports • Big Data Analytics
Lead SRE responsible for architecting and automating fault-tolerant, scalable infrastructure across cloud and on-prem, driving deployment, monitoring, and performance tuning while mentoring engineers to improve reliability and SLAs.
Top Skills: .NetAnsibleAWSAws GreengrassC#ChefDockerElixirGCPGitopsGoJavaKubernetesLinuxNutanixPythonRubyTerraformVsphere
10 Days Ago
In-Office or Remote
Atlanta, GA, USA
120K-175K Annually
Senior level
120K-175K Annually
Senior level
Fintech • Gaming • Mobile • Sports • Esports
Design, implement, and monitor reliable production systems at scale. Lead incident response and post-mortems, debug critical production issues, build observability and monitoring, drive reliability best practices and SLO governance, and mentor/train engineers to improve system scalability, resilience, and security.
Top Skills: AWSAzureCrossplaneDatadogGCPGoGrafanaKubernetesNew RelicPythonRubyTerraform
2 Days Ago
Easy Apply
Remote or Hybrid
San Jose, CA, USA
Easy Apply
119K-170K Annually
Senior level
119K-170K Annually
Senior level
Cloud • Information Technology • Security • Software • Cybersecurity
As a Staff Site Reliability Engineer, you'll oversee Zscaler production data center services, optimize code, and ensure cloud service availability and performance. Collaborate with cross-functional teams to improve processes and resolve escalated issues.
Top Skills: BashDnsFirewallsGrafanaHTTPIcmpLoad BalancingNagiosOsi ModelPrometheusPythonTcp/Ip

What you need to know about the Chicago Tech Scene

With vibrant neighborhoods, great food and more affordable housing than either coast, Chicago might be the most liveable major tech hub. It is the birthplace of modern commodities and futures trading, a national hub for logistics and commerce, and home to the American Medical Association and the American Bar Association. This diverse blend of industry influences has helped Chicago emerge as a major player in verticals like fintech, biotechnology, legal tech, e-commerce and logistics technology. It’s also a major hiring center for tech companies on both coasts.

Key Facts About Chicago Tech

  • Number of Tech Workers: 245,800; 5.2% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: McDonald’s, John Deere, Boeing, Morningstar
  • Key Industries: Artificial intelligence, biotechnology, fintech, software, logistics technology
  • Funding Landscape: $2.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Pritzker Group Venture Capital, Arch Venture Partners, MATH Venture Partners, Jump Capital, Hyde Park Venture Partners
  • Research Centers and Universities: Northwestern University, University of Chicago, University of Illinois Urbana-Champaign, Illinois Institute of Technology, Argonne National Laboratory, Fermi National Accelerator Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account