Voltage Park Logo

Voltage Park

Infrastructure Operations Engineer

Posted 3 Hours Ago
Remote
Hiring Remotely in United States
Senior level
Remote
Hiring Remotely in United States
Senior level
The Infrastructure Operations Engineer at Voltage Park will design and implement infrastructure solutions, ensure system stability, support AI workloads, and collaborate with various teams.
The summary above was generated by AI
Voltage Park is your enterprise AI factory. We offer scalable compute power, on-demand and reserved bare metal AI infrastructure using NVIDIA GPUs, with world-class service, performance and value. Founded with the mission of making accessible AI computing for all - our flexible, affordable GPU solutions power everyone from builders to enterprises.
We are seeking a highly skilled and proactive Infrastructure Operations Engineer to be part of our 24/7 Infrastructure Operations team responsible for the stability, scalability, and performance of compute, storage, and platform infrastructure. This role plays a key part in delivering always-on, high-performance environments that support AI/ML training, inference, and HPC workloads at scale. The ideal candidate combines technical depth with strong interpersonal skills and a passion for operational excellence.
This position offers full remote flexibility, although candidates must be based in the continental US and available to work during PST hours. Unfortunately, we are unable to provide sponsorship for this role.
Responsibilities
- At the direction of the Manager of Infrastructure Operations, design, build, and roll out new platforms and patterns to minimize incidents and enable customer facing and internal features.
- Deploy updates and improvements to support both Voltage Park's internal and end customer use cases.
- Collaborate with colleagues in Infrastructure Engineering, Network Operations, Customer Success and Software and Platform Development Teams.
- Participate in the on-call rotation which is evenly distributed across all team members in a primary / secondary pattern where you are primary then move to a secondary position.
Qualifications
- 8+ years working with Linux as a server / hosting platform, extra points for Ubuntu experience.
- 5+ years experience with AWS.
- 2+ years experience with Kubernetes and strong container fundamentals.
- 2+ years experience with Terraform and Ansible
- 2+ years with network attached storage management (via NFS, ceph, or other protocols). Extra points for experience with VAST storage systems.
- Experience working in a Slack-first, asynchronous remote work environment.
- Experience with monitoring systems (Prometheus, ELK stack).
- Familiarity with the gitops workflow.
- Software development experience using Python, Go, bash, or other languages for the purposes of automation & connecting systems & APIs together.
- Deep networking fundamentals, extra points for experience with datacenter level networks, 400Gb ethernet, and Infiniband.
- Experience building and delivering complex systems.
- Effective at navigating tradeoffs between design, risk, cost, and outcomes.
- Comfortable with navigating ambiguity.
- Strong written and oral communication.
Ideal Experiences
- Experience with bare metal hardware troubleshooting and provisioning, extra points for working with Dell hardware.
- Experience with GPU servers, both in bare metal form or under virtualization.
- Deep experience with network switches, routers, and firewalls, particularly SONiC switches, Palo Alto firewalls and Juniper Networks as vendors.
- Experience with VAST storage systems
Culture
- You enjoy working with a small group of friendly, highly motivated, execution focused colleagues.
- You're comfortable with a high degree of autonomy. We expect you to independently prioritize your work and understand how it maps to the overall needs and goals of the company.
- You're knowledgeable in your domain but also enjoy wearing multiple hats and venturing outside of your comfort zone when the need arises.
- You value the ability to write well and understand the importance of good documentation.
Voltage Park is an equal opportunity employer and makes employment decisions on the basis of merit. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, protected veteran status, or any other characteristic under federal, state, or local law. If you require an accommodation during the job application process, please notify your recruiter.

Top Skills

Ansible
AWS
Bash
Ceph
Elk Stack
Go
Kubernetes
Linux
Nfs
Prometheus
Python
Terraform

Similar Jobs at Voltage Park

3 Hours Ago
In-Office or Remote
San Francisco, CA, USA
8-8 Annually
Expert/Leader
8-8 Annually
Expert/Leader
Artificial Intelligence • Cloud • Hardware • Machine Learning • Software • Infrastructure as a Service (IaaS)
Design and operate observability platforms for metrics, logs, and alerts. Collaborate on infrastructure projects and enhance operational transparency.
Top Skills: BashElkGoGrafanaKafkaOtelPrometheusPromtailPythonVictoriametrics
3 Hours Ago
In-Office or Remote
Redmond, WA, USA
Senior level
Senior level
Artificial Intelligence • Cloud • Hardware • Machine Learning • Software • Infrastructure as a Service (IaaS)
Design and develop automation tools, APIs, and systems for managing infrastructure, collaborating on architecture and lifecycle management of resources at scale.
Top Skills: Bare-Metal ProvisioningContainerizationHpc InfrastructureLinuxOrchestrationPython
3 Hours Ago
In-Office or Remote
San Francisco, CA, USA
Mid level
Mid level
Artificial Intelligence • Cloud • Hardware • Machine Learning • Software • Infrastructure as a Service (IaaS)
The Technical Account Manager will manage customer relationships, ensure satisfaction, and optimize use of GPU cloud infrastructure for various workflows.
Top Skills: AICloud InfrastructureGpuMachine Learning

What you need to know about the Chicago Tech Scene

With vibrant neighborhoods, great food and more affordable housing than either coast, Chicago might be the most liveable major tech hub. It is the birthplace of modern commodities and futures trading, a national hub for logistics and commerce, and home to the American Medical Association and the American Bar Association. This diverse blend of industry influences has helped Chicago emerge as a major player in verticals like fintech, biotechnology, legal tech, e-commerce and logistics technology. It’s also a major hiring center for tech companies on both coasts.

Key Facts About Chicago Tech

  • Number of Tech Workers: 245,800; 5.2% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: McDonald’s, John Deere, Boeing, Morningstar
  • Key Industries: Artificial intelligence, biotechnology, fintech, software, logistics technology
  • Funding Landscape: $2.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Pritzker Group Venture Capital, Arch Venture Partners, MATH Venture Partners, Jump Capital, Hyde Park Venture Partners
  • Research Centers and Universities: Northwestern University, University of Chicago, University of Illinois Urbana-Champaign, Illinois Institute of Technology, Argonne National Laboratory, Fermi National Accelerator Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account