Lambda Logo

Lambda

AI Infrastructure Deployment Lead

Reposted 14 Days Ago
Be an Early Applicant
In-Office
6 Locations
128K-149K Annually
Senior level
In-Office
6 Locations
128K-149K Annually
Senior level
Lead deployment of AI infrastructure including GPU clusters and networking. Manage technical projects, collaborate with teams, and ensure efficient deployment processes.
The summary above was generated by AI

Lambda, The Superintelligence Cloud, is a leader in AI cloud infrastructure serving tens of thousands of customers. Our customers range from AI researchers to enterprises and hyperscalers. Lambda's mission is to make compute as ubiquitous as electricity and give everyone the power of superintelligence. One person, one GPU.

If you'd like to build the world's best AI cloud, join us.


*Note: This position requires presence in our Data Center location 5 days per week
As the AI Infrastructure Deployment Lead, you’ll be responsible for planning, coordinating, and executing the deployment of large-scale AI infrastructure across Lambda’s data centers and customer sites. You’ll lead cross-functional technical teams to design resilient network topologies, oversee rack-level integration, and ensure smooth delivery of compute environments optimized for large-scale training workloads.


This role combines hands-on technical expertise with strategic project leadership — ideal for engineers who thrive at the intersection of hardware, networking, and systems design.

What You’ll Do

  • Infrastructure Deployment

    • Lead end-to-end deployment of GPU clusters, storage systems, and networking fabric across Lambda’s data centers.

    • Design and implement data center network topologies optimized for AI and HPC workloads, including high-speed Ethernet and InfiniBand environments.

    • Oversee rack implementation, cabling, and power/cooling validation for optimal efficiency and scalability.

    • Collaborate with supply chain, logistics, and operations teams to ensure smooth delivery and installation timelines.

  • Network Engineering

    • Implement Layer 2/Layer 3 networks, including VLANs, Spine to Leaf architecture, Infiniband interconnect technology.

    • Partner with network architects to ensure redundancy, scalability, and low-latency interconnects for distributed AI workloads.

    • Monitor network health, identify bottlenecks, and implement optimizations to maintain peak performance.

  • Hardware & Systems Management

    • Oversee server hardware troubleshooting, including GPUs, NICs, CPUs, and storage components.

    • Lead root-cause analysis for system issues and drive corrective actions in collaboration with vendors and internal hardware teams.

    • Develop standard operating procedures (SOPs) for hardware validation, deployment, and maintenance.

  • Technical Project Leadership

    • Serve as technical project lead for infrastructure rollouts and cluster expansion projects.

    • Coordinate cross-functional teams — networking, facilities, cloud operations, and hardware engineering — to execute deployments on schedule.

    • Manage project scope, budgets, risk assessments, and post-deployment reviews.

    • Communicate status, challenges, and milestones to leadership with clarity and precision.

  • Documentation & Continuous Improvement

    • Maintain detailed network topology diagrams, deployment runbooks, and hardware inventories.

    • Identify opportunities for process automation and infrastructure standardization across deployments.

    • Contribute to Lambda’s internal knowledge base and mentor junior engineers on data center best practices.

What You’ll Bring

Required:

  • Bachelor’s degree in Computer Engineering, Information Technology, or related field.

  • CCNA (Cisco Certified Network Associate) certification (CCNP or equivalent a plus).

  • PMP (project Management Professional) Certification (PMP or equivalent a plus).

  • 5+ years of experience in data center infrastructure deployment or network operations, preferably in AI, HPC, or cloud environments.

  • Proven ability to lead complex technical projects and manage multidisciplinary teams.

  • Strong understanding of data center network design (Layer 2/3, VLAN, Rack elevations, port mapping, Infiniband technologies.

  • Hands-on expertise in server hardware troubleshooting and rack-level integration.

  • Ability and willingness to travel 50-70% to our data center sites.

Preferred:

  • Experience deploying or managing GPU clusters and distributed training environments.

  • Familiarity with automation and orchestration tools (Ansible, Terraform) and monitoring systems (Prometheus, Grafana).

  • Knowledge of structured cabling, power distribution, and environmental monitoring in data centers.

  • Excellent communication and documentation skills.

Salary Range Information

The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.

About Lambda

  • Founded in 2012, with 500+ employees, and growing fast

  • Our investors notably include TWG Global, US Innovative Technology Fund (USIT), Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, Gradient Ventures, Mercato Partners, SVB, 1517, and Crescent Cove

  • We have research papers accepted at top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG

  • Our values are publicly available: https://lambda.ai/careers

  • We offer generous cash & equity compensation

  • Health, dental, and vision coverage for you and your dependents

  • Wellness and commuter stipends for select roles

  • 401k Plan with 2% company match (USA employees)

  • Flexible paid time off plan that we all actually use

A Final Note:

You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills.

Equal Opportunity Employer

Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

Top Skills

Ansible
Ethernet
Gpu Clusters
Grafana
Infiniband
Infiniband Technology
Layer 2
Layer 3 Networks
Networking Fabric
Prometheus
Storage Systems
Terraform
Vlans

Similar Jobs

2 Hours Ago
Hybrid
Garden City, KS, USA
21-31 Hourly
Junior
21-31 Hourly
Junior
Automotive • Cloud • Greentech • Information Technology • Other • Software • Cybersecurity
As Assistant Store Manager I, you'll lead a sales team, manage store performance, set sales goals, provide training, and handle customer disputes. You'll focus on sales strategies, inventory management, and aid in staff development within a retail environment.
2 Hours Ago
Hybrid
Overland Park, KS, USA
16-24 Hourly
Junior
16-24 Hourly
Junior
Automotive • Cloud • Greentech • Information Technology • Other • Software • Cybersecurity
Provide technical customer support to clients via multiple communication methods, resolve issues, document information, and maintain expert knowledge of business processes.
Top Skills: CRMGenesys Pure CloudSalesforce
2 Hours Ago
Hybrid
Kansas, USA
85K-153K Annually
Mid level
85K-153K Annually
Mid level
Automotive • Cloud • Greentech • Information Technology • Other • Software • Cybersecurity
The Account Executive role involves developing sales strategies, managing customer relationships, and driving revenue growth in cloud services. Responsibilities include pipeline development, consultation-based selling, and collaboration with internal teams.
Top Skills: AWSAzureCloud TechnologiesGCPSalesforce

What you need to know about the Chicago Tech Scene

With vibrant neighborhoods, great food and more affordable housing than either coast, Chicago might be the most liveable major tech hub. It is the birthplace of modern commodities and futures trading, a national hub for logistics and commerce, and home to the American Medical Association and the American Bar Association. This diverse blend of industry influences has helped Chicago emerge as a major player in verticals like fintech, biotechnology, legal tech, e-commerce and logistics technology. It’s also a major hiring center for tech companies on both coasts.

Key Facts About Chicago Tech

  • Number of Tech Workers: 245,800; 5.2% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: McDonald’s, John Deere, Boeing, Morningstar
  • Key Industries: Artificial intelligence, biotechnology, fintech, software, logistics technology
  • Funding Landscape: $2.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Pritzker Group Venture Capital, Arch Venture Partners, MATH Venture Partners, Jump Capital, Hyde Park Venture Partners
  • Research Centers and Universities: Northwestern University, University of Chicago, University of Illinois Urbana-Champaign, Illinois Institute of Technology, Argonne National Laboratory, Fermi National Accelerator Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account