NVIDIA

Senior Solutions Architect - AI Infrastructure

Posted 3 Days Ago
In-Office or Remote
Hiring Remotely in CA, USA
184K-357K Annually
Senior level
Lead GPU and NVLink-based cluster design and validation for large-scale AI and HPC deployments. Advise cloud partners on architectures, perform performance modeling, debug deployment issues, support NPI rollouts, and relay field feedback to engineering.
The summary above was generated by AI

NVIDIA is building the world’s most groundbreaking and innovative accelerated computing platforms for AI and HPC.  Because of our work, scientists, researchers, and engineers can push the boundaries of what’s possible.  We pioneered a supercharged form of computing that powers everything from breakthrough AI research to the world’s fastest supercomputers.

We are seeking a highly motivated Senior Solutions Architect to join the NVIDIA Cloud Partners team with a focus on GPU, NVLink, and infrastructure design. In this role, you will be at the forefront of assisting with the design and architecture of some of the largest next-generation GPU-based clusters, enabling the world’s most advanced AI supercomputers and enterprise AI infrastructure in the field. As a Solutions Architect, you will serve as a key technical expert on NVIDIA’s groundbreaking GPU and NVLink technology designs and our software solutions, acting as a direct bridge between engineering and the field teams that support customers with the most demanding requirements. You will work on end-to-end cluster design and architecture, performance modeling, validation, and NPI cluster deployments. Your expertise will directly influence how the world’s leading AI companies, cloud providers, hyperscalers, research institutions, and enterprises build their infrastructure.

What you’ll be doing:

  • Partner with NVIDIA Cloud Partners in GPU cluster design and networking and convey architecture and optimal process information for building next-generation architectures.

  • Guide NVIDIA Cloud Partners in cluster design, weighing design principles against complex, situational limitations to build the most performant and supportable GPU clusters possible.

  • Work closely with NVIDIA Cloud Partners to ensure successful first deployments with new products, including new network architectures and topologies.

  • Relay customer and field perspectives on cluster design and workflows to the engineering teams designing internal clusters.

  • Perform hands-on work to help NVIDIA Cloud Partners debug issues relating to cluster design, configuration, and performance, drawing on internal engineering expertise and knowledge of known bugs.

  • Support NPI customer deployments with new GPU/Networking architectures.

What we need to see:

  • BS, MS, or PhD in Computer Science, Electrical Engineering, Computer Engineering, Physics, or related field (or equivalent experience).

  • 8+ years of experience in cluster design, validation, and issue resolution, specifically on GPU and HPC clusters.

  • Proven expertise in designing large-scale distributed systems, AI clusters, or HPC infrastructure.

  • Ability to translate sophisticated engineering concepts into customer-ready documentation, diagrams, and reference material.

  • Expertise in driving customer and partner issues to closure with product and engineering teams.

  • Ability to handle cross-functional communication across customers and product, support, and engineering teams.

Ways to stand out from the crowd:

  • Experience leading large-scale AI Factory or HPC cluster bring-ups or builds.

  • Hands-on experience with NVIDIA products including, but not limited to, GPUs, NVLink, and NVIDIA Networking, particularly debugging issues that occur during NVLink deployments.

  • Knowledge of NCCL, MPI, IMEX, NMX, and collectives in distributed training as they pertain to cluster designs.

  • External customer-facing skill set and background.

  • Effective time management and capability to balance multiple tasks and customers while thinking creatively to debug and solve problems.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until July 6, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
