Graphcore

Staff AI Performance Engineer

Reposted 2 Days Ago
Hybrid
Austin, TX
Mid level
The Staff AI Performance Engineer will optimize performance across ARM-based architectures and distributed systems, analyzing AI workloads and collaborating to enhance system efficiency.
The summary above was generated by AI
About us

Graphcore is one of the world’s leading innovators in Artificial Intelligence compute.
It is developing hardware, software and systems infrastructure that will unlock the next generation of AI breakthroughs and power the widespread adoption of AI solutions across every industry.
As part of the SoftBank Group, Graphcore is a member of an elite family of companies responsible for some of the world’s most transformative technologies. Together, they share a bold vision: to enable Artificial Super Intelligence and ensure its benefits are accessible to everyone.
Graphcore’s teams are drawn from diverse backgrounds and bring a broad range of skills and perspectives. A melting pot of AI research specialists, silicon designers, software engineers and systems architects, Graphcore enjoys a culture of continuous learning and constant innovation.

Job Summary

Graphcore’s AI/ML training and inference infrastructure is rapidly scaling to meet the growing demands of AI workloads across mobile, edge, and datacenter environments. This role focuses on optimizing performance across ARM-based architectures and large-scale distributed systems, ensuring efficiency, scalability, and reliability across the full hardware-software stack.

The Team

The System Engineering Performance team architects and optimizes high-performance infrastructure for large-scale datacenter deployments. The team works across hardware, software, networking, and system architecture to deliver cutting-edge AI solutions and ensure optimal system performance at scale.

Responsibilities and Duties
  • Analyze ML models’ compute and memory requirements using roofline analysis and simulations
  • Collaborate across hardware and software teams to optimize large-scale AI workloads
  • Benchmark, monitor, and troubleshoot system performance across distributed systems
  • Optimize communication stacks including MPI, NCCL, UCX, RDMA, and networking fabrics
  • Profile and optimize AI workloads, focusing on performance bottlenecks
  • Develop high-quality, ARM-compatible code and documentation

Candidate Profile

Essential:

  • BS/MS in Computer Science, Electrical Engineering, or related field
  • Experience with distributed systems and communication libraries (MPI, NCCL, UCX, libfabric)
  • Strong programming skills in C++ and Python
  • Experience profiling and optimizing HPC or AI/ML workloads
  • Familiarity with ML benchmarks such as MLPerf

Desirable:

  • Experience with GPUs or accelerated computing architectures
  • Knowledge of HPC networking and interconnect technologies (InfiniBand, RoCE)
  • Familiarity with ML frameworks such as PyTorch or TensorFlow
  • Understanding of ARM architectures and toolchains
  • Strong debugging, profiling, and performance optimization skills

In addition to a competitive salary, Graphcore offers flexible working and a comprehensive benefits package designed to support your health, wellbeing and financial future. Our benefits include medical, dental and vision coverage, Flexible Spending Accounts (FSAs), Health Savings Accounts (HSAs), disability and life insurance, a 401(k) retirement plan, commuter benefits, wellness services and an Employee Assistance Programme (EAP).

We welcome people of different backgrounds and experiences; we're committed to building an inclusive work environment that makes Graphcore a great home for everyone. We offer an equal opportunity process and understand that there are visible and invisible differences in all of us. We can provide a flexible approach to interview and encourage you to chat to us if you require any reasonable adjustments.
