Senior Network Engineer (AI Infrastructure) - Hosting

1711278
  • Circa $225,000 base salary
  • San Francisco, California, United States
  • Permanent
  • 250000
  • Artificial Intelligence
  • AI Network


Are you looking for an exciting new opportunity? 

Join a rapidly scaling AI cloud infrastructure provider building next-generation GPU platforms designed for large-scale AI training, experimentation, and inference workloads. The company is operating cutting-edge GPU infrastructure across Europe and is now aggressively expanding into the United States, with significant investment in high-performance networking and AI-ready data center architecture.

The company is looking for a Senior Network Engineer to design, deploy, and support ultra-low-latency network infrastructure for large-scale GPU and AI environments. The role involves working with technologies such as NVIDIA Spectrum and Cumulus Linux to build scalable 400G network fabrics optimised for AI and HPC workloads within a rapidly growing global infrastructure environment.

If you would like to learn more about this opportunity, feel free to reach out and apply today!


Responsibilities:

  • Design, deploy, and operate large-scale 400G network fabrics supporting AI and HPC workloads
  • Build and maintain high-performance data center networking environments using NVIDIA Spectrum switches and Cumulus Linux
  • Optimize network performance, latency, throughput, and resilience across GPU clusters
  • Troubleshoot complex Layer 2/Layer 3 networking issues in distributed HPC environments
  • Automate network provisioning, configuration management, and operational workflows
  • Collaborate with infrastructure, platform, and ML engineering teams to ensure efficient GPU cluster communication
  • Support network observability, telemetry, and capacity planning initiatives
  • Contribute to network architecture strategy and scalability planning for future deployments


Skills/Must Have:

  • Deep experience in network engineering within data center, HPC, cloud, or large-scale infrastructure environments
  • Strong hands-on experience with NVIDIA Spectrum switching platforms and Cumulus Linux
  • Experience designing and operating 100G/400G Ethernet fabrics
  • Deep understanding of modern data center networking protocols (BGP, EVPN, VXLAN, MLAG, ECMP)
  • Strong Linux systems knowledge and network automation skills
  • Experience with automation tools and scripting (Python, Ansible, Bash, Terraform preferred)
  • Familiarity with AI/HPC traffic patterns and GPU cluster networking requirements
  • Strong troubleshooting and performance optimization capabilities in high-throughput environments


Benefits:

  • Stock options 
  • Company bonus
  • Remote working options and allowance 


Salary:

  • Circa $225,000 base salary 
Ben Davies Director Global AI Infrastructure

Apply for this role