GPU Cluster Architect - ISP

1634245
  • Up to €200,000 gross per year
  • Amsterdam, Netherlands
  • Permanent
  • 200000
  • Telecoms
  • IP Networking & Transmission


Are you looking for an exciting new opportunity? 

Join a cloud technology and AI infrastructure provider recognised for delivering high-performance computing and scalable platform solutions. With a focus on innovation, reliability, and technical excellence, the organisation continues to empower businesses through cutting-edge cloud technologies and automation.

Keep cloud infrastructure running at its best. Apply now!


Responsibilities:

  • Architect scalable GPU cluster topologies spanning compute nodes, interconnects (InfiniBand, Ethernet), storage, and control planes
  • Model and analyze AI/ML workloads (LLM training, inference) to drive tradeoffs in latency, bandwidth, GPU density, and performance
  • Collaborate with network architects to design and validate low-latency, high-throughput interconnects (InfiniBand HDR/NDR, RoCEv2) at POD and data center scale
  • Integrate and optimize storage solutions to support training datasets, checkpointing, and high-performance I/O operations
  • Design for reliability, incorporating telemetry, automation, and monitoring to detect and resolve issues early
  • Partner with cross-functional teams including SRE, networking, storage, and data center engineering to operationalize your designs


Skills / Must Have:

  • 5+ years of experience designing GPU or HPC clusters at scale
  • Deep understanding of modern GPU architectures (NVIDIA, AMD)
  • Expertise with HPC interconnects (InfiniBand, RoCE) and low-latency networking
  • Strong background in systems architecture, compute, and hardware reliability
  • Proficiency in scripting and automation (Python, Go)


Bonus If You Have:

  • Experience with AI/ML workload optimization and performance modeling
  • Familiarity with large-scale data center design and cooling/power strategies
  • Exposure to orchestration systems (Kubernetes, Slurm) or telemetry frameworks


Benefits:

  • Bonus scheme 
  • Company shares
  • Flexible remote working


Salary:

  • Up to €200,000 gross per year
Holly Staff Principal Network Consultant BLX

Apply for this role