GPU Cluster Architect - ISP
1634245
Posted: 29/10/2025
- Up to €200,000 gross per year
- Amsterdam, Netherlands
- Permanent
- Telecoms
- IP Networking & Transmission
Are you looking for an exciting new opportunity?
Join a cloud technology and AI infrastructure provider recognised for delivering high-performance computing and scalable platform solutions. With a focus on innovation, reliability, and technical excellence, the organisation continues to empower businesses through cutting-edge cloud technologies and automation.
Keep cloud infrastructure running at its best. Apply now!
Responsibilities:
- Architect scalable GPU cluster topologies spanning compute nodes, interconnects (InfiniBand, Ethernet), storage, and control planes
- Model and analyze AI/ML workloads (LLM training, inference) to inform trade-offs among latency, bandwidth, GPU density, and overall performance
- Collaborate with network architects to design and validate low-latency, high-throughput interconnects (InfiniBand HDR/NDR, RoCEv2) at POD and data center scale
- Integrate and optimize storage solutions to support training datasets, checkpointing, and high-performance I/O operations
- Design for reliability, incorporating telemetry, automation, and monitoring to detect and resolve issues early
- Partner with cross-functional teams including SRE, networking, storage, and data center engineering to operationalize your designs
Skills / Must Have:
- 5+ years of experience designing GPU or HPC clusters at scale
- Deep understanding of modern GPU architectures (NVIDIA, AMD)
- Expertise with HPC interconnects (InfiniBand, RoCE) and low-latency networking
- Strong background in systems architecture, compute, and hardware reliability
- Proficiency in scripting and automation (Python, Go)
Bonus If You Have:
- Experience with AI/ML workload optimization and performance modeling
- Familiarity with large-scale data center design and cooling/power strategies
- Exposure to orchestration systems (Kubernetes, Slurm) or telemetry frameworks
Benefits:
- Bonus scheme
- Company shares
- Flexible remote working
Salary:
- Up to €200,000 gross per year
Holly Staff
Principal Network Consultant BLX