Datacenter Engineer (AI Infrastructure) - Hosting

1674714
  • S$160,000 – S$220,000+ (Based on experience and specialised hardware certifications)
  • Adam Park, Singapore
  • Permanent
  • 150000
  • 200000
  • Artificial Intelligence
  • AI Data Center


Looking to take a leadership role in AI and high-performance computing while working on next-generation GPU infrastructure?

Join a technology team delivering scalable GPU computing solutions and global infrastructure for AI and compute-intensive workloads. The organization is seeking a hands-on technical leader to spearhead the physical deployment of AI Factories, overseeing thousands of H100 and B200 GPUs across high-performance clusters. As a Founders Fund-backed NVIDIA cloud partner, the team powers foundation model training and enterprise AI, providing team members with rare exposure to both cutting-edge hardware and complex infrastructure challenges. This role operates at the intersection of silicon, networking, and cooling, offering opportunities to deploy, manage, and optimize high-performance systems while collaborating with experienced engineers in a fast-paced, innovative environment.

Apply now to drive impactful AI infrastructure projects and advance a career at the forefront of AI computing.


Responsibilities:

  • Lead GPU Deployments: Oversee the end-to-end installation of large-scale GPU clusters, ensuring every server, switch, and cable is configured for maximum performance and reliability.
  • Optimise Physical Infrastructure: Work directly with datacenter operators to manage power density, thermal loads, and space planning for high-wattage AI hardware.
  • Troubleshoot Hardware at Scale: Partner with engineering to solve "hard" physical problems—from firmware bugs and faulty optics to complex network degradation within the cluster.
  • Own Vendor & Supply Chain: Act as the technical lead for hardware deliveries, managing quality control, onsite technicians, and vendor accountability to ensure zero delays in capacity expansion.
  • Standardise Operational Discipline: Create the playbooks, labelling standards, and documentation that ensure our global datacenter footprint runs like a well-oiled machine.
  • Bridge the Physical & Digital: Coordinate with our software teams to ensure that physical deployments integrate seamlessly with our orchestration layers and monitoring tools.


Skills/Must-have:

  • Proven Infrastructure Experience: 4+ years in datacenter engineering or hardware operations, specifically working with high-density compute environments.
  • GPU & High-Perf Compute Knowledge: Deep familiarity with NVIDIA HGX/DGX platforms and the unique power and cooling requirements of modern GPUs.
  • Networking Expertise: Hands-on experience with high-speed networking, including InfiniBand, RoCE, and leaf-spine architectures.
  • Hardware Craftsmanship: A "no compromises" approach to cable management, labelling, and physical organisation; you understand that messy racks lead to messy outages.
  • Technical Communication: Ability to coordinate effectively with onsite datacenter staff, remote engineering teams, and high-level stakeholders.
  • Bias for Action: You thrive in the "racking" phase—you see a deployment bottleneck and take immediate ownership to clear it.


Benefits:

  • 10% bonus
  • Stock options


Salary:

  • S$160,000 – S$220,000+ (Based on experience and specialised hardware certifications)
Ben Davies Director Global AI Infrastructure

Apply for this role