Datacenter Engineer (AI Infrastructure) - Hosting
- S$160,000 – S$220,000+ (Based on experience and specialised hardware certifications)
- Adam Park, Singapore
- Permanent
- 150000
- 200000
- Artificial Intelligence
- AI Data Center
Looking to take a leadership role in AI and high-performance computing while working on next-generation GPU infrastructure?
Join a technology team delivering scalable GPU computing solutions and global infrastructure for AI and compute-intensive workloads. The organization is seeking a hands-on technical leader to spearhead the physical deployment of AI Factories, overseeing thousands of H100 and B200 GPUs across high-performance clusters. As a Founders Fund-backed NVIDIA cloud partner, the team powers foundation model training and enterprise AI, providing team members with rare exposure to both cutting-edge hardware and complex infrastructure challenges. This role operates at the intersection of silicon, networking, and cooling, offering opportunities to deploy, manage, and optimize high-performance systems while collaborating with experienced engineers in a fast-paced, innovative environment.
Apply now to drive impactful AI infrastructure projects and advance a career at the forefront of AI computing.
Responsibilities:
- Lead GPU Deployments: Oversee the end-to-end installation of large-scale GPU clusters, ensuring every server, switch, and cable is configured for maximum performance and reliability.
- Optimise Physical Infrastructure: Work directly with datacenter operators to manage power density, thermal loads, and space planning for high-wattage AI hardware.
- Troubleshoot Hardware at Scale: Partner with engineering to solve "hard" physical problems—from firmware bugs and faulty optics to complex network degradation within the cluster.
- Own Vendor & Supply Chain: Act as the technical lead for hardware deliveries, managing quality control, onsite technicians, and vendor accountability to ensure zero delays in capacity expansion.
- Standardise Operational Discipline: Create the playbooks, labelling standards, and documentation that ensure our global datacenter footprint runs like a well-oiled machine.
- Bridge the Physical & Digital: Coordinate with our software teams to ensure that physical deployments integrate seamlessly with our orchestration layers and monitoring tools.
Skills/Must-have:
- Proven Infrastructure Experience: 4+ years in datacenter engineering or hardware operations, specifically working with high-density compute environments.
- GPU & High-Perf Compute Knowledge: Deep familiarity with NVIDIA HGX/DGX platforms and the unique power and cooling requirements of modern GPUs.
- Networking Expertise: Hands-on experience with high-speed networking, including InfiniBand, RoCE, and leaf-spine architectures.
- Hardware Craftsmanship: A "no compromises" approach to cable management, labelling, and physical organisation; you understand that messy racks lead to messy outages.
- Technical Communication: Ability to coordinate effectively with onsite datacenter staff, remote engineering teams, and high-level stakeholders.
- Bias for Action: You thrive in the "racking" phase—you see a deployment bottleneck and take immediate ownership to clear it.
Benefits:
- 10% bonus
- Stock options
Salary:
- S$160,000 – S$220,000+ (Based on experience and specialised hardware certifications)