Senior Storage Engineer (AI Infrastructure) - Hosting

1711279
  • Circa $250,000 base salary
  • San Francisco, California, United States
  • Permanent
  • Artificial Intelligence
  • AI Software


Ready to take the next step in your career?

Join a rapidly scaling AI cloud infrastructure provider building next-generation GPU platforms for large-scale AI training, experimentation, and inference. The company is developing some of the most advanced AI-ready infrastructure environments in the market and is now significantly expanding operations across the United States alongside continued international growth.

The company is looking for a Senior Storage Engineer with experience supporting high-performance AI infrastructure and large-scale GPU environments. This role focuses on designing and optimizing scalable, low-latency storage solutions while working closely with infrastructure, networking, and platform engineering teams across data-intensive AI and machine learning workloads.

Don’t miss out on this exciting opportunity: apply today!


Responsibilities:

  • Design, deploy, and operate large-scale high-performance storage platforms supporting AI and HPC workloads
  • Manage and optimize distributed storage environments for large GPU training and inference clusters
  • Work closely with platform, compute, and networking teams to ensure end-to-end infrastructure performance
  • Troubleshoot storage bottlenecks, latency issues, throughput constraints, and data flow inefficiencies
  • Contribute to storage architecture strategy, scalability planning, and operational best practices
  • Automate storage provisioning, monitoring, and lifecycle management processes
  • Support performance tuning across parallel file systems, object storage, and AI data pipelines
  • Implement observability and capacity planning solutions for petabyte-scale environments


Skills/Must Have:

  • Deep experience in storage engineering within HPC, AI infrastructure, hyperscale, or large-scale data center environments
  • Hands-on expertise with VAST Data storage platforms (strongly preferred)
  • Strong understanding of high-performance distributed storage architectures and parallel file systems
  • Experience supporting GPU-intensive AI/ML workloads and high-throughput data environments
  • Strong Linux systems administration skills
  • Experience troubleshooting performance across storage, networking, and compute layers
  • Familiarity with NFS, RDMA, NVMe-oF, InfiniBand, and modern storage networking concepts
  • Automation and scripting experience using Python, Bash, or Ansible preferred
  • Strong understanding of scalability, resiliency, and data protection in enterprise storage environments


Benefits:

  • Stock options
  • Company bonus
  • Remote working options and allowance


Salary:

  • Circa $250,000 base salary

Ben Davies, Director, Global AI Infrastructure
