Senior Site Reliability Engineer - Neocloud Provider

1650331
  • Up to €130,000
  • Remote (EU)
  • Permanent
  • Artificial Intelligence
  • AI Network


Do you want to join a leading next-generation AI cloud provider as a Senior Site Reliability Engineer? 

You will be joining a Neocloud that is building one of the most advanced GPU and high-performance computing platforms in Europe. 

The role offers the chance to help design and maintain the reliability, scale and performance of a growing cloud platform with real engineering challenges. 

You will collaborate with highly skilled teams across software, hardware, networking, and AI infrastructure, with the autonomy to influence technical direction and build systems that support large-scale compute workloads.

If you are interested in this opportunity and want to learn more, get in touch today. 

Responsibilities:

  • Architect and maintain reliable, fault-tolerant, large-scale distributed systems for high-performance GPU and compute workloads.
  • Build and automate deployment, failover, monitoring, capacity planning, and incident-response workflows. 
  • Develop, optimise, and maintain CI/CD pipelines to enable safe, rapid, and repeatable software delivery. 
  • Drive incident response and root-cause analysis while improving system observability, performance, and long-term stability.
  • Partner with backend, hardware, and networking teams to optimise service performance, support regional expansion, and scale compute clusters.
  • Participate in on-call rotations.

Required Skills & Experience:

  • Strong Linux debugging expertise, including network and system-call tracing.
  • Proficiency with Terraform and Kubernetes (network policies, scheduling, taints/tolerations). 
  • Experience with Slurm job monitoring and core configuration.
  • Solid Python or Go skills, covering asynchronous programming, error handling, environment management, and common system/HTTP tooling.
  • Ability to automate workflows and troubleshoot distributed systems using CLI tools, logs, and scripting.

Salary & Benefits:

  • Up to €130,000 gross per year
  • Bonus scheme
  • Company share scheme

Sam Hammersley Network Consultant BLX
