Your new company We are supporting a next-generation supercomputing data centre built for large-scale AI workloads. The infrastructure is live, and we are now hiring an AI Engineer to help customers onboard, scale, and optimize their AI workloads on high-performance GPU clusters.
Your new role This role focuses on applied AI, distributed training, and performance optimization -turning powerful infrastructure into real AI outcomes.
- Support customer onboarding and deployment of AI workloads on large GPU clusters
- Optimize AI models for multi-GPU and distributed training
- Diagnose and resolve GPU performance, memory, and scaling issues
- Work with AI frameworks such as PyTorch / TensorFlow
- Collaborate with platform, HPC, and customer AI teams
- Share best practices and provide hands-on technical guidance
What you'll need to succeed - 3-5 years experience in AI / machine learning engineering
- Strong hands-on experience with GPU-based model training
- Proficiency in Python and Linux environments
- Experience with at least one major AI framework (PyTorch preferred)
- Solid understanding of AI training workflows and performance tuning
Nice to have - Distributed or multi-GPU training experience
- Exposure to HPC, CUDA, NCCL, or GPU optimization
- Docker / container or cluster scheduling experience
If you're interested in this role, click 'apply now' to forward an up-to-date copy of your CV, or call us now.
If this job isn't quite right for you, but you are looking for a new position, please contact us for a confidential discussion on your career.