Systems Engineer - High Performance Computing
Hong Kong (Hybrid)
HK$1.5M+ total compensation (base + performance bonus)
We are partnering with a leading global quantitative hedge fund to hire an HPC Systems Engineer to support and scale high-performance research and trading infrastructure in Hong Kong.
The firm operates at the forefront of systematic trading, with Asia-Pacific a key growth region. You will support latency-sensitive trading systems and large-scale compute platforms while working closely with global engineering teams.
This is a high-impact role for an engineer who thrives in performance-critical environments and enjoys solving complex systems challenges at scale.
The Role
You will be responsible for designing, operating, and optimising large-scale HPC compute and storage infrastructure across both on-premise and cloud environments, supporting research and trading teams across APAC.
Key Responsibilities:
- Design, support, and operate HPC compute and storage platforms
- Maintain and enhance large-scale Linux systems across compute, storage, networking, automation, and monitoring
- Troubleshoot complex cross-layer issues in collaboration with global infrastructure teams
- Manage and optimise batch and containerised workloads across CPU and GPU environments
- Develop and operate cloud infrastructure (AWS, Azure or GCP)
- Build and maintain HPC management tooling, access control modules, and internal libraries
- Develop observability pipelines and performance metrics to improve cluster utilisation and efficiency
- Manage deployments, upgrades, fixes, and infrastructure lifecycle processes
- Drive automation to improve reliability, scalability, and user experience
Requirements
- Bachelors degree or higher in Computer Science, Engineering, or a related field
- 1–5 years experience in Linux systems, DevOps, HPC, or infrastructure engineering
- Strong understanding of Linux internals (process scheduling, virtual memory, filesystems, networking)
- Experience with distributed/networked storage systems (NFS, Weka, object storage, etc.)
- Familiarity with Infrastructure-as-Code tools
- Strong scripting/programming skills (Python, Go, or Bash)
- Experience with container technologies (Docker/Podman, Kubernetes)
- Solid understanding of networking fundamentals (TCP/IP, Ethernet)
Nice to Have
- Experience managing GPU-based compute platforms
- Exposure to CI/CD systems and release automation
- Experience operating hybrid on-prem + cloud HPC environments
- Interest in performance tuning and systems optimisation in trading environments