Site Reliability Engineer

We are exclusively representing a global investment firm that stands at the intersection of advanced mathematics, cutting-edge technology, and global finance. They operate a world-class, high-performance technology stack (including C++, Python, KDB+, and FPGA) to identify and execute on systematic opportunities. They are now seeking to add a strategic Site Reliability Engineer to a team that is fundamental to their research and trading prowess. The Role in a Nutshell: As an SRE, you will be a co

Ashford Benjamin Ltd - Hong Kong - Full time

Salary: HK$60k - HK$60k

We are exclusively representing a global investment firm that stands at the intersection of advanced mathematics, cutting-edge technology, and global finance. They operate a world-class, high-performance technology stack (including C++, Python, KDB+, and FPGA) to identify and execute on systematic opportunities. They are now seeking to add a strategic Site Reliability Engineer to a team that is fundamental to their research and trading prowess.

The Role in a Nutshell:

As an SRE, you will be a cornerstone of platform stability and performance. You will employ a software engineering mindset to build inherently reliable, scalable, and efficient systems. This role is not just about maintaining uptime; it's about designing and building the tools and platforms that allow researchers and traders to operate at the peak of their potential.

    • Design, build, and maintain core platform infrastructure and services with a focus on automation, reliability, and scalability.
    • Develop sophisticated software and automation tools (using Python, C++, or C#) to manage complex systems and eliminate manual operational tasks.
    • Create and implement advanced monitoring, logging, and alerting systems to ensure deep observability and proactive incident prevention.
    • Embed SRE principles within development teams, guiding them on scalability, performance, and reliability best practices.
    • Lead response to and post-mortem analysis of incidents, driving improvements that prevent future occurrences.
    • Ensure all systems meet the stringent security and compliance standards required of a leading financial institution.

Your Present Skillset:

    • Proven experience in an SRE or Production Engineering role, with a strong background in software development using Python, C++, or C#.
    • A deep understanding of Linux operating systems, networking, and distributed systems architecture.
    • Hands-on experience with infrastructure-as-code, configuration management, and modern observability tools (e.g., Prometheus, Grafana).
    • A commercial, outcome-oriented mindset. You understand that system reliability is not an end in itself but a critical business driver.
    • The ability to work effectively in a collaborative, flat-structure environment with high-performing peers.
    • Prior experience in the finance or a similarly demanding tech-first industry (e.g., big tech, high-scale SaaS) is highly advantageous.
23348459
Ad