We are exclusively representing a global investment firm that stands at the intersection of advanced mathematics, cutting-edge technology, and global finance. They operate a world-class, high-performance technology stack (including C++, Python, KDB+, and FPGA) to identify and execute on systematic opportunities. They are now seeking to add a strategic Site Reliability Engineer to a team that is fundamental to their research and trading prowess.
The Role in a Nutshell:
As an SRE, you will be a cornerstone of platform stability and performance. You will employ a software engineering mindset to build inherently reliable, scalable, and efficient systems. This role is not just about maintaining uptime; it's about designing and building the tools and platforms that allow researchers and traders to operate at the peak of their potential.
-
- Design, build, and maintain core platform infrastructure and services with a focus on automation, reliability, and scalability.
- Develop sophisticated software and automation tools (using Python, C++, or C#) to manage complex systems and eliminate manual operational tasks.
- Create and implement advanced monitoring, logging, and alerting systems to ensure deep observability and proactive incident prevention.
- Embed SRE principles within development teams, guiding them on scalability, performance, and reliability best practices.
- Lead response to and post-mortem analysis of incidents, driving improvements that prevent future occurrences.
- Ensure all systems meet the stringent security and compliance standards required of a leading financial institution.
Your Present Skillset:
-
- Proven experience in an SRE or Production Engineering role, with a strong background in software development using Python, C++, or C#.
- A deep understanding of Linux operating systems, networking, and distributed systems architecture.
- Hands-on experience with infrastructure-as-code, configuration management, and modern observability tools (e.g., Prometheus, Grafana).
- A commercial, outcome-oriented mindset. You understand that system reliability is not an end in itself but a critical business driver.
- The ability to work effectively in a collaborative, flat-structure environment with high-performing peers.
- Prior experience in the finance or a similarly demanding tech-first industry (e.g., big tech, high-scale SaaS) is highly advantageous.