Site Reliability Engineer, AWS Cloud

Our investment client is seeking an AWS cloud SRE expert for their new-build cloud infrastructure team.

Pathos Consultancy Limited - Hong Kong - Full time

Salary: HK$1200k - HK$1800k

Responsibilities:

  • Gather and analyze metrics from both pre-trade and post-trade operating systems to assist in performance tuning and fault finding.
  • Partner with DevOps and development teams to improve the reliability and performance of the cloud infrastructure and web services through rigorous automated testing and release procedures.
  • Participate in system design consulting, platform management, and capacity planning.
  • Create a sustainable containerised platform capable of managing high volumes of data and services through automation and uplifts.
  • Balance the speed and reliability of both existing-system enhancements and new-build feature development, guided by clear service-level objectives and guidelines.

Requirements:

  • At least 5 years of site reliability engineering or DevOps engineering experience gained at top investment or capital management institutions.
  • Expertise in containerisation technologies, e.g. Kubernetes and Docker, and Red Hat Linux/Unix OS in a complex cloud-native environment.
  • Hands-on experience in building automation tools and scripting using Python, Golang, or Rust.
  • Solid cloud tooling experience in monitoring and configuration using Prometheus, Grafana, Kibana, Ansible, Puppet, Terraform, IaaC, etc.
  • Sound knowledge of data management and networking technologies such as message queuing (Kafka/RabbitMQ/ZeroMQ), Ignite, Spark, Airflow, firewalls, storage, and server virtualisation.