Site Reliability Engineer

IT – Manager
Cape Town – Western Cape

ENVIRONMENT:
A specialist consultancy focused on developing competitive software and data visualization solutions is seeking a Site Reliability Engineer. This role involves operating and maintaining their client’s hybrid infrastructure across three data centers and Amazon Web Services, including hundreds of database instances, CentOS-based virtualization, Kubernetes production clusters, and more. The ideal candidate should have 3-5 years of experience as a DevOps/SRE, with strong Linux skills and a solid understanding of Linux OS fundamentals.
 
REQUIREMENTS:
  • At least 3-5 years of experience as a DevOps / SRE
  • Strong Linux skills and Linux OS fundamentals.
  • Experience in automation, CI and CD (Jenkins and Ansible)
  • Experience with cloud infrastructure management tools: CloudFormation, Terraform
  • Knowledge working with Cassandra/MySQL/Postgres
  • Scalable networking technologies such as Load Balancers (HAProxy)
  • Familiarity with Python/Bash/Golang or other scripting languages
  • Deep understanding of the Kubernetes architecture
  • Strong monitoring experience (Grafana/Prometheus/CheckMK)
  • Solid troubleshooting skills and networking knowledge
  • Extensive knowledge of AWS is a plus
  • RHCE, RHCA, and Kubernetes certifications are beneficial, but not required
 
ATTRIBUTES:
  • Detail-oriented, self-driven, with excellent communication skills
  • Good oral and written English

+ 27 (0) 21 741 0400