Site Reliability Engineer
IT – Manager
Cape Town – Western Cape
ENVIRONMENT:
A specialist consultancy focused on developing competitive software and data visualization solutions is seeking a Site Reliability Engineer. This role involves operating and maintaining their client’s hybrid infrastructure across three data centers and Amazon Web Services, including hundreds of database instances, CentOS-based virtualization, Kubernetes production clusters, and more. The ideal candidate has 3-5 years of experience in a DevOps/SRE role, with strong Linux skills and a solid understanding of Linux OS fundamentals.
REQUIREMENTS:
- 3-5 years of experience in a DevOps / SRE role
- Strong Linux skills and Linux OS fundamentals
- Experience in automation and CI/CD (Jenkins and Ansible)
- Experience with cloud infrastructure management tools: CloudFormation, Terraform
- Experience working with Cassandra/MySQL/Postgres
- Experience with scalable networking technologies such as load balancers (HAProxy)
- Familiarity with Python/Bash/Golang or other programming and scripting languages
- Deep understanding of the Kubernetes architecture
- Strong monitoring experience (Grafana/Prometheus/CheckMK)
- Solid troubleshooting skills and networking knowledge
- Extensive knowledge of AWS is a plus
- RHCE, RHCA, and Kubernetes certifications are beneficial, but not required
ATTRIBUTES:
- Detail-oriented, self-driven, with excellent communication skills
- Good oral and written English