Jobs Career Advice Signup
X

Send this job to a friend

X

Did you notice an error or suspect this job is scam? Tell us.

  • Posted: Apr 15, 2024
    Deadline: Not specified
    • @gmail.com
    • @yahoo.com
    • @outlook.com
  • Never pay for any CBT, test or assessment as part of any recruitment process. When in doubt, contact us

    Datafin was established in 1999 due to the need for a specialized IT recruitment solution. We offer a personalized and flexible recruitment service, specializing in providing both client and candidate with the perfect fit. We pride ourselves on the fact that we have established relationships with industry leaders and a vast majority of our business is repeat...
    Read more about this company

     

    Site Reliability Engineer - Pretoria/Centurion

    DUTIES:

    Kubernetes CI/CD:

    • Designing, implementing, and maintaining CI/CD pipelines for Kubernetes-based applications.
    • Automating deployment processes and ensuring continuous integration and delivery of software.

    Monitoring and Reporting:

    • Implementing monitoring solutions for infrastructure and applications using tools such as Prometheus, Grafana, and Kubernetes-native monitoring.
    • Generating reports on system performance, availability, and reliability.

    Log Analysis:

    • Analysing logs and metrics to identify trends, anomalies, and performance issues.
    • Implementing log aggregation and analysis solutions like ELK Stack or Splunk.

    Application Troubleshooting:

    • Investigating and resolving issues related to application performance, availability, and reliability in Kubernetes environments.
    • Collaborating with development teams to diagnose and debug complex issues.

    Alerting and Escalation:

    • Setting up alerting mechanisms to proactively detect and respond to incidents.
    • Escalating critical issues to appropriate teams and stakeholders.

    Linux Administration and Maintenance:

    • Managing and maintaining Linux servers, including installation, configuration, and patch management.
    • Implementing security measures and best practices for Linux-based systems.

    Active Directory Admin and Maintenance:

    • Managing user accounts, groups, and permissions in Active Directory.
    • Performing routine maintenance tasks and ensuring the security of AD infrastructure.

    DNS Admin and Maintenance:

    • Configuring and managing DNS servers and zones.
    • Troubleshooting DNS-related issues and ensuring DNS resolution reliability.

    End-User Support:

    • Providing technical support and assistance to end-users for infrastructure-related issues.
    • Resolving hardware, software, and connectivity problems promptly.

    Database Administration (PostgreSQL):

    • Managing PostgreSQL databases, including installation, configuration, and performance tuning.
    • Performing routine maintenance tasks such as backups, restores, and upgrades.

     
    REQUIREMENTS:

    • 3+ years of experience in a Site Reliability Engineer role or similar position.
    • Proficiency in Kubernetes administration and experience with CI/CD pipelines.
    • Strong Linux administration skills, including shell scripting and troubleshooting.
    • Experience with monitoring and logging tools such as Prometheus, Grafana, ELK Stack, or Splunk.
    • Familiarity with Active Directory administration and DNS management.
    • Experience with PostgreSQL database administration is a plus.

    ATTRIBUTES:

    • Excellent communication and problem-solving skills.
    • Ability to work effectively in a fast-paced, collaborative environment.

    Method of Application

    Interested and qualified? Go to Datafin Recruitment on www.datafin.com to apply

    Build your CV for free. Download in different templates.

  • Send your application

    View All Vacancies at Datafin Recruitment Back To Home

Subscribe to Job Alert

 

Join our happy subscribers

 
 
Send your application through

GmailGmail YahoomailYahoomail