Subscribe to Job Alert
Join our happy subscribers
Never pay for any CBT, test or assessment as part of any recruitment process. When in doubt, contact us
Imagine a world where people live healthier, more enhanced and protected lives… A world in which each organisation is a powerful influencer and responsible corporate citizen, committed to being a force for social good. As a leading innovator in healthcare, wellness, insurance, investments, financial and life planning, Discovery works ceaselessly to...
Read more about this company
What is the role
In the Discovery Health Data Science unit, our core purpose is “creating intelligence for a healthier tomorrow” by leveraging Discovery’s vast data to drive valuable insights to improve both clinical and operational environments. Key to our purpose is obtaining and structuring quality data, leveraging cutting edge analytical innovations and delivering actionable insights in a sustainable and meaningful way. We leverage an integrated, collaborative, and multidisciplinary approach to ensure our objectives and goals are met.
The role entails building a reusable sustainable framework to ensure collection, processing and availability of high quality health care data to enable us to achieve the core purpose. The Data Engineer will work collaboratively with the Program Managers, Data Scientists, Systems Architects to define data sources and to build a custom data framework that facilitates Machine Learning, AI and productionising AI models based on the principles of ETL/ELT. Together these teams will enable data driven actionable insights. The role may include international exposure with Discovery partnerships.
What you will do
The successful applicant will be working within a highly specialized and growing team to enable delivery of data and advanced analytics system capability.
Responsibilities will include:
Develop and implement a reusable architecture of data pipelines to make data available for various purposes including Machine Learning (ML), Analytics and Reporting
Work collaboratively as part of team engaging with system architects, data scientists and business in a healthcare context
Define hardware, tools and software to enable the reusable framework for data sharing and ML model productionization
Work comfortably with structured and unstructured data in a variety of different programming languages such as SQL, R, python, Java etc
Understanding of distributing programming and advising data scientists on how to optimally structure program code for maximum efficiency
Build data solutions that leverage controls to ensure privacy, security, compliance and data quality
Understand meta-data management systems and orchestration architecture in the designing of ML/AI pipelines
Deep understanding of cutting edge cloud technology and frameworks to enable Data Science
System integration skills between Business Intelligence and source transactional
Improving overall production landscape as required
Define strategies with Data Scientists to monitor models post production
Write unit tests and participate in code reviews
What skills you will need
Technical skills core:
Expert in programming languages such as R, Python, Scala and Java
Expert database knowledge in SQL and experience with MS Azure tools such as Data Factory, Synapse Analytics, Data Lake, Databricks, Azure stream analytics and PowerBI
Modern Azure datawarehouse skills
Expert Unix/Linux admin experience including shell script development
Exposure to AI or model development
Experience working on large and complex datasets
Understanding and application of Big Data and distributed computing principles (Hadoop and MapReduce)
ML model optimization skills in a production environment
Production environment machine learning and AI
DevOps/DataOps and CI/CD experience
Technical skills additional:
AWS experience
Behavioural skills:
A passion for programming and working with data
Self-starter
Willingness to learn and grow exponentially
A restless curiosity in learning new technology
Ability to work cohesively in a team environment and balance multiple priorities
A team player who can work alone when required and without supervision
High level of attention to detail, resilience, enthusiasm, energy and drive
Positive, can-do attitude
Ethical and able to maintain confidentiality and manage boundaries
Aligned to Discovery values and core purpose
Professional Qualifications & Experience
Honours or Master’s degree in BSc Computer Science
Honours or Master’s degree in Engineering or Software Engineering with solid experience in data mining and machine learning
Other qualifications will also be considered if accompanied by the relevant experience
EMPLOYMENT EQUITY
The Company’s approved Employment Equity Plan and Targets will be considered as part of the recruitment process. As an Equal Opportunities employer, we actively encourage and welcome people with various disabilities to apply.
Build your CV for free. Download in different templates.
Join our happy subscribers