About the Organization
The organization plays a central role in architecting and implementing cutting-edge digital health solutions -a major preventive care strategy aimed at improving long-term population health.
Why This Role Matters
As a Data Engineer, you will be integral to building and maintaining the data infrastructure that fuels analytics, reporting, and predictive modelling. Your work will directly influence how data is accessed, secured, and leveraged to improve healthcare outcomes.
Key Responsibilities
- Design, build, and maintain robust data pipelines for integrating and processing data from diverse sources and formats.
- Clean, prepare, and transform data for analytics, business intelligence, and data science use cases.
- Ensure comprehensive documentation of data processes and pipeline architecture.
- Monitor, troubleshoot, and improve the performance of data systems and pipelines.
- Identify optimisation opportunities for scalability, repeatability, and security.
- Handle data system errors and contribute to testing configurations for improved efficiency.
Requirements
Must-Have Skills :
Proven experience in designing scalable ETL pipelines to support AI and data science initiatives.Strong proficiency in SQL, NoSQL, and Python for data preparation, transformation, and automation.Hands-on experience with data lake management and data pipeline development.Familiarity with cloud collaboration and development tools (e.g., Office 365, Atlassian, AWS, Azure).Nice-to-Have Skills :
Exposure to AWS services (e.g., S3, Athena, Lambda, IAM, CloudWatch).Domain knowledge in health informaticsExperience supporting machine learning, clinical data projects, or hospital information systems.Familiarity with modern data engineering frameworks like Airflow, Docker, Kubernetes.Qualifications
Bachelor's degree in Computer Science, Information Technology, Computer Engineering, or a related field.3 to 7 years of hands-on experience in data engineering, pipeline development, and production deployment.Strong scripting abilities, preferably in Python.Solid SQL skills and experience with platforms like Informatica, Teradata, or SQL Server.