Summary
Overview
Work History
Education
Skills
Timeline
Generic

Kai Soon

Melbourne

Summary

Kai is a Data Engineer with experience in developing ELT pipelines using DBT, Databricks, Snowflake, SQL, Python, PySpark, hosted on cloud platforms such as Azure DevOps and AWS.

Kai completed his Masters in IT at the University of Melbourne with a specialisation in Computing. His thesis was on the application of Machine Learning and Natural Language Processing on transcribed parliamentary speeches in an unstructured data format to investigate what could have caused a bill to pass. Speech data were obtained from Hansard. https://www.aph.gov.au/Parliamentary_Business/Hansard

Overview

6
6
years of professional experience

Work History

Senior Data Engineer (Consultant)

UniSuper
07.2023 - Current

UniSuper is an Australian superannuation fund that provides superannuation services to employees of Australia's higher education and research sector.

In 2023, UniSuper’s Data Office launched an innovative data strategy around governance and cloud enablement, centred on the provisioning of a new Azure Databricks Modern Data Platform. With NCS as delivery partner, they began a three-year program to transform UniSuper’s Data Analytics capability and uplift their Data Governance practice.

🔸Achievement/s🔸
Successfully delivered a reusable framework with over 500 data models in the 1st year of the project

🔹Responsibilities🔹

  • Building, uplifting, and deploying data models with DBT & Databricks
  • Running and debugging source ingestion on Azure Data Factory
  • Managing code and releases on Azure DevOps
  • Coaching engineers on coding best practices and the ELT process


Skills: DBT (Data Build Tool) · SQL · Extract, Transform, Load (ETL) · Databricks · Azure DevOps Services · Python

Senior Data Engineer

NCS Group Australia
12.2022 - Current

NCS Group Australia provides a broad range of IT and digital transformation services, including cloud solutions, cybersecurity, data analytics, and digital experience development. Their offerings are tailored to meet the needs of various sectors such as government, healthcare, financial services, and transport.

Clients engaged:

  • NAB
  • UniSuper
  • Safe Transport VIC


For details of my achievements and responsibilities with each client, please look for client heading under my experiences.

Senior Data Engineer (Consultant)

Safe Transport Victoria
01.2025 - 03.2025

Safe Transport Victoria (STVic) is a government authority in Victoria, Australia, and is responsible for managing and regulating the safety of the commercial passenger vehicle industry, including taxis, rideshares, and buses, as well as marine safety.

In 2025, STVic engaged NCS to deliver their very first cloud data platform to uplift and automate their analytics capability.

🔸Achievements🔸
Successfully constructed/adapted 7 macros and 52 models with DBT in 2.5 months. Source tables are consolidated and materialised on Snowflake as a single source of truth based on business rules established by working with SMEs.

🔹Responsibilities🔹

  • Building data models and abstracting processes to construct a data platform that is sustainable and extendable.
  • Establishing build patterns that are both modular and reusable such that future development work can be expedited.
  • Adapting previous learnings of data pipeline builds on a different system.


Skills: DBT (Data Build Tool) · Snowflake · SQL · Python · Apache Airflow · Amazon Web Services (AWS)

Data Engineer (Consultant)

NAB
03.2023 - 07.2023

NAB (National Australia Bank) is one of Australia’s largest financial institutions, providing a wide range of banking and financial services.

🔸Achievement/s🔸
Developed a framework to automate tests on data models

🔹Responsibilities🔹

  • Running Databricks jobs to ingest data
  • Building and running automated tests to detect anomaly in ingest data


Skills: Python · SQL · Coaching & Mentoring · Databricks · Git

Senior Data Engineer

Rest Superanuation
06.2022 - 10.2022

REST Superannuation (Retail Employees Superannuation Trust) is one of Australia's largest industry super funds, primarily designed for workers in the retail and hospitality sectors but open to all.

🔹Responsibilities🔹

  • Assessment of data integration tools and warehouse to replace and improve performance of existing data pipeline
  • Running ETL pipelines
  • Dimensional modelling


Skills: ETL Tools · Risk Assessment · Extract, Transform, Load (ETL) · Bash · Data Modeling

Data Engineer (Consultant)

IAG
02.2022 - 06.2022

IAG (Insurance Australia Group) is a major general insurance company based in Australia and New Zealand.

🔹Responsibilities🔹

  • Migrating PostgreSQL scripts to the DBT
  • Implementing change data capture (type 2 slowly changing dimension) with DBT


Skills: DBT (Data Build Tool) · SQL · Data Migration · Git BASH

Senior Data Engineer (Consultant)

Energy Australia
11.2021 - 02.2022

EnergyAustralia is one of Australia's largest electricity and gas retailers.

🔹Responsibilities🔹

  • Migration of on-premise data models to a cloud infrastructure
  • Constructing data processing pipelines


Skills: Data Migration · SQL · Python · Data Pipelines · Microsoft SQL Server

Data Engineer

NAB
08.2021 - 11.2021

NAB (National Australia Bank) is one of Australia’s largest financial institutions, providing a wide range of banking and financial services.

🔹Responsibilities🔹

  • Running and maintaining data transformation scripts


Skills: PySpark · Python · Jupyter · Data Pipelines · Git

Data Engineer

IPRO Group Pty Ltd
11.2019 - 08.2021

iPro Group is a financial services company that specialises in providing a range of solutions, including accounting, financial planning, and business advisory services.

🔸Achievement/s🔸

  • Successfully constructed ELT pipelines to transform raw CRM data into dashboards for business analytics. Use cases include contractor risk & modern slavery assessment, customer churn, acquisition, up-sell, down-sell, lifetime value, & onboarding friction.
  • Researched and implemented deterministic & probabilistic record linkage algorithm to associate 20 million Australian business registry records with CRM data


🔹Responsibilities🔹

  • Constructing and managing ELT pipelines
  • Designing dashboards for business intelligence
  • Configuring schedulers for automated ETL processes


Skills: Extract, Transform, Load (ETL) · PySpark · DBT (Data Build Tool) · Tableau · SQL

Education

Masters of Information Technology - Information Technology

University of Melbourne
Parkville, VIC
01-2020

Bachelor of Applied Music - Music

Box Hill Institute of TAFE
01-2012

Skills

  • Data pipeline design
  • Data warehousing (Databricks & Snowflake)
  • DBT Development
  • Python programming
  • Spark development
  • Advanced SQL
  • Git version control
  • Continuous integration

Timeline

Senior Data Engineer (Consultant)

Safe Transport Victoria
01.2025 - 03.2025

Senior Data Engineer (Consultant)

UniSuper
07.2023 - Current

Data Engineer (Consultant)

NAB
03.2023 - 07.2023

Senior Data Engineer

NCS Group Australia
12.2022 - Current

Senior Data Engineer

Rest Superanuation
06.2022 - 10.2022

Data Engineer (Consultant)

IAG
02.2022 - 06.2022

Senior Data Engineer (Consultant)

Energy Australia
11.2021 - 02.2022

Data Engineer

NAB
08.2021 - 11.2021

Data Engineer

IPRO Group Pty Ltd
11.2019 - 08.2021

Masters of Information Technology - Information Technology

University of Melbourne

Bachelor of Applied Music - Music

Box Hill Institute of TAFE
Kai Soon