ANBUSELVI EKAMBARAM

Melbourne

Summary

Data team lead with 15 years of experience in data engineering, analysis, and machine learning. Proven track record of leading projects from concept to delivery. Skilled in managing teams, developing strategies, and driving growth. Applying for the position of Engineering Manager at Easy Agile to maximize data insights and support product development.

Overview

16 years of professional experience
1 Certification

Work History

DATA TEAM LEAD

Online Pajak
11.2021 - Current
  • Designed, implemented, and tested data platform solutions, reducing AWS service costs by 25% and improving data pipeline performance by 35% through the adoption of Databricks
  • Worked with C-suite executives and external stakeholders to build the Data team roadmap for every quarter
  • Advised management, business and technical staff on solutions using specific domains or technology
  • Researched and adopted new technologies to add value to existing offerings
  • Worked on data migration between AWS regions
  • Optimized the performance of Airflow orchestration
  • Recruited, onboarded and led a team of 2 Data Engineers and 2 Data Analysts
  • Implemented best practices and guidelines that reduced defects by 30% and cut the delivery time of ad hoc data requests by 80%

LEAD DATA ENGINEER

Capgemini - NAB
10.2018 - 04.2021


  • Tools: Python, Airflow, AWS, Redshift, Snowflake, Jenkins, Terraform
  • End-to-end experience in delivering Enterprise Data Lake layers, from Ingestion and Curation to Serving and Conformance
  • Designed the architecture for a next-generation data platform to process and analyze millions of records, enabling machine learning models to provide real-time insights and increasing application performance by 25%
  • Worked closely with Data Modelers and Business stakeholders to finalize the Data Vault design according to business requirements, and with the Enterprise Architect team to get the design endorsed
  • Designed and implemented data pipelines to serve data to consumers from the Curated Layer on S3
  • Used Airflow DAGs to orchestrate daily conformance jobs, managed access control in AWS, maintained CI/CD pipelines, and deployed infrastructure with Terraform
  • Authored development guidelines to expedite application design efforts through ready-made frameworks
  • Reviewed business success drivers, applying strategic prioritization to future architectural updates.

SOLUTION DESIGNER/DATA ENGINEER

Capgemini - BNP Paribas
06.2017 - 09.2018
  • Tools: MapR Hadoop, Spark, Java, Oozie, MapRDB, SONAR
  • Implemented an ingestion framework to secure client data and IRP calculation data in Datahub, resulting in an overall increase in reporting accuracy of 90%
  • Contributed to performance optimization of Spark jobs and designed efficient queries against Apache Drill
  • Performed impact analysis and provided solutions for performance issues.


DATA ENGINEER

Capgemini - Standard Chartered
04.2016 - 06.2017
  • Tools: Hortonworks Hadoop, Hive, MapReduce, Sqoop, HBase, Oozie, Unix, Falcon
  • Developed Hadoop frameworks to ingest data from various source systems and performed data quality checks on the source data before loading it into the Hadoop environment
  • Achieved a 25% reduction in time-to-execution, 95% accuracy in data validation checks and a 40% increase in ingestion speed into HDFS
  • Implemented checksum validation of source and processed files to improve data quality
  • Simplified data extraction from HDFS using Teradata Studio

CONSULTANT 2, DEVELOPER

UNISYS
12.2014 - 04.2016
  • Tools: Cloudera Hadoop, Hive, Pig, Sqoop, Oracle 10g, SQL
  • Used Apache Sqoop to transfer eight million transactions daily from RDBMS to HDFS, resulting in a 27% improvement in processing speed and a 23% reduction in storage costs
  • Collected and aggregated large amounts of streaming data using Apache Flume and staged the data in HDFS for further analysis
  • Performed log analysis and transformation using Hive


IT ANALYST

Tata Consultancy Services Ltd
08.2007 - 08.2014
  • Worked on the Order Management System for a major US telecom provider
  • Implemented an Online Lockbox for a major US insurance provider

Education

Post Graduate Program - Artificial Intelligence and Machine Learning: Business Applications, Computer Science

The McCombs School of Business, The University of Texas
AUSTIN
01.2024

Bachelor of Science - Computer Science and Engineering

Madras Institute of Technology
CHENNAI, TAMIL NADU, INDIA
2007

Skills

  • Cloud and Big Data Ecosystem
  • Airflow, Jenkins, Terraform, Docker, Jira, Git
  • Snowflake, Databricks, Redshift, MySQL, Oracle, MapRDB
  • Data Lake Design
  • AWS, Azure, Spark-Scala, MapReduce, Hive
  • Business Analysis and Development

Certification

HashiCorp Certified: Terraform Associate 2021

Snowflake SnowPro Core 2020
