Arash Haydari

Mulambin, QLD

Summary

Experienced Data Engineer with over 15 years of experience designing, implementing, and managing complex data architectures. Proficient in cloud-based and on-premises solutions, particularly in the AWS ecosystem. Skilled at building and optimizing data pipelines that transform raw data into business intelligence. Detail-oriented in designing, developing, and maintaining highly scalable, secure, and reliable data structures. Accustomed to working closely with system architects, software architects, and design analysts to understand business and industry requirements and develop comprehensive data models. Proficient at developing database architectural strategies at the modeling, design, and implementation stages.

Work History

Data Engineer

Ubank

Remote, Queensland

03.2024 - Current

Data Engineering and Pipeline Design:
Data Processing and Integration: Leveraged Apache Spark and Databricks for efficient data ingestion and processing, ensuring robust data handling.
Data Modeling: Applied dbt and Data Vault methodologies to design and implement complex data models, structuring data in a multi-layered architecture (Bronze, Silver, Gold) for reporting and analysis.
ETL Process Development and Maintenance: Developed and maintained scalable ETL pipelines, integrating various data sources and ensuring smooth data flow.
CI/CD Deployment: Managed end-to-end CI/CD deployment processes using Jenkins, Terraform, and Bitbucket, streamlining integration and delivery cycles.
Testing and Quality Assurance: Created and implemented comprehensive test scenarios to verify data integrity and reliability across all stages.
Source and Destination Management: Integrated data from diverse systems and distributed it to various platforms and third-party services, optimizing data utilization for business growth and customer experience enhancement.
Collaboration and Workflow Automation:
Pipeline Automation: Partnered with data scientists and engineers to automate data workflows, enhancing efficiency and accuracy in data processing.
Continuous Improvement: Conducted performance tuning and optimization of data jobs, reducing latency and improving resource utilization.
Internal Training and Mentorship: Mentored and trained junior engineers on best practices in data engineering, focusing on Databricks, dbt, and CI/CD tools.
Technologies: Apache Spark, Databricks, dbt, Data Vault, Jenkins, Terraform, Bitbucket, CI/CD, Data Integration, Data Modeling, ETL

Senior Data Engineer

O2E Brands

06.2022 - 03.2024

Focus: AWS Ecosystem & Google BigQuery
Spearheaded the design and implementation of an AWS S3-based data lake, centralizing data storage with robust governance policies
Established and enforced data governance and quality frameworks
Developed and maintained ETL processes using Airflow and AWS Glue, optimizing the data pipeline for scalability and maintainability
Mentored junior engineers in best practices for cloud-based solutions, particularly AWS services
Introduced data versioning for traceability and compliance
Conducted performance tuning on SQL queries, reducing latency
Collaborated with data scientists to automate machine learning workflows
Played a key role in data migration projects, ensuring zero downtime
Led internal training sessions to uplift data literacy within the organization
Worked closely with business analysts to implement tailored BI solutions
Technologies: AWS Glue, AWS Redshift, Google BigQuery, Airflow, S3, Python, SQL

Senior Data Engineer

TD Bank

09.2019 - 06.2022

Focus: On-Premises & Cloud-based Data Solutions
Engineered and managed high-availability database systems
Automated routine database tasks, reducing manual effort
Conducted code reviews and quality assurance tests
Played an integral role in optimizing existing ETL processes
Assisted in the modernization of the data stack, integrating cloud-based solutions
Conducted PoCs for new technologies, presenting findings to executive leadership
Guided the organization in best practices for database scaling
Collaborated with cybersecurity teams to ensure data privacy and compliance
Technologies: Python, SQL, Oracle, PostgreSQL, ETL Tools

Data Engineer

Shaw Communications

02.2018 - 08.2019

Orchestrated a large-scale data transformation project, improving data consistency
Developed automated testing suites for data pipelines
Managed data sync processes between disparate data sources, ensuring data consistency
Fostered cross-departmental communication for a unified data strategy
Created and deployed SQL-based analytics reports for business units
Collaborated in the design and implementation of the company’s first data lake
Implemented metadata tagging for better data discoverability
Developed data quality checks that reduced errors
Recommended architecture improvements and tool solutions, and acted as a technical advisor
Technologies: SQL, Python, Hadoop, Hive, Spark

Data Engineer

Freedom Mobile

04.2013 - 02.2018

Engineered and optimized stored procedures for data retrieval
Automated monthly data aggregation tasks, improving efficiency
Designed data models for reporting and analytics
Played a key role in several data migration initiatives
Acted as the primary contact for data-related queries from business units
Advised on best practices for data visualization
Created reusable Python scripts for data extraction tasks
Collaborated with IT to ensure uptime and data security
Technologies: Python, SQL, Bash

ETL Developer

MCCI Corporation

10.2007 - 03.2013

Focus: Revenue Assurance and Fraud Management
Developed code and procedures for repeatable data retrievals, summarized by desired groupings
Served as a technical/data subject matter expert, reporting to multiple business units
Conducted data analysis to provide actionable insights for executive-level management
Managed large data sets and derived conclusions from complex analytics
Worked on fraud detection algorithms, contributing to risk mitigation
Introduced a modular approach to ETL development, significantly enhancing maintainability
Technologies: SQL, Bash, Oracle

Education

Databricks Academy Accreditation - Databricks Lakehouse Fundamentals (Databricks)

dbt Fundamentals (dbt Labs)

Bachelor of Computer Science

Pune University

2007

Skills

Data Curating
Data Security
Git Version Control
Agile Methodologies
API Development
Team Leadership
Cloud Computing
Data Warehousing
ETL Development
Big Data Processing
Project Management
Hadoop Ecosystem
Metadata Management
NoSQL Databases
Advanced SQL
Database Management
Python Programming
Data Integration
Data Quality Assurance

Spark Development
Data Modeling
RDBMS
SQL and Databases
Data Analysis
Risk Analysis
Data Operations
Analytical Thinking
Amazon Redshift
Data Acquisitions
Data Governance
Advanced Data Mining
Database Optimization
Database Development
Teradata Database
Data Synchronization
Data Mapping
Data Structures

Certification

Data Engineering AWS Certified Cloud Practitioner (Amazon Web Services) (Nov 2022)
Bash Scripting (Codecademy) (Oct 2022)
Alteryx Designer Core (Alteryx) (Jul 2021)
Advanced SQL queries with MySQL 5.7+ (Udemy) (Jan 2018)
PHP (Sololearn) (Jan 2018)
Python 3 (Sololearn) (Jan 2018)

Quote

Judge a man by his questions rather than his answers.

Voltaire
