Analytical and insightful Data Analyst known for productivity and efficiency in task completion. Specialise in data visualisation, statistical analysis, and predictive modeling. Committed to leveraging statistical and technical proficiency for data-informed decision-making. Eager to contribute to a dynamic team, supporting business objectives, and continually enhancing data analysis expertise to deliver meaningful results.
· Programming Skills: Python, Apache Spark, R, JavaScript, Java, C#,
· Database Tools: SQL, MySQL, MongoDB, Google Cloud, Cloud SQL
· Machine Learning: Regression, Classification, Clustering, SVM, Neural Networks
· Data Visualisation: Tableau, MS PowerBI, IBM Cognos, Excel, Matplotlib, Seaborn
· Data Handling: Data Wrangling, Data Cleaning, Data Analysis, Data Modelling, Data Visualisation, Machine Learning
New York Taxi and Limousine Fare Forecast – Google Advanced Data Analytics Capstone Project - 2024
· Performed data cleaning and preprocessing of datasets using Pandas and NumPy.
· Developed regression ML models to predict taxi fares
· Evaluated ML models using metrics such as R-Squared, RMSE
· Visualised analysis with graphs and charts produced by Matplotlib and Seaborn.
Sentiment Classification of Tweets – ML Project - University of Melbourne – 2022
· Wrangled tweet data using various text pre-processing techniques
· Conducted feature selection using Chi-Squared and Entropy metrics.
· Trained models using Cross-fold Validation.
· Evaluated models using confusion matrices and F1 score.
· Implemented various machine learning models, including Gradient-Boosting, SVM and Stacking.