Senior Azure Data Engineer with over 5 years of experience designing, developing, and deploying scalable data solutions that drive business impact across cloud and hybrid infrastructures.
Proficient in building enterprise-grade data pipelines using Azure Data Factory (ADF), Azure Databricks (PySpark/Scala), and Azure Synapse Analytics, supporting batch and real-time data ingestion, transformation, and orchestration.
Expert in advanced Power BI solutions, including composite models, incremental refresh, and deployment pipelines, using Azure Data Lake Storage Gen2, Dataverse, and Azure Analysis Services to ensure scalable and governed BI delivery.
Designed and deployed high-performance analytics using Power BI Direct Lake, enabling large-scale data analysis with significantly reduced data duplication and improved efficiency.
Hands-on experience with Azure Databricks, including Delta Live Tables for automated pipeline orchestration and MLflow for experiment tracking in Python environments.
Designed modern data lakehouse architectures leveraging Azure Data Lake Storage Gen2, Delta Lake, and Spark SQL, and managed diverse data formats including Parquet, Avro, JSON, and CSV.
Successfully migrated on-premises data solutions (SQL Server, Oracle) to Azure, delivering high-performance, secure, and cost-effective cloud-based systems.
Proficient in designing dimensional data models with Star and Snowflake schemas, complemented by deep experience in the Microsoft SQL Server BI stack (SSIS, SSAS, SSRS) and strong T-SQL scripting capabilities.
Proven expertise in building interactive Power BI dashboards with advanced DAX and real-time capabilities, integrated with Azure SQL, Synapse, and Kafka streams to deliver insights from high-volume data across finance, inventory, and operations.
Experienced in handling high-volume, distributed data workloads using a Big Data stack that includes Hadoop, Hive, HDFS, Sqoop, Spark, Kafka, HBase, Oozie, and ZooKeeper.
Implemented data quality checks and data validation frameworks using Python libraries such as Great Expectations and Pandas Profiling, integrated with ADF and Databricks for early detection of data anomalies and enhanced data governance.
Expert in Azure Data Factory V2 (ADF V2), specializing in developing flexible parameterized pipelines, configuring trigger-based executions, and orchestrating complex data workflows across heterogeneous systems.
Applied DevOps practices by automating CI/CD pipelines using Azure DevOps and GitHub Actions to build, test, and deploy data pipelines and Azure services.
Expertise in using Azure PaaS components (Logic Apps, Azure Functions, Event Hubs, Azure Monitor, Azure App Services, Azure Key Vault) to deliver scalable cloud-native data solutions.
Developed Python scripts for data quality validation, profiling, and automation across both cloud and on-premises systems, enhancing data reliability and reducing manual intervention.
Experienced in Agile/Scrum methodologies, driving collaborative, sprint-based delivery with a commitment to documentation and iterative improvement throughout the SDLC.
Implemented enterprise-grade metadata management and governance frameworks using Azure Purview and Microsoft Fabric Data Governance, enabling lineage tracking, data classification, access control, and compliance.
Proficient in implementing Azure security measures such as RBAC, Managed Identities, and Private Endpoints to secure data access and ensure compliance across Azure Data Factory, Databricks, and storage layers.
Proficient in building observability solutions using Azure Monitor, Log Analytics, and custom metric alerts to monitor pipeline health, SLA adherence, and data latency.
Hands-on experience in automating infrastructure provisioning using Bicep templates and Azure DevOps, ensuring efficient, scalable, and version-controlled deployments for data engineering.
DP-700