IT Professional with 15+ years of expertise in AWS Cloud, DevOps, and Site Reliability Engineering (SRE), driving enterprise-scale infrastructure automation, observability, and operations. Skilled in designing, automating, and managing platforms using AWS, Kubernetes, Docker,Pulumi, Terraform, Ansible, and CloudFormation, with a strong focus on CI/CD pipelines, Security, and Scalability. Experienced in AWS cost optimization, performance tuning, and cloud resource efficiency, delivering significant OPEX savings across environments. Hands-on experience with Databricks for data engineering workloads, Kafka for event streaming, and Kibana/ELK stack for log analytics and monitoring. Proven success in large-scale datacenter & cloud migrations, high availability design, and resilient architecture leveraging VMware, Oracle RAC, and Red Hat clusters. Expertise in observability and monitoring using Prometheus, Grafana, Zabbix, ELK, CloudWatch, integrated with automation and alerting pipelines.
Strong background in Linux (RHEL), storage/SAN administration, and server builds, combined with automation using Python & Bash scripting.