Data & AI Engineer | DevOps Enthusiast

Cloud-agnostic data platforms | GenAI pipelines | Production Deployments | 100% Reliable delivery

I design and lead pragmatic data engineering solutions that turn raw data into reliable, production-grade insights. 6+ years with organisations accross the world like RAKEZ and BCG building metadata-driven ingestion, quality frameworks, and scalable pipelines on AWS, Azure, and GCP.

"A professional Data AI Engineer and DevOps Enthusiast who is building data recipes using the coding hands topped with GenAI fingers to fill the tummy with insights so that the mind can take decisions and unlock new possibilities."

SKILLS & SERVICES

Hello! I'm Karan Sawlani

Data & AI Engineer Professional | DevOps Enthusiast | 8x Databricks | 1x Cloudera | Hacker Rank Gold - SQL & Python | @RAKEZ | Ex-BCG | 11k+ on LinkedIn

  • 6+ years delivering data platforms and analytics at RAKEZ and BCG
  • Cloud-agnostic frameworks for ingestion, data quality, observability, and automation
  • Kubernetes, Terraform, CI/CD, and multi-cloud (AWS/Azure/GCP) delivery experience
  • Designing cloud-agnostic, production-grade platforms and GenAI-enhanced data engineering.
0+ Years leading enterprise data & AI programs
0x Professional certifications across DE/ML/GenAI
0+ Mock interviews conducted
0+ Professionals mentored
Multi-cloud delivery (AWS / Azure / GCP) GenAI accelerators Metadata-driven Data frameworks Community-first mentor & speaker

SERVICES

Data Engineering Platforms

Cloud-agnostic, metadata-driven frameworks for ingestion, data quality, Delta/Lakehouse design, and robust orchestration.

Cloud & DevOps

Infrastructure-as-code and platform automation on AWS, Azure, and GCP using Kubernetes, Terraform, Helm, and GitHub Actions.

Analytics & GenAI

Customer 360 analytics, feedback analysis, PII detection/masking, automated reporting, and LLM-powered assistants.

Team Enablement

Mentoring, best-practice enablement, and documentation to help teams scale delivery safely and predictably.

Data Governance & Quality

DAMA-aligned strategy covering modeling, cataloging, lineage, observability, and automated quality guardrails across SAP, Salesforce, and ERP sources.

Coaching & Community

1:1 mentoring, 200+ mock interviews, and enablement programs that accelerate team onboarding and keep engineering rigor consistent.

SKILLS

Programming

Python, Bash, HCL

Data Platforms

Databricks, Snowflake, Hive, Vertica

Data Engineering

PySpark, Hadoop, Kafka, Airflow, NiFi, Sqoop, Oozie, Delta Lake

Cloud (AWS/Azure/GCP)

EMR, Glue, Lambda, Redshift, AKS, ADLS, Dataproc, GKE

DevOps

Docker, Kubernetes (EKS/AKS/GKE), Helm, Terraform, CI/CD (GitHub Actions)

GenAI

LLMs (GPT/Llama), OpenAI APIs, LangChain, LangGraph, Streamlit

Databases

MySQL, Postgres

Documentation

Draw.io, Mermaid, Lucidchart, Miro

Operating Systems

Linux (scripting), Windows, macOS

Version Control & IDE

GitHub, GitLab, Bitbucket, PyCharm, Sublime

AWS Highlights

VPC, NAT/SG/NACL, EC2, S3, IAM, EMR, Glue, Lambda, Athena, Redshift, EKS, ECR

Azure Highlights

VNet, NSG, VM, ADLS/ABFS, IAM, Azure Functions, AKS, ACR, AD

GCP Highlights

VPC, Firewall, VM, GCS, IAM, Cloud Functions, Dataproc, GKE, GCR, Cloud NAT

RECOMMENDATIONS

"Karan builds pragmatic and reliable data platforms with impressive speed and precision. He communicates clearly, mentors his peers, and consistently raises the bar for quality and delivery. His work is focused, structured, and strategically executed, even in complex scenarios. I remember working with him on a critical finance data logic project, despite the complexity, he took full ownership and delivered an exceptional solution on time."

"His cloud-agnostic approach and automation mindset significantly accelerated our delivery timelines while enhancing data quality and observability. We were able to deploy infrastructure for multiple clients across different cloud environments seamlessly within just 1-2 weeks. His strong technical and coding skills truly set him apart and made a measurable difference to our success."

EXPERIENCE

Nov 2024 - Present

Data & AI Engineer

Ras Al Khaimah Economic Zone (RAKEZ), UAE

  • Designed reusable ingestion & quality framework reducing manual build effort by ~40%
  • Shipped Python logging utility to improve observability across data products
  • Architected data lake integrating SAP ERP/HANA, Salesforce, BigQuery, AWS AppFlow
  • Built GenAI pipelines for Customer 360, feedback analysis, PII masking, auto-reporting
  • Enabled analysts and scientists with reliable, governed datasets
Oct 2023 - Nov 2024

Senior Data Engineer

Boston Consulting Group (BCGX), India

  • Spark cluster lifecycle mgmt accross clouds (EMR/Databricks/Dataproc): start/stop, job submission, error states, bootstrap, libs
  • 90%+ coverage with unit/integration tests and HTML reports
  • Pre-commit, Black, Flake8, isort; coverage badges and docs on Pages
  • CI/CD via GitHub Actions; artifacts (wheel/tar) for remote cluster installs
Apr 2021 - Oct 2023

Data Engineer

Boston Consulting Group (BCGX), India

  • Analyzed email engagement (CTR/CTOR/OR/CR), RFM and omni-channel funnels
  • Stood up cloud agnostic infra with Terraform + K8s (EKS/AKS/GKE), Airflow on K8s via Helm
  • Built CI/CD with Actions incl. lint, tests, artifact deploys, security scans
Aug 2019 - Apr 2021

Data Engineer

To The New Digital (TTND), India

  • Landing -> Staging -> Warehouse pipelines with robust cleaning and metadata mgmt
  • Global customer records with SCD1/2, star and snowflake schema designs, data mart creation
  • Airflow & NiFi orchestration with monitoring and recovery
Feb 2019 - Jul 2019

Big Data Trainee

To The New Digital (TTND), India

  • Migrated partitioned/bucketed/external tables using regex-driven automation
  • Integrated business APIs to populate structured warehouse layers

EDUCATION

2015 - 2019

Bachelor of Technology

Computer Science & Engineering

APJ Abdul Kalam Technical University, India

Core focus on software engineering, data structures, analytics, and systems design.

2011 - 2015

Schooling [10+2]

Puranchandra Vidyaniketan, Kanpur

Science coursework emphasizing mathematics, physics, and problem-solving.

COMPANIES & CLIENTS

RAKEZ logo RAKEZ
Boston Consulting Group logo Boston Consulting Group
To The New logo To The New Digital
Meta logo Meta
AWS logo AWS
Airtel logo Airtel
Alteryx logo Alteryx
The Clorox Company logo The Clorox Company
Salesforce logo Salesforce
SAP logo SAP
Nordstrom logo Nordstrom
Diageo logo Diageo
Bridgestone logo Bridgestone

CERTIFICATIONS

Databricks logo

Databricks Certified GenAI Associate

Databricks

Databricks logo

Databricks Certified Machine Learning Associate

Databricks

Cloudera logo

Cloudera Certified Spark & Hadoop Developer

Cloudera

Databricks logo

Databricks Certified Associate Spark 3.0 Developer

Databricks

Databricks logo

Databricks Certified Associate Data Analyst

Databricks

Databricks logo

Databricks Certified Associate Data Engineer

Databricks

Databricks logo

Databricks Certified Professional Data Engineer

Databricks

Databricks logo

Databricks Lakehouse Fundamentals

Databricks

Databricks logo

Databricks GenAI Fundamentals

Databricks

Hackerrank logo

Hackerrank Gold Badge - Python

Hackerrank

Hackerrank logo

Hackerrank Gold Badge - SQL

Hackerrank

PROJECTS

Customer 360 Analytics (Retail)

End-to-end Customer 360 model (5TB+) covering email engagement (CTR/CTOR/OR/CR), RFM, and omni-channel analysis. Built with scalable ETL and robust governance.

Cloud-Agnostic Spark Job Framework

Python library to submit/monitor Spark jobs remotely on AWS/Azure/GCP/Databricks, managing full cluster lifecycle, CI/CD, and artifacts.

GenAI Data Engineering Pipelines

GenAI-assisted pipelines for Customer 360 insights, PII detection/masking, automated report generation, and feedback analysis.

Real Estate Monthly Analysis Generator

Scrapes, processes, and visualizes US real estate data monthly, built with BeautifulSoup, PySpark, and reporting automation.

Organic Farming Intelligence App

Streamlit app that aggregates and analyzes agri data using PySpark and GPT-4o to produce farmer-ready insights.

AI-driven WhatsApp Portfolio Bot

Conversational portfolio experience using OpenAI APIs with WhatsApp integration for interactive Q&A.

Microservice Transactional Website

Flask API + React frontend + MySQL, deployed on AWS EKS with ALB, CloudWatch, IAM, VPC networking, and Kubernetes best practices.

DAMA & Metadata Framework

Enterprise pivot to governed, reusable data pipelines: ingestion, data quality, standardized logging, and observability patterns for scale.

COMMUNITY IMPACT

Mentoring & Community

Helped 500+ students from tech/non-tech backgrounds in DevOps, Data Engineering, and GenAI across LinkedIn, GitHub, Fiverr and more.

Mock Interviews

Conducted 200+ mock interviews for candidates targeting Data Engineering/DevOps/AI roles at BCG, McKinsey, Bain, Walmart, EY, Deloitte, KPMG, TCS, Infosys, Wipro and more.

Freelance Gigs

Created 50+ Fiverr gigs, delivered for 350+ global clients with 5-star reviews, spanning AWS, Azure, GCP, Databricks, Snowflake, Docker, K8s, Terraform, Python/PySpark/SQL.

Feel free to connect!

Enjoy your wonderful day!