Projects

A collection of projects I've worked on, showcasing my skills and experience.

Real Estate Analytics App
Featured

A full data science pipeline for property price prediction in Gurgaon.

Flask
Scikit-learn
Pandas
NumPy

Tourism AI
Featured

AI-powered tourism analytics platform delivering real-time insights from dashboards and OCR-driven intelligence.

Flask
Power BI
AI
OCR
Python

Global Tourism & Holiday Analytics Dashboard
Featured

Interactive global tourism dashboard visualizing trends across 232 countries and territories with advanced data modeling.

Power BI
Data Modeling
Data Visualization
Analytics

Sales Data Forecasting

Analyzed historical sales data to uncover patterns and support data-driven business decisions.

Python
Pandas
Matplotlib
Seaborn

Movie Recommendation System

A content-based filtering system that recommends movies using NLP and cosine similarity.
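As a rough illustration of the idea, here is a minimal sketch of content-based filtering with cosine similarity. It is not the project's code: it uses plain Python bag-of-words counts instead of Scikit-learn, and the `MOVIES` catalogue, the `vectorize` and `recommend` helpers, and all titles and descriptions are hypothetical.

```python
import math
import re
from collections import Counter

# Hypothetical catalogue: titles and descriptions are invented for illustration.
MOVIES = {
    "Space Quest": "astronauts explore a distant planet in deep space",
    "Star Voyage": "a crew of astronauts travel through deep space",
    "Love in Paris": "two strangers fall in love in paris",
}


def vectorize(text):
    """Turn text into a bag-of-words term-count vector."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse term-count vectors."""
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def recommend(title, movies=MOVIES, top_n=2):
    """Rank every other movie by description similarity to the given title."""
    target = vectorize(movies[title])
    scores = [
        (other, cosine_similarity(target, vectorize(text)))
        for other, text in movies.items()
        if other != title
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)[:top_n]
```

Calling `recommend("Space Quest")` ranks "Star Voyage" first, because the two descriptions share the most terms. A real system would likely weight terms (for example with TF-IDF) so that common words such as "in" count for less.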

Python
NLP
Scikit-learn
Cosine Similarity

IPL Dream11 ETL Pipeline

A production-style ETL pipeline that processes IPL ball-by-ball data from AWS RDS, applies Dream11 fantasy point rules using Python and Pandas, and loads structured results back into a target database.
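The extract, transform, and load steps can be sketched roughly as follows. This is an illustration under stated assumptions, not the project's code: an in-memory SQLite database stands in for AWS RDS, plain Python stands in for Pandas, the `deliveries` and `fantasy_points` tables and their columns are hypothetical, and the point values are invented rather than the official Dream11 rules.

```python
import sqlite3
from collections import defaultdict

# Hypothetical scoring rules for illustration only; not the official Dream11 values.
POINTS_PER_RUN = 1
BOUNDARY_BONUS = {4: 1, 6: 2}
POINTS_PER_WICKET = 25


def extract(conn):
    """Read ball-by-ball rows from the (assumed) source table."""
    return conn.execute(
        "SELECT batter, bowler, runs, is_wicket FROM deliveries"
    ).fetchall()


def transform(rows):
    """Apply the scoring rules to each delivery and total the points per player."""
    points = defaultdict(int)
    for batter, bowler, runs, is_wicket in rows:
        points[batter] += runs * POINTS_PER_RUN + BOUNDARY_BONUS.get(runs, 0)
        if is_wicket:
            points[bowler] += POINTS_PER_WICKET
    return dict(points)


def load(conn, points):
    """Write the per-player totals into the (assumed) target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS fantasy_points "
        "(player TEXT PRIMARY KEY, points INTEGER)"
    )
    conn.executemany(
        "INSERT OR REPLACE INTO fantasy_points VALUES (?, ?)",
        sorted(points.items()),
    )
    conn.commit()


def demo():
    """Run the three steps end to end on a tiny invented dataset."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE deliveries "
        "(batter TEXT, bowler TEXT, runs INTEGER, is_wicket INTEGER)"
    )
    conn.executemany(
        "INSERT INTO deliveries VALUES (?, ?, ?, ?)",
        [
            ("Batter A", "Bowler X", 4, 0),
            ("Batter A", "Bowler X", 1, 0),
            ("Batter B", "Bowler X", 0, 1),
            ("Batter A", "Bowler Y", 6, 0),
        ],
    )
    load(conn, transform(extract(conn)))
    return dict(conn.execute("SELECT player, points FROM fantasy_points"))
```

Keeping the scoring rules in `transform`, separate from the database code, means the rules can be tested on plain rows without a database connection.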

Python
Pandas
AWS RDS (MySQL)
SQLAlchemy

Weather Data Integration Pipeline

An end-to-end ETL pipeline that extracts real-time weather data from the OpenWeatherMap API, transforms it using Apache Spark, and loads it into PostgreSQL.

Python
Apache Spark
PostgreSQL
Apache Airflow
PySpark
Flask

Customer Review Analysis - British Airways

Conducted text analysis on 3,400+ customer reviews to extract insights on sentiment and service quality.

Python
Web Scraping
Pandas
NLP

AQA: Air Quality Analysis

Analyzed real-time air quality data to study pollutant patterns across Indian states using preprocessing, visualization, and statistical analysis.

Python
Pandas
Matplotlib
Seaborn