Projects

3 projects

Yelp Batch ETL Pipeline — Business Analytics at Scale

· Data EngineeringApache SparkScalaAirflowMongoDBDelta LakePostgreSQLDockerETL PipelineBig Data

A batch ETL pipeline that processes 9.3 GB of Yelp business data to generate analytics and insights about business performance, customer reviews, and popularity trends.

View project →

NBA ML Pipeline — Predict Player Performance

· NBASports AnalyticsMachine LearningData EngineeringPythonGoogle CloudDocker

An end-to-end, reproducible machine learning pipeline to forecast NBA player points from historical data. The project combines data engineering 🏗️, feature engineering 🧩, and ML...

View project →