NYC Smart Traffic Pipeline
This project implements a scalable data engineering platform that processes both historical and real-time traffic-related data to analyze road incidents, weather impact, and urban mobility patterns.
Building scalable data pipelines and turning raw data into clear business value. Passionate about ETL architecture, big data systems, cloud-based workflows, and analytics-ready platforms.
I am Ahmed Refat, an aspiring Data Engineer with a strong focus on building scalable and efficient data systems. I have hands-on experience designing data workflows using Python, SQL, PySpark, Airflow, and modern data tools. Through my projects, I work on ETL pipelines, streaming workflows, and analytics-ready data models that transform raw data into clear business insight.
Kafr El-Sheikh University, Faculty of Computers and Information – Information Systems Department
Graduation: May 2023
This project implements a scalable data engineering platform that processes both historical and real-time traffic-related data to analyze road incidents, weather impact, and urban mobility patterns.
This project designs and implements a scalable data platform for flight and airline analytics using both batch and real-time processing across major New York airports, integrating flight, delay, airline, and weather data.
This pipeline is an end-to-end system that extracts data from MongoDB and CSV files, processes it using Airflow and PySpark, and stores it in PostgreSQL using a star schema. It runs in a Dockerized environment for scalability and easy deployment.