This project utilizes a dataset from Kaggle, comprising detailed flight ticket data with over 6 million records captured from Expedia. The dataset represents a wide array of flights to and from key airports like ATL, DFW, DEN, ORD, LAX, CLT, MIA, JFK, EWR, SFO, DTW, BOS, PHL, LGA, and IAD, recorded between April 2022 and November 2022. It includes comprehensive flight information such as the flight and search dates, starting and destination airports, fare details, travel duration, and seat availability, among others.
In this analysis, the focus narrows to examining flights arriving at John F. Kennedy International Airport (JFK). JFK is selected for its prominence as a major international and domestic hub, offering a rich and diverse dataset to analyze. By concentrating on JFK, the aim is to gain deeper insights into the flight patterns and pricing dynamics specific to this significant airport.
The aim of this project was to analyze and understand the complexities of flight pricing and demand at one of the United States' busiest transport hubs, JFK Airport. Specifically, the objectives were:
These objectives were established to provide actionable insights into airline pricing strategies, consumer demand patterns, and the overall economic landscape of airline travel to JFK Airport.
Before diving into the analysis, the data was preprocessed with Python Dask and SQL, and was reduced from over 6 million records to 637,000 entries. This stage was critical to ensure the accuracy and usability of the data, tailored specifically to the needs of this project. The process involved cleaning, transforming, and enriching the dataset to facilitate a detailed and meaningful analysis. Exploratory data analysis (EDA) was employed to understand the data's structure, identify patterns, and assess the distribution of key variables.
With the data refined and structured, Tableau was employed to visualize the data. This phase was about translating complex datasets into intuitive graphics, enabling the identification of clear trends, patterns, and outliers. The visualizations aimed to provide a compelling narrative of the flight market trends and behaviors at JFK Airport.
The analysis offers a comprehensive look at the airline market's behavior towards JFK flights, exploring regional demand, fare variation by weekday, carrier dominance, and class-based pricing strategies. It reveals how airlines adjust to market shifts and passenger preferences. This overview is not just a snapshot of current operations; it's a predictive guide for future industry movements, equipping stakeholders with the insights for informed decision-making.