On the second day, the task was to create a visualization using the U.S. International Air Passenger Dataset. This dataset contains daily flight departure statistics (from 1990 to 2022) from the U.S. to international airports, along with information on the airlines, chartered/scheduled passengers, and freight numbers. It should be noted that not every flight will have passengers or freight, and some flights will have both.

My main interest was to analyze the trend of flights over time, and whether they are increasing or decreasing. Therefore, I started by analyzing the scheduled passenger data. I created a line chart to observe changes over time, and then a bar chart to identify the top 5 flights by passengers and utilization rate. These flights are becoming increasingly popular, so investing in them would be beneficial. Conversely, we also examined the least popular flights by passengers, some of which have had only one passenger over the past 20 years. It may be feasible to discontinue some of these flights, as they are unlikely to generate substantial profits.

Figure1:Scheduled Passengers

Next, we looked at chartered passengers to determine which flights are more popular among the wealthy. Similarly, we examined the trend over time, as well as the top and bottom flights.

The Data School
Author: The Data School