5. Data Frames#
Data frames are the premier data structure for working with tabular data. In Julia, the DataFrames.jl package provides data frames and functions to work with them. Besides reading this chapter, you can learn more about how to use the package from the official documentation and cheat sheet.
This chapter uses the Airline On-Time Performance Data Set introduced in Section 4.
5.1. Inspecting#
When you load a data set, it’s a good idea to inspect it to make sure it was loaded correctly and contains the data you expect. Julia and DataFrames.jl provide several functions that are helpful for inspecting data frames:
describe
to get a summaryfirst
,last
to get the first or last n rowsnrow
,ncol
,size
,ndims
to get dimension informationnames
to get column namestypeof
,eltype
to get types
Let’s take a look at the first 5 rows of the air
data to refresh our memory:
first(air, 5)
Row | Year | Quarter | Month | DayofMonth | DayOfWeek | FlightDate | Reporting_Airline | DOT_ID_Reporting_Airline | IATA_CODE_Reporting_Airline | Tail_Number | Flight_Number_Reporting_Airline | OriginAirportID | OriginAirportSeqID | OriginCityMarketID | Origin | OriginCityName | OriginState | OriginStateFips | OriginStateName | OriginWac | DestAirportID | DestAirportSeqID | DestCityMarketID | Dest | DestCityName | DestState | DestStateFips | DestStateName | DestWac | CRSDepTime | DepTime | DepDelay | DepDelayMinutes | DepDel15 | DepartureDelayGroups | DepTimeBlk | TaxiOut | WheelsOff | WheelsOn | TaxiIn | CRSArrTime | ArrTime | ArrDelay | ArrDelayMinutes | ArrDel15 | ArrivalDelayGroups | ArrTimeBlk | Cancelled | CancellationCode | Diverted | CRSElapsedTime | ActualElapsedTime | AirTime | Flights | Distance | DistanceGroup | CarrierDelay | WeatherDelay | NASDelay | SecurityDelay | LateAircraftDelay | FirstDepTime | TotalAddGTime | LongestAddGTime | DivAirportLandings | DivReachedDest | DivActualElapsedTime | DivArrDelay | DivDistance | Div1Airport | Div1AirportID | Div1AirportSeqID | Div1WheelsOn | Div1TotalGTime | Div1LongestGTime | Div1WheelsOff | Div1TailNum | Div2Airport | Div2AirportID | Div2AirportSeqID | Div2WheelsOn | Div2TotalGTime | Div2LongestGTime | Div2WheelsOff | Div2TailNum | Div3Airport | Div3AirportID | Div3AirportSeqID | Div3WheelsOn | Div3TotalGTime | Div3LongestGTime | Div3WheelsOff | Div3TailNum | Div4Airport | Div4AirportID | Div4AirportSeqID | Div4WheelsOn | Div4TotalGTime | Div4LongestGTime | Div4WheelsOff | ⋯ |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Int64 | Int64 | Int64 | Int64 | Int64 | Date | String3 | Int64 | String3 | String7 | Int64 | Int64 | Int64 | Int64 | String3 | String | String3 | Int64 | String | Int64 | Int64 | Int64 | Int64 | String3 | String | String3 | Int64 | String | Int64 | Int64 | Int64? | Float64? | Float64? | Float64? | Int64? | String15 | Float64? | Int64? | Int64? | Float64? | Int64 | Int64? | Float64? | Float64? | Float64? | Int64? | String15 | Float64 | String3? | Float64 | Float64? | Float64? | Float64? | Float64 | Float64 | Int64 | Float64? | Float64? | Float64? | Float64? | Float64? | Int64? | Float64? | Float64? | Int64 | Float64? | Float64? | Float64? | Float64? | String3? | Int64? | Int64? | Int64? | Float64? | Float64? | Int64? | String7? | String3? | Int64? | Int64? | Int64? | Float64? | Float64? | Int64? | String7? | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | ⋯ | |
1 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | 9E | 20363 | 9E | N605LR | 4628 | 10529 | 1052907 | 30529 | BDL | Hartford, CT | CT | 9 | Connecticut | 11 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 800 | 757 | -3.0 | 0.0 | 0.0 | -1 | 0800-0859 | 11.0 | 808 | 833 | 20.0 | 905 | 853 | -12.0 | 0.0 | 0.0 | -1 | 0900-0959 | 0.0 | missing | 0.0 | 65.0 | 56.0 | 25.0 | 1.0 | 101.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
2 | 2023 | 1 | 1 | 3 | 2 | 2023-01-03 | 9E | 20363 | 9E | N605LR | 4628 | 10529 | 1052907 | 30529 | BDL | Hartford, CT | CT | 9 | Connecticut | 11 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 800 | 755 | -5.0 | 0.0 | 0.0 | -1 | 0800-0859 | 19.0 | 814 | 851 | 6.0 | 905 | 857 | -8.0 | 0.0 | 0.0 | -1 | 0900-0959 | 0.0 | missing | 0.0 | 65.0 | 62.0 | 37.0 | 1.0 | 101.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
3 | 2023 | 1 | 1 | 4 | 3 | 2023-01-04 | 9E | 20363 | 9E | N331PQ | 4628 | 10529 | 1052907 | 30529 | BDL | Hartford, CT | CT | 9 | Connecticut | 11 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 800 | 755 | -5.0 | 0.0 | 0.0 | -1 | 0800-0859 | 14.0 | 809 | 837 | 7.0 | 905 | 844 | -21.0 | 0.0 | 0.0 | -2 | 0900-0959 | 0.0 | missing | 0.0 | 65.0 | 49.0 | 28.0 | 1.0 | 101.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
4 | 2023 | 1 | 1 | 5 | 4 | 2023-01-05 | 9E | 20363 | 9E | N906XJ | 4628 | 10529 | 1052907 | 30529 | BDL | Hartford, CT | CT | 9 | Connecticut | 11 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 800 | 754 | -6.0 | 0.0 | 0.0 | -1 | 0800-0859 | 13.0 | 807 | 845 | 3.0 | 905 | 848 | -17.0 | 0.0 | 0.0 | -2 | 0900-0959 | 0.0 | missing | 0.0 | 65.0 | 54.0 | 38.0 | 1.0 | 101.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
5 | 2023 | 1 | 1 | 6 | 5 | 2023-01-06 | 9E | 20363 | 9E | N337PQ | 4628 | 10529 | 1052907 | 30529 | BDL | Hartford, CT | CT | 9 | Connecticut | 11 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 800 | 759 | -1.0 | 0.0 | 0.0 | -1 | 0800-0859 | 17.0 | 816 | 844 | 5.0 | 905 | 849 | -16.0 | 0.0 | 0.0 | -2 | 0900-0959 | 0.0 | missing | 0.0 | 65.0 | 50.0 | 28.0 | 1.0 | 101.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
The size
function returns the number of rows and columns in a data frame (you
can use nrow
and ncol
to get these individually):
size(air)
(538837, 110)
The names
function returns the names of a data frame’s columns:
names(air)
110-element Vector{String}:
"Year"
"Quarter"
"Month"
"DayofMonth"
"DayOfWeek"
"FlightDate"
"Reporting_Airline"
"DOT_ID_Reporting_Airline"
"IATA_CODE_Reporting_Airline"
"Tail_Number"
"Flight_Number_Reporting_Airline"
"OriginAirportID"
"OriginAirportSeqID"
⋮
"Div4LongestGTime"
"Div4WheelsOff"
"Div4TailNum"
"Div5Airport"
"Div5AirportID"
"Div5AirportSeqID"
"Div5WheelsOn"
"Div5TotalGTime"
"Div5LongestGTime"
"Div5WheelsOff"
"Div5TailNum"
"Column110"
Tip
By default, Julia tries to make printed output fit on the screen. This is
unhelpful when you want to see all of a particular data structure. You can use
the print
function to make Julia show everything. For instance, try this
code:
print(names(air))
One way to characterize a data frame is by the types of elements in its
columns. In a Julia data frame, columns are generally Vectors, and the eltype
function gets the element type(s) of a Vector. To get the element types for all
columns, use the eachcol
function to get an iterator over the columns, and
then broadcast eltype
over the iterator:
eltype.(eachcol(air))
110-element Vector{Type}:
Int64
Int64
Int64
Int64
Int64
Dates.Date
String3
Int64
String3
String7
Int64
Int64
Int64
⋮
Missing
Missing
Missing
Missing
Missing
Missing
Missing
Missing
Missing
Missing
Missing
Missing
We can make this result easier to read by putting it in a data frame with the
column names (and possibly other summary information). The constructor function
DataFrame
makes a new data frame:
air_types = DataFrame(name = names(air), type = eltype.(eachcol(air)))
air_types
Row | name | type |
---|---|---|
String | Type | |
1 | Year | Int64 |
2 | Quarter | Int64 |
3 | Month | Int64 |
4 | DayofMonth | Int64 |
5 | DayOfWeek | Int64 |
6 | FlightDate | Date |
7 | Reporting_Airline | String3 |
8 | DOT_ID_Reporting_Airline | Int64 |
9 | IATA_CODE_Reporting_Airline | String3 |
10 | Tail_Number | String7 |
11 | Flight_Number_Reporting_Airline | Int64 |
12 | OriginAirportID | Int64 |
13 | OriginAirportSeqID | Int64 |
⋮ | ⋮ | ⋮ |
99 | Div4LongestGTime | Missing |
100 | Div4WheelsOff | Missing |
101 | Div4TailNum | Missing |
102 | Div5Airport | Missing |
103 | Div5AirportID | Missing |
104 | Div5AirportSeqID | Missing |
105 | Div5WheelsOn | Missing |
106 | Div5TotalGTime | Missing |
107 | Div5LongestGTime | Missing |
108 | Div5WheelsOff | Missing |
109 | Div5TailNum | Missing |
110 | Column110 | Missing |
5.2. Indexing#
Data frames use square brackets [ ]
for indexing (like most other data
structures in Julia). Since data frames are two-dimensional, two indexes are
required. The following subsections describe different kinds of indexes you can
use, as well as some other ways to get data out of a data frame.
5.2.1. By Position#
You can use integer arguments to select elements by position. For example, to
extract the value in row 2, column 1 of the air
data frame:
air[2, 1]
2023
The first argument is the row index, while the second is the column index.
As with other data structures, you can also use an array of indexes to select multiple values. For instance, to get rows 1, 3, and 1 again from column 5:
air[[1, 3, 1], 5]
You can also use a slice to select a range of values. For instance, to select the values in the first 3 rows, column 5:
air[1:3, 5]
3-element Vector{Int64}:
1
2
3
Tip
You can use the end
keyword in a slice to mean the last element. The end
keyword can be combined with arithmetic operators. For example, to get the last
2 rows, column 5:
air[end-1:end, 5]
In DataFrames.jl, there are two different ways to indicate that you want all of
the elements along a dimension. A :
selects all elements and returns a
copy, while a !
selects all elements and returns a view (or reference).
It’s generally safer to use a copy (especially if you’re going to modify the
data), but more CPU- and memory-efficient to use a view. Here are examples of
both:
year = air[!, 1]
year_copy = air[:, 1]
538837-element Vector{Int64}:
2023
2023
2023
2023
2023
2023
2023
2023
2023
2023
2023
2023
2023
⋮
2023
2023
2023
2023
2023
2023
2023
2023
2023
2023
2023
2023
Caution
Returning a view with !
is only possible if the resulting data are contiguous
in the original data frame.
Tip
More generally, you can use the @view
macro to get a view based on indexing
even when you don’t want all elements along an axis. For example:
@view air[1, 1]
You can combine indexing with assignment (=
) to reassign specific elements of
a data frame. Note that if you reassign elements of a view, the elements will
also change in the original data frame.
5.2.2. By Name#
You can use String arguments to select elements by name. For instance, to
select row 1 of the Year
column:
air[1, "Year"]
2023
You can also use Symbol arguments to select elements by name. In Julia, you can
write a literal Symbol by putting a colon :
in front of text. For instance,
to select row 1 of the Year
column:
air[1, :Year]
2023
Indexing with Symbols is faster than indexing with Strings, so use Symbols when possible.
As with positional indexes, you can use arrays of indexes to select multiple elements. For example:
air[1:3, [:Year, :Month, :DayofMonth]]
Row | Year | Month | DayofMonth |
---|---|---|---|
Int64 | Int64 | Int64 | |
1 | 2023 | 1 | 2 |
2 | 2023 | 1 | 3 |
3 | 2023 | 1 | 4 |
Selection by name is primarily used for columns, since rows usually don’t have
names. If you want to select an entire column, there are two more ways to do
it besides [ ]
: attribute access (.NAME
) and the select
function. As an
example, here are three ways to select the entire DayofMonth
column:
air[:, "DayofMonth"]
air.DayofMonth
select(air, :DayofMonth)
Row | DayofMonth |
---|---|
Int64 | |
1 | 2 |
2 | 3 |
3 | 4 |
4 | 5 |
5 | 6 |
6 | 7 |
7 | 14 |
8 | 21 |
9 | 28 |
10 | 9 |
11 | 10 |
12 | 11 |
13 | 12 |
⋮ | ⋮ |
538826 | 2 |
538827 | 2 |
538828 | 2 |
538829 | 2 |
538830 | 2 |
538831 | 2 |
538832 | 2 |
538833 | 2 |
538834 | 2 |
538835 | 2 |
538836 | 2 |
538837 | 2 |
Note that the first two return arrays, while select
returns a data frame.
5.2.3. By Condition#
The indexing operator [ ]
also accepts arrays of Boolean values, to
facilitate getting elements based on a condition. For example, suppose you want
to get all rows where DayofMonth
is less than 15
. You can test for these
rows with this condition:
air.DayofMonth .< 15
538837-element BitVector:
1
1
1
1
1
1
1
0
0
1
1
1
1
⋮
1
1
1
1
1
1
1
1
1
1
1
1
And the code to actually get the rows is:
air[air.DayofMonth .< 15, :]
Row | Year | Quarter | Month | DayofMonth | DayOfWeek | FlightDate | Reporting_Airline | DOT_ID_Reporting_Airline | IATA_CODE_Reporting_Airline | Tail_Number | Flight_Number_Reporting_Airline | OriginAirportID | OriginAirportSeqID | OriginCityMarketID | Origin | OriginCityName | OriginState | OriginStateFips | OriginStateName | OriginWac | DestAirportID | DestAirportSeqID | DestCityMarketID | Dest | DestCityName | DestState | DestStateFips | DestStateName | DestWac | CRSDepTime | DepTime | DepDelay | DepDelayMinutes | DepDel15 | DepartureDelayGroups | DepTimeBlk | TaxiOut | WheelsOff | WheelsOn | TaxiIn | CRSArrTime | ArrTime | ArrDelay | ArrDelayMinutes | ArrDel15 | ArrivalDelayGroups | ArrTimeBlk | Cancelled | CancellationCode | Diverted | CRSElapsedTime | ActualElapsedTime | AirTime | Flights | Distance | DistanceGroup | CarrierDelay | WeatherDelay | NASDelay | SecurityDelay | LateAircraftDelay | FirstDepTime | TotalAddGTime | LongestAddGTime | DivAirportLandings | DivReachedDest | DivActualElapsedTime | DivArrDelay | DivDistance | Div1Airport | Div1AirportID | Div1AirportSeqID | Div1WheelsOn | Div1TotalGTime | Div1LongestGTime | Div1WheelsOff | Div1TailNum | Div2Airport | Div2AirportID | Div2AirportSeqID | Div2WheelsOn | Div2TotalGTime | Div2LongestGTime | Div2WheelsOff | Div2TailNum | Div3Airport | Div3AirportID | Div3AirportSeqID | Div3WheelsOn | Div3TotalGTime | Div3LongestGTime | Div3WheelsOff | Div3TailNum | Div4Airport | Div4AirportID | Div4AirportSeqID | Div4WheelsOn | Div4TotalGTime | Div4LongestGTime | Div4WheelsOff | ⋯ |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Int64 | Int64 | Int64 | Int64 | Int64 | Date | String3 | Int64 | String3 | String7 | Int64 | Int64 | Int64 | Int64 | String3 | String | String3 | Int64 | String | Int64 | Int64 | Int64 | Int64 | String3 | String | String3 | Int64 | String | Int64 | Int64 | Int64? | Float64? | Float64? | Float64? | Int64? | String15 | Float64? | Int64? | Int64? | Float64? | Int64 | Int64? | Float64? | Float64? | Float64? | Int64? | String15 | Float64 | String3? | Float64 | Float64? | Float64? | Float64? | Float64 | Float64 | Int64 | Float64? | Float64? | Float64? | Float64? | Float64? | Int64? | Float64? | Float64? | Int64 | Float64? | Float64? | Float64? | Float64? | String3? | Int64? | Int64? | Int64? | Float64? | Float64? | Int64? | String7? | String3? | Int64? | Int64? | Int64? | Float64? | Float64? | Int64? | String7? | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | ⋯ | |
1 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | 9E | 20363 | 9E | N605LR | 4628 | 10529 | 1052907 | 30529 | BDL | Hartford, CT | CT | 9 | Connecticut | 11 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 800 | 757 | -3.0 | 0.0 | 0.0 | -1 | 0800-0859 | 11.0 | 808 | 833 | 20.0 | 905 | 853 | -12.0 | 0.0 | 0.0 | -1 | 0900-0959 | 0.0 | missing | 0.0 | 65.0 | 56.0 | 25.0 | 1.0 | 101.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
2 | 2023 | 1 | 1 | 3 | 2 | 2023-01-03 | 9E | 20363 | 9E | N605LR | 4628 | 10529 | 1052907 | 30529 | BDL | Hartford, CT | CT | 9 | Connecticut | 11 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 800 | 755 | -5.0 | 0.0 | 0.0 | -1 | 0800-0859 | 19.0 | 814 | 851 | 6.0 | 905 | 857 | -8.0 | 0.0 | 0.0 | -1 | 0900-0959 | 0.0 | missing | 0.0 | 65.0 | 62.0 | 37.0 | 1.0 | 101.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
3 | 2023 | 1 | 1 | 4 | 3 | 2023-01-04 | 9E | 20363 | 9E | N331PQ | 4628 | 10529 | 1052907 | 30529 | BDL | Hartford, CT | CT | 9 | Connecticut | 11 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 800 | 755 | -5.0 | 0.0 | 0.0 | -1 | 0800-0859 | 14.0 | 809 | 837 | 7.0 | 905 | 844 | -21.0 | 0.0 | 0.0 | -2 | 0900-0959 | 0.0 | missing | 0.0 | 65.0 | 49.0 | 28.0 | 1.0 | 101.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
4 | 2023 | 1 | 1 | 5 | 4 | 2023-01-05 | 9E | 20363 | 9E | N906XJ | 4628 | 10529 | 1052907 | 30529 | BDL | Hartford, CT | CT | 9 | Connecticut | 11 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 800 | 754 | -6.0 | 0.0 | 0.0 | -1 | 0800-0859 | 13.0 | 807 | 845 | 3.0 | 905 | 848 | -17.0 | 0.0 | 0.0 | -2 | 0900-0959 | 0.0 | missing | 0.0 | 65.0 | 54.0 | 38.0 | 1.0 | 101.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
5 | 2023 | 1 | 1 | 6 | 5 | 2023-01-06 | 9E | 20363 | 9E | N337PQ | 4628 | 10529 | 1052907 | 30529 | BDL | Hartford, CT | CT | 9 | Connecticut | 11 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 800 | 759 | -1.0 | 0.0 | 0.0 | -1 | 0800-0859 | 17.0 | 816 | 844 | 5.0 | 905 | 849 | -16.0 | 0.0 | 0.0 | -2 | 0900-0959 | 0.0 | missing | 0.0 | 65.0 | 50.0 | 28.0 | 1.0 | 101.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
6 | 2023 | 1 | 1 | 7 | 6 | 2023-01-07 | 9E | 20363 | 9E | N336PQ | 4628 | 10529 | 1052907 | 30529 | BDL | Hartford, CT | CT | 9 | Connecticut | 11 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 800 | 750 | -10.0 | 0.0 | 0.0 | -1 | 0800-0859 | 17.0 | 807 | 845 | 7.0 | 905 | 852 | -13.0 | 0.0 | 0.0 | -1 | 0900-0959 | 0.0 | missing | 0.0 | 65.0 | 62.0 | 38.0 | 1.0 | 101.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
7 | 2023 | 1 | 1 | 14 | 6 | 2023-01-14 | 9E | 20363 | 9E | N311PQ | 4628 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 11193 | 1119302 | 33105 | CVG | Cincinnati, OH | KY | 21 | Kentucky | 52 | 1500 | 1452 | -8.0 | 0.0 | 0.0 | -1 | 1500-1559 | 26.0 | 1518 | 1643 | 6.0 | 1720 | 1649 | -31.0 | 0.0 | 0.0 | -2 | 1700-1759 | 0.0 | missing | 0.0 | 140.0 | 117.0 | 85.0 | 1.0 | 585.0 | 3 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
8 | 2023 | 1 | 1 | 9 | 1 | 2023-01-09 | 9E | 20363 | 9E | N491PX | 4629 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 10577 | 1057705 | 30577 | BGM | Binghamton, NY | NY | 36 | New York | 22 | 2129 | 2122 | -7.0 | 0.0 | 0.0 | -1 | 2100-2159 | 25.0 | 2147 | 2220 | 5.0 | 2228 | 2225 | -3.0 | 0.0 | 0.0 | -1 | 2200-2259 | 0.0 | missing | 0.0 | 59.0 | 63.0 | 33.0 | 1.0 | 147.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
9 | 2023 | 1 | 1 | 10 | 2 | 2023-01-10 | 9E | 20363 | 9E | N478PX | 4629 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 10577 | 1057705 | 30577 | BGM | Binghamton, NY | NY | 36 | New York | 22 | 2129 | 2114 | -15.0 | 0.0 | 0.0 | -1 | 2100-2159 | 45.0 | 2159 | 2230 | 4.0 | 2228 | 2234 | 6.0 | 6.0 | 0.0 | 0 | 2200-2259 | 0.0 | missing | 0.0 | 59.0 | 80.0 | 31.0 | 1.0 | 147.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
10 | 2023 | 1 | 1 | 11 | 3 | 2023-01-11 | 9E | 20363 | 9E | N135EV | 4629 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 10577 | 1057705 | 30577 | BGM | Binghamton, NY | NY | 36 | New York | 22 | 2129 | 2204 | 35.0 | 35.0 | 1.0 | 2 | 2100-2159 | 46.0 | 2250 | 2326 | 3.0 | 2228 | 2329 | 61.0 | 61.0 | 1.0 | 4 | 2200-2259 | 0.0 | missing | 0.0 | 59.0 | 85.0 | 36.0 | 1.0 | 147.0 | 1 | 0.0 | 0.0 | 26.0 | 0.0 | 35.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
11 | 2023 | 1 | 1 | 12 | 4 | 2023-01-12 | 9E | 20363 | 9E | N197PQ | 4629 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 10577 | 1057705 | 30577 | BGM | Binghamton, NY | NY | 36 | New York | 22 | 2129 | 2341 | 132.0 | 132.0 | 1.0 | 8 | 2100-2159 | 16.0 | 2357 | 35 | 5.0 | 2228 | 40 | 132.0 | 132.0 | 1.0 | 8 | 2200-2259 | 0.0 | missing | 0.0 | 59.0 | 59.0 | 38.0 | 1.0 | 147.0 | 1 | 50.0 | 0.0 | 0.0 | 0.0 | 82.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
12 | 2023 | 1 | 1 | 13 | 5 | 2023-01-13 | 9E | 20363 | 9E | N915XJ | 4629 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 10577 | 1057705 | 30577 | BGM | Binghamton, NY | NY | 36 | New York | 22 | 2129 | 2124 | -5.0 | 0.0 | 0.0 | -1 | 2100-2159 | 61.0 | 2225 | 2259 | 3.0 | 2228 | 2302 | 34.0 | 34.0 | 1.0 | 2 | 2200-2259 | 0.0 | missing | 0.0 | 59.0 | 98.0 | 34.0 | 1.0 | 147.0 | 1 | 0.0 | 0.0 | 34.0 | 0.0 | 0.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
13 | 2023 | 1 | 1 | 1 | 7 | 2023-01-01 | 9E | 20363 | 9E | N906XJ | 4630 | 11337 | 1133705 | 31337 | DLH | Duluth, MN | MN | 27 | Minnesota | 63 | 13487 | 1348702 | 31650 | MSP | Minneapolis, MN | MN | 27 | Minnesota | 63 | 510 | 516 | 6.0 | 6.0 | 0.0 | 0 | 0001-0559 | 55.0 | 611 | 644 | 6.0 | 626 | 650 | 24.0 | 24.0 | 1.0 | 1 | 0600-0659 | 0.0 | missing | 0.0 | 76.0 | 94.0 | 33.0 | 1.0 | 144.0 | 1 | 6.0 | 0.0 | 18.0 | 0.0 | 0.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋱ |
240250 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N13138 | 1105 | 11618 | 1161802 | 31703 | EWR | Newark, NJ | NJ | 34 | New Jersey | 21 | 13204 | 1320402 | 31454 | MCO | Orlando, FL | FL | 12 | Florida | 33 | 1500 | 1809 | 189.0 | 189.0 | 1.0 | 12 | 1500-1559 | 20.0 | 1829 | 2040 | 14.0 | 1800 | 2054 | 174.0 | 174.0 | 1.0 | 11 | 1800-1859 | 0.0 | 0.0 | 180.0 | 165.0 | 131.0 | 1.0 | 937.0 | 4 | 0.0 | 0.0 | 78.0 | 0.0 | 96.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
240251 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N488UA | 1104 | 11298 | 1129806 | 30194 | DFW | Dallas/Fort Worth, TX | TX | 48 | Texas | 74 | 11292 | 1129202 | 30325 | DEN | Denver, CO | CO | 8 | Colorado | 82 | 1337 | 1400 | 23.0 | 23.0 | 1.0 | 1 | 1300-1359 | 15.0 | 1415 | 1440 | 6.0 | 1446 | 1446 | 0.0 | 0.0 | 0.0 | 0 | 1400-1459 | 0.0 | 0.0 | 129.0 | 106.0 | 85.0 | 1.0 | 641.0 | 3 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
240252 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N35260 | 1103 | 11292 | 1129202 | 30325 | DEN | Denver, CO | CO | 8 | Colorado | 82 | 10721 | 1072102 | 30721 | BOS | Boston, MA | MA | 25 | Massachusetts | 13 | 950 | 957 | 7.0 | 7.0 | 0.0 | 0 | 0900-0959 | 27.0 | 1024 | 1533 | 4.0 | 1541 | 1537 | -4.0 | 0.0 | 0.0 | -1 | 1500-1559 | 0.0 | 0.0 | 231.0 | 220.0 | 189.0 | 1.0 | 1754.0 | 8 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
240253 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N813UA | 1102 | 11066 | 1106606 | 31066 | CMH | Columbus, OH | OH | 39 | Ohio | 44 | 14771 | 1477104 | 32457 | SFO | San Francisco, CA | CA | 6 | California | 91 | 708 | 706 | -2.0 | 0.0 | 0.0 | -1 | 0700-0759 | 19.0 | 725 | 921 | 4.0 | 933 | 925 | -8.0 | 0.0 | 0.0 | -1 | 0900-0959 | 0.0 | 0.0 | 325.0 | 319.0 | 296.0 | 1.0 | 2120.0 | 9 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
240254 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N871UA | 1101 | 13502 | 1350202 | 33502 | MTJ | Montrose/Delta, CO | CO | 8 | Colorado | 82 | 11618 | 1161802 | 31703 | EWR | Newark, NJ | NJ | 34 | New Jersey | 21 | 1525 | 1613 | 48.0 | 48.0 | 1.0 | 3 | 1500-1559 | 13.0 | 1626 | 2145 | 25.0 | 2126 | 2210 | 44.0 | 44.0 | 1.0 | 2 | 2100-2159 | 0.0 | 0.0 | 241.0 | 237.0 | 199.0 | 1.0 | 1795.0 | 8 | 44.0 | 0.0 | 0.0 | 0.0 | 0.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
240255 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N839UA | 1100 | 14869 | 1486903 | 34614 | SLC | Salt Lake City, UT | UT | 49 | Utah | 87 | 11292 | 1129202 | 30325 | DEN | Denver, CO | CO | 8 | Colorado | 82 | 1442 | 1824 | 222.0 | 222.0 | 1.0 | 12 | 1400-1459 | 36.0 | 1900 | 2009 | 17.0 | 1611 | 2026 | 255.0 | 255.0 | 1.0 | 12 | 1600-1659 | 0.0 | 0.0 | 89.0 | 122.0 | 69.0 | 1.0 | 391.0 | 2 | 65.0 | 0.0 | 33.0 | 0.0 | 157.0 | 1807 | 11.0 | 11.0 | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
240256 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N884UA | 1097 | 11697 | 1169706 | 32467 | FLL | Fort Lauderdale, FL | FL | 12 | Florida | 33 | 13930 | 1393008 | 30977 | ORD | Chicago, IL | IL | 17 | Illinois | 41 | 702 | 657 | -5.0 | 0.0 | 0.0 | -1 | 0700-0759 | 21.0 | 718 | 858 | 11.0 | 923 | 909 | -14.0 | 0.0 | 0.0 | -1 | 0900-0959 | 0.0 | 0.0 | 201.0 | 192.0 | 160.0 | 1.0 | 1182.0 | 5 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
240257 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N877UA | 1095 | 11618 | 1161802 | 31703 | EWR | Newark, NJ | NJ | 34 | New Jersey | 21 | 13342 | 1334207 | 33342 | MKE | Milwaukee, WI | WI | 55 | Wisconsin | 45 | 825 | 836 | 11.0 | 11.0 | 0.0 | 0 | 0800-0859 | 32.0 | 908 | 958 | 9.0 | 1001 | 1007 | 6.0 | 6.0 | 0.0 | 0 | 1000-1059 | 0.0 | 0.0 | 156.0 | 151.0 | 110.0 | 1.0 | 725.0 | 3 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
240258 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N73270 | 1093 | 11292 | 1129202 | 30325 | DEN | Denver, CO | CO | 8 | Colorado | 82 | 12892 | 1289208 | 32575 | LAX | Los Angeles, CA | CA | 6 | California | 91 | 745 | 752 | 7.0 | 7.0 | 0.0 | 0 | 0700-0759 | 49.0 | 841 | 932 | 7.0 | 923 | 939 | 16.0 | 16.0 | 1.0 | 1 | 0900-0959 | 0.0 | 0.0 | 158.0 | 167.0 | 111.0 | 1.0 | 862.0 | 4 | 7.0 | 0.0 | 9.0 | 0.0 | 0.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
240259 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N433UA | 1092 | 13930 | 1393008 | 30977 | ORD | Chicago, IL | IL | 17 | Illinois | 41 | 14635 | 1463502 | 31714 | RSW | Fort Myers, FL | FL | 12 | Florida | 33 | 1400 | 1504 | 64.0 | 64.0 | 1.0 | 4 | 1400-1459 | 18.0 | 1522 | 1852 | 4.0 | 1802 | 1856 | 54.0 | 54.0 | 1.0 | 3 | 1800-1859 | 0.0 | 0.0 | 182.0 | 172.0 | 150.0 | 1.0 | 1120.0 | 5 | 0.0 | 0.0 | 54.0 | 0.0 | 0.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
240260 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | 1092 | 14635 | 1463502 | 31714 | RSW | Fort Myers, FL | FL | 12 | Florida | 33 | 11618 | 1161802 | 31703 | EWR | Newark, NJ | NJ | 34 | New Jersey | 21 | 1905 | missing | missing | missing | missing | missing | 1900-1959 | missing | missing | missing | missing | 2159 | missing | missing | missing | missing | missing | 2100-2159 | 1.0 | A | 0.0 | 174.0 | missing | missing | 1.0 | 1068.0 | 5 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
240261 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N68823 | 1086 | 12266 | 1226603 | 31453 | IAH | Houston, TX | TX | 48 | Texas | 74 | 14771 | 1477104 | 32457 | SFO | San Francisco, CA | CA | 6 | California | 91 | 1814 | 2044 | 150.0 | 150.0 | 1.0 | 10 | 1800-1859 | 21.0 | 2105 | 2306 | 7.0 | 2035 | 2313 | 158.0 | 158.0 | 1.0 | 10 | 2000-2059 | 0.0 | 0.0 | 261.0 | 269.0 | 241.0 | 1.0 | 1635.0 | 7 | 0.0 | 0.0 | 158.0 | 0.0 | 0.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
DataFrames.jl also provides a subset
function as an alternative (often more
efficient) way to get subsets. The first argument to the subset
function is
the data set, while the second argument is a Pair
that describes a condition.
In Julia, a Pair
is a helper data structure that pairs two pieces of
information, and can be created with the =>
operator. In this case, the Pair
should pair column name(s) with a test function to apply to the column(s). For
example, the anonymous function x -> x .< 15
tests whether the elements of an
array are less than 15, so you can get all rows where DayofMonth
is less than
15
with this code:
subset(air, :DayofMonth => x -> x .< 15)
# Or: subset(air, :DayofMonth => ByRow(x -> x < 15))
Row | Year | Quarter | Month | DayofMonth | DayOfWeek | FlightDate | Reporting_Airline | DOT_ID_Reporting_Airline | IATA_CODE_Reporting_Airline | Tail_Number | Flight_Number_Reporting_Airline | OriginAirportID | OriginAirportSeqID | OriginCityMarketID | Origin | OriginCityName | OriginState | OriginStateFips | OriginStateName | OriginWac | DestAirportID | DestAirportSeqID | DestCityMarketID | Dest | DestCityName | DestState | DestStateFips | DestStateName | DestWac | CRSDepTime | DepTime | DepDelay | DepDelayMinutes | DepDel15 | DepartureDelayGroups | DepTimeBlk | TaxiOut | WheelsOff | WheelsOn | TaxiIn | CRSArrTime | ArrTime | ArrDelay | ArrDelayMinutes | ArrDel15 | ArrivalDelayGroups | ArrTimeBlk | Cancelled | CancellationCode | Diverted | CRSElapsedTime | ActualElapsedTime | AirTime | Flights | Distance | DistanceGroup | CarrierDelay | WeatherDelay | NASDelay | SecurityDelay | LateAircraftDelay | FirstDepTime | TotalAddGTime | LongestAddGTime | DivAirportLandings | DivReachedDest | DivActualElapsedTime | DivArrDelay | DivDistance | Div1Airport | Div1AirportID | Div1AirportSeqID | Div1WheelsOn | Div1TotalGTime | Div1LongestGTime | Div1WheelsOff | Div1TailNum | Div2Airport | Div2AirportID | Div2AirportSeqID | Div2WheelsOn | Div2TotalGTime | Div2LongestGTime | Div2WheelsOff | Div2TailNum | Div3Airport | Div3AirportID | Div3AirportSeqID | Div3WheelsOn | Div3TotalGTime | Div3LongestGTime | Div3WheelsOff | Div3TailNum | Div4Airport | Div4AirportID | Div4AirportSeqID | Div4WheelsOn | Div4TotalGTime | Div4LongestGTime | Div4WheelsOff | ⋯ |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Int64 | Int64 | Int64 | Int64 | Int64 | Date | String3 | Int64 | String3 | String7 | Int64 | Int64 | Int64 | Int64 | String3 | String | String3 | Int64 | String | Int64 | Int64 | Int64 | Int64 | String3 | String | String3 | Int64 | String | Int64 | Int64 | Int64? | Float64? | Float64? | Float64? | Int64? | String15 | Float64? | Int64? | Int64? | Float64? | Int64 | Int64? | Float64? | Float64? | Float64? | Int64? | String15 | Float64 | String3? | Float64 | Float64? | Float64? | Float64? | Float64 | Float64 | Int64 | Float64? | Float64? | Float64? | Float64? | Float64? | Int64? | Float64? | Float64? | Int64 | Float64? | Float64? | Float64? | Float64? | String3? | Int64? | Int64? | Int64? | Float64? | Float64? | Int64? | String7? | String3? | Int64? | Int64? | Int64? | Float64? | Float64? | Int64? | String7? | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | ⋯ | |
1 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | 9E | 20363 | 9E | N605LR | 4628 | 10529 | 1052907 | 30529 | BDL | Hartford, CT | CT | 9 | Connecticut | 11 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 800 | 757 | -3.0 | 0.0 | 0.0 | -1 | 0800-0859 | 11.0 | 808 | 833 | 20.0 | 905 | 853 | -12.0 | 0.0 | 0.0 | -1 | 0900-0959 | 0.0 | missing | 0.0 | 65.0 | 56.0 | 25.0 | 1.0 | 101.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
2 | 2023 | 1 | 1 | 3 | 2 | 2023-01-03 | 9E | 20363 | 9E | N605LR | 4628 | 10529 | 1052907 | 30529 | BDL | Hartford, CT | CT | 9 | Connecticut | 11 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 800 | 755 | -5.0 | 0.0 | 0.0 | -1 | 0800-0859 | 19.0 | 814 | 851 | 6.0 | 905 | 857 | -8.0 | 0.0 | 0.0 | -1 | 0900-0959 | 0.0 | missing | 0.0 | 65.0 | 62.0 | 37.0 | 1.0 | 101.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
3 | 2023 | 1 | 1 | 4 | 3 | 2023-01-04 | 9E | 20363 | 9E | N331PQ | 4628 | 10529 | 1052907 | 30529 | BDL | Hartford, CT | CT | 9 | Connecticut | 11 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 800 | 755 | -5.0 | 0.0 | 0.0 | -1 | 0800-0859 | 14.0 | 809 | 837 | 7.0 | 905 | 844 | -21.0 | 0.0 | 0.0 | -2 | 0900-0959 | 0.0 | missing | 0.0 | 65.0 | 49.0 | 28.0 | 1.0 | 101.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
4 | 2023 | 1 | 1 | 5 | 4 | 2023-01-05 | 9E | 20363 | 9E | N906XJ | 4628 | 10529 | 1052907 | 30529 | BDL | Hartford, CT | CT | 9 | Connecticut | 11 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 800 | 754 | -6.0 | 0.0 | 0.0 | -1 | 0800-0859 | 13.0 | 807 | 845 | 3.0 | 905 | 848 | -17.0 | 0.0 | 0.0 | -2 | 0900-0959 | 0.0 | missing | 0.0 | 65.0 | 54.0 | 38.0 | 1.0 | 101.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
5 | 2023 | 1 | 1 | 6 | 5 | 2023-01-06 | 9E | 20363 | 9E | N337PQ | 4628 | 10529 | 1052907 | 30529 | BDL | Hartford, CT | CT | 9 | Connecticut | 11 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 800 | 759 | -1.0 | 0.0 | 0.0 | -1 | 0800-0859 | 17.0 | 816 | 844 | 5.0 | 905 | 849 | -16.0 | 0.0 | 0.0 | -2 | 0900-0959 | 0.0 | missing | 0.0 | 65.0 | 50.0 | 28.0 | 1.0 | 101.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
6 | 2023 | 1 | 1 | 7 | 6 | 2023-01-07 | 9E | 20363 | 9E | N336PQ | 4628 | 10529 | 1052907 | 30529 | BDL | Hartford, CT | CT | 9 | Connecticut | 11 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 800 | 750 | -10.0 | 0.0 | 0.0 | -1 | 0800-0859 | 17.0 | 807 | 845 | 7.0 | 905 | 852 | -13.0 | 0.0 | 0.0 | -1 | 0900-0959 | 0.0 | missing | 0.0 | 65.0 | 62.0 | 38.0 | 1.0 | 101.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
7 | 2023 | 1 | 1 | 14 | 6 | 2023-01-14 | 9E | 20363 | 9E | N311PQ | 4628 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 11193 | 1119302 | 33105 | CVG | Cincinnati, OH | KY | 21 | Kentucky | 52 | 1500 | 1452 | -8.0 | 0.0 | 0.0 | -1 | 1500-1559 | 26.0 | 1518 | 1643 | 6.0 | 1720 | 1649 | -31.0 | 0.0 | 0.0 | -2 | 1700-1759 | 0.0 | missing | 0.0 | 140.0 | 117.0 | 85.0 | 1.0 | 585.0 | 3 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
8 | 2023 | 1 | 1 | 9 | 1 | 2023-01-09 | 9E | 20363 | 9E | N491PX | 4629 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 10577 | 1057705 | 30577 | BGM | Binghamton, NY | NY | 36 | New York | 22 | 2129 | 2122 | -7.0 | 0.0 | 0.0 | -1 | 2100-2159 | 25.0 | 2147 | 2220 | 5.0 | 2228 | 2225 | -3.0 | 0.0 | 0.0 | -1 | 2200-2259 | 0.0 | missing | 0.0 | 59.0 | 63.0 | 33.0 | 1.0 | 147.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
9 | 2023 | 1 | 1 | 10 | 2 | 2023-01-10 | 9E | 20363 | 9E | N478PX | 4629 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 10577 | 1057705 | 30577 | BGM | Binghamton, NY | NY | 36 | New York | 22 | 2129 | 2114 | -15.0 | 0.0 | 0.0 | -1 | 2100-2159 | 45.0 | 2159 | 2230 | 4.0 | 2228 | 2234 | 6.0 | 6.0 | 0.0 | 0 | 2200-2259 | 0.0 | missing | 0.0 | 59.0 | 80.0 | 31.0 | 1.0 | 147.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
10 | 2023 | 1 | 1 | 11 | 3 | 2023-01-11 | 9E | 20363 | 9E | N135EV | 4629 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 10577 | 1057705 | 30577 | BGM | Binghamton, NY | NY | 36 | New York | 22 | 2129 | 2204 | 35.0 | 35.0 | 1.0 | 2 | 2100-2159 | 46.0 | 2250 | 2326 | 3.0 | 2228 | 2329 | 61.0 | 61.0 | 1.0 | 4 | 2200-2259 | 0.0 | missing | 0.0 | 59.0 | 85.0 | 36.0 | 1.0 | 147.0 | 1 | 0.0 | 0.0 | 26.0 | 0.0 | 35.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
11 | 2023 | 1 | 1 | 12 | 4 | 2023-01-12 | 9E | 20363 | 9E | N197PQ | 4629 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 10577 | 1057705 | 30577 | BGM | Binghamton, NY | NY | 36 | New York | 22 | 2129 | 2341 | 132.0 | 132.0 | 1.0 | 8 | 2100-2159 | 16.0 | 2357 | 35 | 5.0 | 2228 | 40 | 132.0 | 132.0 | 1.0 | 8 | 2200-2259 | 0.0 | missing | 0.0 | 59.0 | 59.0 | 38.0 | 1.0 | 147.0 | 1 | 50.0 | 0.0 | 0.0 | 0.0 | 82.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
12 | 2023 | 1 | 1 | 13 | 5 | 2023-01-13 | 9E | 20363 | 9E | N915XJ | 4629 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 10577 | 1057705 | 30577 | BGM | Binghamton, NY | NY | 36 | New York | 22 | 2129 | 2124 | -5.0 | 0.0 | 0.0 | -1 | 2100-2159 | 61.0 | 2225 | 2259 | 3.0 | 2228 | 2302 | 34.0 | 34.0 | 1.0 | 2 | 2200-2259 | 0.0 | missing | 0.0 | 59.0 | 98.0 | 34.0 | 1.0 | 147.0 | 1 | 0.0 | 0.0 | 34.0 | 0.0 | 0.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
13 | 2023 | 1 | 1 | 1 | 7 | 2023-01-01 | 9E | 20363 | 9E | N906XJ | 4630 | 11337 | 1133705 | 31337 | DLH | Duluth, MN | MN | 27 | Minnesota | 63 | 13487 | 1348702 | 31650 | MSP | Minneapolis, MN | MN | 27 | Minnesota | 63 | 510 | 516 | 6.0 | 6.0 | 0.0 | 0 | 0001-0559 | 55.0 | 611 | 644 | 6.0 | 626 | 650 | 24.0 | 24.0 | 1.0 | 1 | 0600-0659 | 0.0 | missing | 0.0 | 76.0 | 94.0 | 33.0 | 1.0 | 144.0 | 1 | 6.0 | 0.0 | 18.0 | 0.0 | 0.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋱ |
240250 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N13138 | 1105 | 11618 | 1161802 | 31703 | EWR | Newark, NJ | NJ | 34 | New Jersey | 21 | 13204 | 1320402 | 31454 | MCO | Orlando, FL | FL | 12 | Florida | 33 | 1500 | 1809 | 189.0 | 189.0 | 1.0 | 12 | 1500-1559 | 20.0 | 1829 | 2040 | 14.0 | 1800 | 2054 | 174.0 | 174.0 | 1.0 | 11 | 1800-1859 | 0.0 | 0.0 | 180.0 | 165.0 | 131.0 | 1.0 | 937.0 | 4 | 0.0 | 0.0 | 78.0 | 0.0 | 96.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
240251 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N488UA | 1104 | 11298 | 1129806 | 30194 | DFW | Dallas/Fort Worth, TX | TX | 48 | Texas | 74 | 11292 | 1129202 | 30325 | DEN | Denver, CO | CO | 8 | Colorado | 82 | 1337 | 1400 | 23.0 | 23.0 | 1.0 | 1 | 1300-1359 | 15.0 | 1415 | 1440 | 6.0 | 1446 | 1446 | 0.0 | 0.0 | 0.0 | 0 | 1400-1459 | 0.0 | 0.0 | 129.0 | 106.0 | 85.0 | 1.0 | 641.0 | 3 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
240252 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N35260 | 1103 | 11292 | 1129202 | 30325 | DEN | Denver, CO | CO | 8 | Colorado | 82 | 10721 | 1072102 | 30721 | BOS | Boston, MA | MA | 25 | Massachusetts | 13 | 950 | 957 | 7.0 | 7.0 | 0.0 | 0 | 0900-0959 | 27.0 | 1024 | 1533 | 4.0 | 1541 | 1537 | -4.0 | 0.0 | 0.0 | -1 | 1500-1559 | 0.0 | 0.0 | 231.0 | 220.0 | 189.0 | 1.0 | 1754.0 | 8 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
240253 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N813UA | 1102 | 11066 | 1106606 | 31066 | CMH | Columbus, OH | OH | 39 | Ohio | 44 | 14771 | 1477104 | 32457 | SFO | San Francisco, CA | CA | 6 | California | 91 | 708 | 706 | -2.0 | 0.0 | 0.0 | -1 | 0700-0759 | 19.0 | 725 | 921 | 4.0 | 933 | 925 | -8.0 | 0.0 | 0.0 | -1 | 0900-0959 | 0.0 | 0.0 | 325.0 | 319.0 | 296.0 | 1.0 | 2120.0 | 9 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
240254 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N871UA | 1101 | 13502 | 1350202 | 33502 | MTJ | Montrose/Delta, CO | CO | 8 | Colorado | 82 | 11618 | 1161802 | 31703 | EWR | Newark, NJ | NJ | 34 | New Jersey | 21 | 1525 | 1613 | 48.0 | 48.0 | 1.0 | 3 | 1500-1559 | 13.0 | 1626 | 2145 | 25.0 | 2126 | 2210 | 44.0 | 44.0 | 1.0 | 2 | 2100-2159 | 0.0 | 0.0 | 241.0 | 237.0 | 199.0 | 1.0 | 1795.0 | 8 | 44.0 | 0.0 | 0.0 | 0.0 | 0.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
240255 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N839UA | 1100 | 14869 | 1486903 | 34614 | SLC | Salt Lake City, UT | UT | 49 | Utah | 87 | 11292 | 1129202 | 30325 | DEN | Denver, CO | CO | 8 | Colorado | 82 | 1442 | 1824 | 222.0 | 222.0 | 1.0 | 12 | 1400-1459 | 36.0 | 1900 | 2009 | 17.0 | 1611 | 2026 | 255.0 | 255.0 | 1.0 | 12 | 1600-1659 | 0.0 | 0.0 | 89.0 | 122.0 | 69.0 | 1.0 | 391.0 | 2 | 65.0 | 0.0 | 33.0 | 0.0 | 157.0 | 1807 | 11.0 | 11.0 | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
240256 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N884UA | 1097 | 11697 | 1169706 | 32467 | FLL | Fort Lauderdale, FL | FL | 12 | Florida | 33 | 13930 | 1393008 | 30977 | ORD | Chicago, IL | IL | 17 | Illinois | 41 | 702 | 657 | -5.0 | 0.0 | 0.0 | -1 | 0700-0759 | 21.0 | 718 | 858 | 11.0 | 923 | 909 | -14.0 | 0.0 | 0.0 | -1 | 0900-0959 | 0.0 | 0.0 | 201.0 | 192.0 | 160.0 | 1.0 | 1182.0 | 5 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
240257 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N877UA | 1095 | 11618 | 1161802 | 31703 | EWR | Newark, NJ | NJ | 34 | New Jersey | 21 | 13342 | 1334207 | 33342 | MKE | Milwaukee, WI | WI | 55 | Wisconsin | 45 | 825 | 836 | 11.0 | 11.0 | 0.0 | 0 | 0800-0859 | 32.0 | 908 | 958 | 9.0 | 1001 | 1007 | 6.0 | 6.0 | 0.0 | 0 | 1000-1059 | 0.0 | 0.0 | 156.0 | 151.0 | 110.0 | 1.0 | 725.0 | 3 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
240258 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N73270 | 1093 | 11292 | 1129202 | 30325 | DEN | Denver, CO | CO | 8 | Colorado | 82 | 12892 | 1289208 | 32575 | LAX | Los Angeles, CA | CA | 6 | California | 91 | 745 | 752 | 7.0 | 7.0 | 0.0 | 0 | 0700-0759 | 49.0 | 841 | 932 | 7.0 | 923 | 939 | 16.0 | 16.0 | 1.0 | 1 | 0900-0959 | 0.0 | 0.0 | 158.0 | 167.0 | 111.0 | 1.0 | 862.0 | 4 | 7.0 | 0.0 | 9.0 | 0.0 | 0.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
240259 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N433UA | 1092 | 13930 | 1393008 | 30977 | ORD | Chicago, IL | IL | 17 | Illinois | 41 | 14635 | 1463502 | 31714 | RSW | Fort Myers, FL | FL | 12 | Florida | 33 | 1400 | 1504 | 64.0 | 64.0 | 1.0 | 4 | 1400-1459 | 18.0 | 1522 | 1852 | 4.0 | 1802 | 1856 | 54.0 | 54.0 | 1.0 | 3 | 1800-1859 | 0.0 | 0.0 | 182.0 | 172.0 | 150.0 | 1.0 | 1120.0 | 5 | 0.0 | 0.0 | 54.0 | 0.0 | 0.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
240260 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | 1092 | 14635 | 1463502 | 31714 | RSW | Fort Myers, FL | FL | 12 | Florida | 33 | 11618 | 1161802 | 31703 | EWR | Newark, NJ | NJ | 34 | New Jersey | 21 | 1905 | missing | missing | missing | missing | missing | 1900-1959 | missing | missing | missing | missing | 2159 | missing | missing | missing | missing | missing | 2100-2159 | 1.0 | A | 0.0 | 174.0 | missing | missing | 1.0 | 1068.0 | 5 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
240261 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N68823 | 1086 | 12266 | 1226603 | 31453 | IAH | Houston, TX | TX | 48 | Texas | 74 | 14771 | 1477104 | 32457 | SFO | San Francisco, CA | CA | 6 | California | 91 | 1814 | 2044 | 150.0 | 150.0 | 1.0 | 10 | 1800-1859 | 21.0 | 2105 | 2306 | 7.0 | 2035 | 2313 | 158.0 | 158.0 | 1.0 | 10 | 2000-2059 | 0.0 | 0.0 | 261.0 | 269.0 | 241.0 | 1.0 | 1635.0 | 7 | 0.0 | 0.0 | 158.0 | 0.0 | 0.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
5.3. Grouping & Aggregating#
Aggregation is especially useful when combined with grouping. You can group
sets of rows in a data frame with the groupby
function. Its first argument is
the data and its second is the grouping columns. For example, to group the
air
data by Year
:
groupby(air, :Year)
GroupedDataFrame with 1 group based on key: Year
Row | Year | Quarter | Month | DayofMonth | DayOfWeek | FlightDate | Reporting_Airline | DOT_ID_Reporting_Airline | IATA_CODE_Reporting_Airline | Tail_Number | Flight_Number_Reporting_Airline | OriginAirportID | OriginAirportSeqID | OriginCityMarketID | Origin | OriginCityName | OriginState | OriginStateFips | OriginStateName | OriginWac | DestAirportID | DestAirportSeqID | DestCityMarketID | Dest | DestCityName | DestState | DestStateFips | DestStateName | DestWac | CRSDepTime | DepTime | DepDelay | DepDelayMinutes | DepDel15 | DepartureDelayGroups | DepTimeBlk | TaxiOut | WheelsOff | WheelsOn | TaxiIn | CRSArrTime | ArrTime | ArrDelay | ArrDelayMinutes | ArrDel15 | ArrivalDelayGroups | ArrTimeBlk | Cancelled | CancellationCode | Diverted | CRSElapsedTime | ActualElapsedTime | AirTime | Flights | Distance | DistanceGroup | CarrierDelay | WeatherDelay | NASDelay | SecurityDelay | LateAircraftDelay | FirstDepTime | TotalAddGTime | LongestAddGTime | DivAirportLandings | DivReachedDest | DivActualElapsedTime | DivArrDelay | DivDistance | Div1Airport | Div1AirportID | Div1AirportSeqID | Div1WheelsOn | Div1TotalGTime | Div1LongestGTime | Div1WheelsOff | Div1TailNum | Div2Airport | Div2AirportID | Div2AirportSeqID | Div2WheelsOn | Div2TotalGTime | Div2LongestGTime | Div2WheelsOff | Div2TailNum | Div3Airport | Div3AirportID | Div3AirportSeqID | Div3WheelsOn | Div3TotalGTime | Div3LongestGTime | Div3WheelsOff | Div3TailNum | Div4Airport | Div4AirportID | Div4AirportSeqID | Div4WheelsOn | Div4TotalGTime | Div4LongestGTime | Div4WheelsOff | ⋯ |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Int64 | Int64 | Int64 | Int64 | Int64 | Date | String3 | Int64 | String3 | String7 | Int64 | Int64 | Int64 | Int64 | String3 | String | String3 | Int64 | String | Int64 | Int64 | Int64 | Int64 | String3 | String | String3 | Int64 | String | Int64 | Int64 | Int64? | Float64? | Float64? | Float64? | Int64? | String15 | Float64? | Int64? | Int64? | Float64? | Int64 | Int64? | Float64? | Float64? | Float64? | Int64? | String15 | Float64 | String3? | Float64 | Float64? | Float64? | Float64? | Float64 | Float64 | Int64 | Float64? | Float64? | Float64? | Float64? | Float64? | Int64? | Float64? | Float64? | Int64 | Float64? | Float64? | Float64? | Float64? | String3? | Int64? | Int64? | Int64? | Float64? | Float64? | Int64? | String7? | String3? | Int64? | Int64? | Int64? | Float64? | Float64? | Int64? | String7? | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | Missing | ⋯ | |
1 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | 9E | 20363 | 9E | N605LR | 4628 | 10529 | 1052907 | 30529 | BDL | Hartford, CT | CT | 9 | Connecticut | 11 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 800 | 757 | -3.0 | 0.0 | 0.0 | -1 | 0800-0859 | 11.0 | 808 | 833 | 20.0 | 905 | 853 | -12.0 | 0.0 | 0.0 | -1 | 0900-0959 | 0.0 | missing | 0.0 | 65.0 | 56.0 | 25.0 | 1.0 | 101.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
2 | 2023 | 1 | 1 | 3 | 2 | 2023-01-03 | 9E | 20363 | 9E | N605LR | 4628 | 10529 | 1052907 | 30529 | BDL | Hartford, CT | CT | 9 | Connecticut | 11 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 800 | 755 | -5.0 | 0.0 | 0.0 | -1 | 0800-0859 | 19.0 | 814 | 851 | 6.0 | 905 | 857 | -8.0 | 0.0 | 0.0 | -1 | 0900-0959 | 0.0 | missing | 0.0 | 65.0 | 62.0 | 37.0 | 1.0 | 101.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
3 | 2023 | 1 | 1 | 4 | 3 | 2023-01-04 | 9E | 20363 | 9E | N331PQ | 4628 | 10529 | 1052907 | 30529 | BDL | Hartford, CT | CT | 9 | Connecticut | 11 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 800 | 755 | -5.0 | 0.0 | 0.0 | -1 | 0800-0859 | 14.0 | 809 | 837 | 7.0 | 905 | 844 | -21.0 | 0.0 | 0.0 | -2 | 0900-0959 | 0.0 | missing | 0.0 | 65.0 | 49.0 | 28.0 | 1.0 | 101.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
4 | 2023 | 1 | 1 | 5 | 4 | 2023-01-05 | 9E | 20363 | 9E | N906XJ | 4628 | 10529 | 1052907 | 30529 | BDL | Hartford, CT | CT | 9 | Connecticut | 11 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 800 | 754 | -6.0 | 0.0 | 0.0 | -1 | 0800-0859 | 13.0 | 807 | 845 | 3.0 | 905 | 848 | -17.0 | 0.0 | 0.0 | -2 | 0900-0959 | 0.0 | missing | 0.0 | 65.0 | 54.0 | 38.0 | 1.0 | 101.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
5 | 2023 | 1 | 1 | 6 | 5 | 2023-01-06 | 9E | 20363 | 9E | N337PQ | 4628 | 10529 | 1052907 | 30529 | BDL | Hartford, CT | CT | 9 | Connecticut | 11 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 800 | 759 | -1.0 | 0.0 | 0.0 | -1 | 0800-0859 | 17.0 | 816 | 844 | 5.0 | 905 | 849 | -16.0 | 0.0 | 0.0 | -2 | 0900-0959 | 0.0 | missing | 0.0 | 65.0 | 50.0 | 28.0 | 1.0 | 101.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
6 | 2023 | 1 | 1 | 7 | 6 | 2023-01-07 | 9E | 20363 | 9E | N336PQ | 4628 | 10529 | 1052907 | 30529 | BDL | Hartford, CT | CT | 9 | Connecticut | 11 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 800 | 750 | -10.0 | 0.0 | 0.0 | -1 | 0800-0859 | 17.0 | 807 | 845 | 7.0 | 905 | 852 | -13.0 | 0.0 | 0.0 | -1 | 0900-0959 | 0.0 | missing | 0.0 | 65.0 | 62.0 | 38.0 | 1.0 | 101.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
7 | 2023 | 1 | 1 | 14 | 6 | 2023-01-14 | 9E | 20363 | 9E | N311PQ | 4628 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 11193 | 1119302 | 33105 | CVG | Cincinnati, OH | KY | 21 | Kentucky | 52 | 1500 | 1452 | -8.0 | 0.0 | 0.0 | -1 | 1500-1559 | 26.0 | 1518 | 1643 | 6.0 | 1720 | 1649 | -31.0 | 0.0 | 0.0 | -2 | 1700-1759 | 0.0 | missing | 0.0 | 140.0 | 117.0 | 85.0 | 1.0 | 585.0 | 3 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
8 | 2023 | 1 | 1 | 21 | 6 | 2023-01-21 | 9E | 20363 | 9E | N917XJ | 4628 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 11193 | 1119302 | 33105 | CVG | Cincinnati, OH | KY | 21 | Kentucky | 52 | 1500 | 1450 | -10.0 | 0.0 | 0.0 | -1 | 1500-1559 | 16.0 | 1506 | 1650 | 5.0 | 1720 | 1655 | -25.0 | 0.0 | 0.0 | -2 | 1700-1759 | 0.0 | missing | 0.0 | 140.0 | 125.0 | 104.0 | 1.0 | 585.0 | 3 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
9 | 2023 | 1 | 1 | 28 | 6 | 2023-01-28 | 9E | 20363 | 9E | N336PQ | 4628 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 11193 | 1119302 | 33105 | CVG | Cincinnati, OH | KY | 21 | Kentucky | 52 | 1500 | 1455 | -5.0 | 0.0 | 0.0 | -1 | 1500-1559 | 15.0 | 1510 | 1656 | 9.0 | 1720 | 1705 | -15.0 | 0.0 | 0.0 | -1 | 1700-1759 | 0.0 | missing | 0.0 | 140.0 | 130.0 | 106.0 | 1.0 | 585.0 | 3 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
10 | 2023 | 1 | 1 | 9 | 1 | 2023-01-09 | 9E | 20363 | 9E | N491PX | 4629 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 10577 | 1057705 | 30577 | BGM | Binghamton, NY | NY | 36 | New York | 22 | 2129 | 2122 | -7.0 | 0.0 | 0.0 | -1 | 2100-2159 | 25.0 | 2147 | 2220 | 5.0 | 2228 | 2225 | -3.0 | 0.0 | 0.0 | -1 | 2200-2259 | 0.0 | missing | 0.0 | 59.0 | 63.0 | 33.0 | 1.0 | 147.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
11 | 2023 | 1 | 1 | 10 | 2 | 2023-01-10 | 9E | 20363 | 9E | N478PX | 4629 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 10577 | 1057705 | 30577 | BGM | Binghamton, NY | NY | 36 | New York | 22 | 2129 | 2114 | -15.0 | 0.0 | 0.0 | -1 | 2100-2159 | 45.0 | 2159 | 2230 | 4.0 | 2228 | 2234 | 6.0 | 6.0 | 0.0 | 0 | 2200-2259 | 0.0 | missing | 0.0 | 59.0 | 80.0 | 31.0 | 1.0 | 147.0 | 1 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
12 | 2023 | 1 | 1 | 11 | 3 | 2023-01-11 | 9E | 20363 | 9E | N135EV | 4629 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 10577 | 1057705 | 30577 | BGM | Binghamton, NY | NY | 36 | New York | 22 | 2129 | 2204 | 35.0 | 35.0 | 1.0 | 2 | 2100-2159 | 46.0 | 2250 | 2326 | 3.0 | 2228 | 2329 | 61.0 | 61.0 | 1.0 | 4 | 2200-2259 | 0.0 | missing | 0.0 | 59.0 | 85.0 | 36.0 | 1.0 | 147.0 | 1 | 0.0 | 0.0 | 26.0 | 0.0 | 35.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
13 | 2023 | 1 | 1 | 12 | 4 | 2023-01-12 | 9E | 20363 | 9E | N197PQ | 4629 | 12953 | 1295304 | 31703 | LGA | New York, NY | NY | 36 | New York | 22 | 10577 | 1057705 | 30577 | BGM | Binghamton, NY | NY | 36 | New York | 22 | 2129 | 2341 | 132.0 | 132.0 | 1.0 | 8 | 2100-2159 | 16.0 | 2357 | 35 | 5.0 | 2228 | 40 | 132.0 | 132.0 | 1.0 | 8 | 2200-2259 | 0.0 | missing | 0.0 | 59.0 | 59.0 | 38.0 | 1.0 | 147.0 | 1 | 50.0 | 0.0 | 0.0 | 0.0 | 82.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋱ |
538826 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N13138 | 1105 | 11618 | 1161802 | 31703 | EWR | Newark, NJ | NJ | 34 | New Jersey | 21 | 13204 | 1320402 | 31454 | MCO | Orlando, FL | FL | 12 | Florida | 33 | 1500 | 1809 | 189.0 | 189.0 | 1.0 | 12 | 1500-1559 | 20.0 | 1829 | 2040 | 14.0 | 1800 | 2054 | 174.0 | 174.0 | 1.0 | 11 | 1800-1859 | 0.0 | 0.0 | 180.0 | 165.0 | 131.0 | 1.0 | 937.0 | 4 | 0.0 | 0.0 | 78.0 | 0.0 | 96.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
538827 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N488UA | 1104 | 11298 | 1129806 | 30194 | DFW | Dallas/Fort Worth, TX | TX | 48 | Texas | 74 | 11292 | 1129202 | 30325 | DEN | Denver, CO | CO | 8 | Colorado | 82 | 1337 | 1400 | 23.0 | 23.0 | 1.0 | 1 | 1300-1359 | 15.0 | 1415 | 1440 | 6.0 | 1446 | 1446 | 0.0 | 0.0 | 0.0 | 0 | 1400-1459 | 0.0 | 0.0 | 129.0 | 106.0 | 85.0 | 1.0 | 641.0 | 3 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
538828 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N35260 | 1103 | 11292 | 1129202 | 30325 | DEN | Denver, CO | CO | 8 | Colorado | 82 | 10721 | 1072102 | 30721 | BOS | Boston, MA | MA | 25 | Massachusetts | 13 | 950 | 957 | 7.0 | 7.0 | 0.0 | 0 | 0900-0959 | 27.0 | 1024 | 1533 | 4.0 | 1541 | 1537 | -4.0 | 0.0 | 0.0 | -1 | 1500-1559 | 0.0 | 0.0 | 231.0 | 220.0 | 189.0 | 1.0 | 1754.0 | 8 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
538829 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N813UA | 1102 | 11066 | 1106606 | 31066 | CMH | Columbus, OH | OH | 39 | Ohio | 44 | 14771 | 1477104 | 32457 | SFO | San Francisco, CA | CA | 6 | California | 91 | 708 | 706 | -2.0 | 0.0 | 0.0 | -1 | 0700-0759 | 19.0 | 725 | 921 | 4.0 | 933 | 925 | -8.0 | 0.0 | 0.0 | -1 | 0900-0959 | 0.0 | 0.0 | 325.0 | 319.0 | 296.0 | 1.0 | 2120.0 | 9 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
538830 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N871UA | 1101 | 13502 | 1350202 | 33502 | MTJ | Montrose/Delta, CO | CO | 8 | Colorado | 82 | 11618 | 1161802 | 31703 | EWR | Newark, NJ | NJ | 34 | New Jersey | 21 | 1525 | 1613 | 48.0 | 48.0 | 1.0 | 3 | 1500-1559 | 13.0 | 1626 | 2145 | 25.0 | 2126 | 2210 | 44.0 | 44.0 | 1.0 | 2 | 2100-2159 | 0.0 | 0.0 | 241.0 | 237.0 | 199.0 | 1.0 | 1795.0 | 8 | 44.0 | 0.0 | 0.0 | 0.0 | 0.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
538831 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N839UA | 1100 | 14869 | 1486903 | 34614 | SLC | Salt Lake City, UT | UT | 49 | Utah | 87 | 11292 | 1129202 | 30325 | DEN | Denver, CO | CO | 8 | Colorado | 82 | 1442 | 1824 | 222.0 | 222.0 | 1.0 | 12 | 1400-1459 | 36.0 | 1900 | 2009 | 17.0 | 1611 | 2026 | 255.0 | 255.0 | 1.0 | 12 | 1600-1659 | 0.0 | 0.0 | 89.0 | 122.0 | 69.0 | 1.0 | 391.0 | 2 | 65.0 | 0.0 | 33.0 | 0.0 | 157.0 | 1807 | 11.0 | 11.0 | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
538832 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N884UA | 1097 | 11697 | 1169706 | 32467 | FLL | Fort Lauderdale, FL | FL | 12 | Florida | 33 | 13930 | 1393008 | 30977 | ORD | Chicago, IL | IL | 17 | Illinois | 41 | 702 | 657 | -5.0 | 0.0 | 0.0 | -1 | 0700-0759 | 21.0 | 718 | 858 | 11.0 | 923 | 909 | -14.0 | 0.0 | 0.0 | -1 | 0900-0959 | 0.0 | 0.0 | 201.0 | 192.0 | 160.0 | 1.0 | 1182.0 | 5 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
538833 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N877UA | 1095 | 11618 | 1161802 | 31703 | EWR | Newark, NJ | NJ | 34 | New Jersey | 21 | 13342 | 1334207 | 33342 | MKE | Milwaukee, WI | WI | 55 | Wisconsin | 45 | 825 | 836 | 11.0 | 11.0 | 0.0 | 0 | 0800-0859 | 32.0 | 908 | 958 | 9.0 | 1001 | 1007 | 6.0 | 6.0 | 0.0 | 0 | 1000-1059 | 0.0 | 0.0 | 156.0 | 151.0 | 110.0 | 1.0 | 725.0 | 3 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
538834 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N73270 | 1093 | 11292 | 1129202 | 30325 | DEN | Denver, CO | CO | 8 | Colorado | 82 | 12892 | 1289208 | 32575 | LAX | Los Angeles, CA | CA | 6 | California | 91 | 745 | 752 | 7.0 | 7.0 | 0.0 | 0 | 0700-0759 | 49.0 | 841 | 932 | 7.0 | 923 | 939 | 16.0 | 16.0 | 1.0 | 1 | 0900-0959 | 0.0 | 0.0 | 158.0 | 167.0 | 111.0 | 1.0 | 862.0 | 4 | 7.0 | 0.0 | 9.0 | 0.0 | 0.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
538835 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N433UA | 1092 | 13930 | 1393008 | 30977 | ORD | Chicago, IL | IL | 17 | Illinois | 41 | 14635 | 1463502 | 31714 | RSW | Fort Myers, FL | FL | 12 | Florida | 33 | 1400 | 1504 | 64.0 | 64.0 | 1.0 | 4 | 1400-1459 | 18.0 | 1522 | 1852 | 4.0 | 1802 | 1856 | 54.0 | 54.0 | 1.0 | 3 | 1800-1859 | 0.0 | 0.0 | 182.0 | 172.0 | 150.0 | 1.0 | 1120.0 | 5 | 0.0 | 0.0 | 54.0 | 0.0 | 0.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
538836 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | 1092 | 14635 | 1463502 | 31714 | RSW | Fort Myers, FL | FL | 12 | Florida | 33 | 11618 | 1161802 | 31703 | EWR | Newark, NJ | NJ | 34 | New Jersey | 21 | 1905 | missing | missing | missing | missing | missing | 1900-1959 | missing | missing | missing | missing | 2159 | missing | missing | missing | missing | missing | 2100-2159 | 1.0 | A | 0.0 | 174.0 | missing | missing | 1.0 | 1068.0 | 5 | missing | missing | missing | missing | missing | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ | |||||
538837 | 2023 | 1 | 1 | 2 | 1 | 2023-01-02 | UA | 19977 | UA | N68823 | 1086 | 12266 | 1226603 | 31453 | IAH | Houston, TX | TX | 48 | Texas | 74 | 14771 | 1477104 | 32457 | SFO | San Francisco, CA | CA | 6 | California | 91 | 1814 | 2044 | 150.0 | 150.0 | 1.0 | 10 | 1800-1859 | 21.0 | 2105 | 2306 | 7.0 | 2035 | 2313 | 158.0 | 158.0 | 1.0 | 10 | 2000-2059 | 0.0 | 0.0 | 261.0 | 269.0 | 241.0 | 1.0 | 1635.0 | 7 | 0.0 | 0.0 | 158.0 | 0.0 | 0.0 | missing | missing | missing | 0 | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | missing | ⋯ |
Once data is grouped, you can use the combine
function to call a function on
each group (or columns within a group). For instance, to get the frequency for
each day of week value:
combine(groupby(air, :DayOfWeek), nrow)
Row | DayOfWeek | nrow |
---|---|---|
Int64 | Int64 | |
1 | 1 | 90875 |
2 | 2 | 86270 |
3 | 3 | 68901 |
4 | 4 | 72392 |
5 | 5 | 72554 |
6 | 6 | 61150 |
7 | 7 | 86695 |
You can also use combine
with a Pair in the second argument to apply a
function to specific columns. For example, to get the mean (non-missing) delay
for flights by day of week:
using Statistics
combine(groupby(air, :DayOfWeek), :DepDelay => x -> mean(skipmissing(x)))
Row | DayOfWeek | DepDelay_function |
---|---|---|
Int64 | Float64 | |
1 | 1 | 14.9877 |
2 | 2 | 12.0788 |
3 | 3 | 26.5502 |
4 | 4 | 10.8908 |
5 | 5 | 7.71696 |
6 | 6 | 6.40796 |
7 | 7 | 11.9266 |
The DataFrames.jl documentation provides more examples of ways you can use
groupby
and combine
.
5.4. Helper Packages#
The DataFrames.jl programming interface may seem awkward or excessively verbose if you’re coming to Julia from R or Python. The community is aware of this and as a result, there are now several packages that provide more familiar programming interfaces. In particular:
DataFramesMeta.jl provides a macro interface similar to R’s dplyr package.
Pandas.jl is a wrapper around Python’s Pandas package.