# A tibble: 6 × 19
year month day dep_time sched_dep…¹ dep_d…² arr_t…³ sched…⁴ arr_d…⁵ carrier
<int> <int> <int> <int> <int> <dbl> <int> <int> <dbl> <chr>
1 2013 2 28 NA 605 NA NA 805 NA MQ
2 2013 2 28 NA 850 NA NA 1035 NA MQ
3 2013 2 28 NA 905 NA NA 1115 NA MQ
4 2013 2 28 NA 1115 NA NA 1310 NA MQ
5 2013 2 28 NA 830 NA NA 1205 NA UA
6 2013 2 28 NA 840 NA NA 1147 NA UA
# … with 9 more variables: flight <int>, tailnum <chr>, origin <chr>,
# dest <chr>, air_time <dbl>, distance <dbl>, hour <dbl>, minute <dbl>,
# time_hour <dttm>, and abbreviated variable names ¹sched_dep_time,
# ²dep_delay, ³arr_time, ⁴sched_arr_time, ⁵arr_delay
# A tibble: 336,776 × 5
year month day origin carrier
<int> <int> <int> <chr> <chr>
1 2013 1 1 EWR UA
2 2013 1 1 LGA UA
3 2013 1 1 JFK AA
4 2013 1 1 JFK B6
5 2013 1 1 LGA DL
6 2013 1 1 EWR UA
7 2013 1 1 EWR B6
8 2013 1 1 LGA EV
9 2013 1 1 JFK B6
10 2013 1 1 LGA AA
# … with 336,766 more rows
### join 시킬 데이터셋 - 'airlines'airlines
# A tibble: 16 × 2
carrier name
<chr> <chr>
1 9E Endeavor Air Inc.
2 AA American Airlines Inc.
3 AS Alaska Airlines Inc.
4 B6 JetBlue Airways
5 DL Delta Air Lines Inc.
6 EV ExpressJet Airlines Inc.
7 F9 Frontier Airlines Inc.
8 FL AirTran Airways Corporation
9 HA Hawaiian Airlines Inc.
10 MQ Envoy Air
11 OO SkyWest Airlines Inc.
12 UA United Air Lines Inc.
13 US US Airways Inc.
14 VX Virgin America
15 WN Southwest Airlines Co.
16 YV Mesa Airlines Inc.
### key값 기준으로 왼쪽에 joinflights_sample2 %>%left_join(airlines, by="carrier")
# A tibble: 336,776 × 6
year month day origin carrier name
<int> <int> <int> <chr> <chr> <chr>
1 2013 1 1 EWR UA United Air Lines Inc.
2 2013 1 1 LGA UA United Air Lines Inc.
3 2013 1 1 JFK AA American Airlines Inc.
4 2013 1 1 JFK B6 JetBlue Airways
5 2013 1 1 LGA DL Delta Air Lines Inc.
6 2013 1 1 EWR UA United Air Lines Inc.
7 2013 1 1 EWR B6 JetBlue Airways
8 2013 1 1 LGA EV ExpressJet Airlines Inc.
9 2013 1 1 JFK B6 JetBlue Airways
10 2013 1 1 LGA AA American Airlines Inc.
# … with 336,766 more rows
### key값 컬럼 이름이 다를 때colnames(airlines)[1] <-'different'flights_sample2 %>%left_join(airlines, by=c("carrier"="different"))
# A tibble: 336,776 × 6
year month day origin carrier name
<int> <int> <int> <chr> <chr> <chr>
1 2013 1 1 EWR UA United Air Lines Inc.
2 2013 1 1 LGA UA United Air Lines Inc.
3 2013 1 1 JFK AA American Airlines Inc.
4 2013 1 1 JFK B6 JetBlue Airways
5 2013 1 1 LGA DL Delta Air Lines Inc.
6 2013 1 1 EWR UA United Air Lines Inc.
7 2013 1 1 EWR B6 JetBlue Airways
8 2013 1 1 LGA EV ExpressJet Airlines Inc.
9 2013 1 1 JFK B6 JetBlue Airways
10 2013 1 1 LGA AA American Airlines Inc.
# … with 336,766 more rows