Getting ready

To build a linear regression model to predict the arrival delay time, you will need to have the RevoScaleR library. The dataset for this recipe will be the XDF file containing the following variables:

  • YEAR
  • QUARTER
  • MONTH
  • DAY_OF_MONTH
  • DAY_OF_WEEK
  • ORIGIN
  • DEST
  • DEP_DELAY
  • ARR_DELAY

The objective is to build a linear regression model with ARR_DELAY as the dependant (outcome variable) and ORIGIN, DEST, DEP_DELAY, and DAY_OF_WEEK as the features (independent variables). Note that the variables ORIGIN and DEST will be treated as factor (categorical) variables.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset