Chapter 3: Predicting Sports Winners with Decision Trees

More on pandas

The pandas library is a great package; most of the code you would normally write for data loading is probably already implemented in pandas. You can learn more about it from the pandas tutorials at http://pandas.pydata.org/pandas-docs/stable/tutorials.html

There is also a great blog post by Chris Moffitt that gives an overview of common tasks people do in Excel and shows how to do them in pandas: http://pbpython.com/excel-pandas-comp.html
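To give a flavor of what such Excel-style tasks look like in pandas, here is a minimal sketch of a filter and a pivot table. It is not taken from the blog post; the games table and its column names (team, venue, points) are hypothetical and chosen only for illustration.

```python
import pandas as pd

# Hypothetical game results, invented for this example.
games = pd.DataFrame({
    "team": ["A", "A", "B", "B"],
    "venue": ["home", "away", "home", "away"],
    "points": [100, 90, 95, 105],
})

# Rough equivalent of an Excel filter: keep rows with at least 95 points.
high_scoring = games[games["points"] >= 95]

# Rough equivalent of an Excel pivot table: mean points per team and venue.
pivot = games.pivot_table(
    index="team", columns="venue", values="points", aggfunc="mean"
)

print(high_scoring)
print(pivot)
```

Boolean indexing and pivot_table cover a large share of everyday spreadsheet work, which is why they are a reasonable starting point when moving from Excel to pandas.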

You can also handle large datasets with pandas. For an extensive overview of the process, see the answer by user Jeff (the top answer at the time of writing) to this Stack Overflow question: http://stackoverflow.com/questions/14262433/large-data-work-flows-using-pandas
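One common way to work with data that does not fit comfortably in memory is to read it in chunks and combine partial results. The following is a minimal sketch of that idea, not the workflow described in the linked answer; the in-memory CSV and its columns (team, points) are hypothetical stand-ins for a large file on disk.

```python
import io

import pandas as pd

# Hypothetical CSV standing in for a file too large to load at once.
csv_data = io.StringIO(
    "team,points\n"
    "A,100\n"
    "B,95\n"
    "A,110\n"
    "B,105\n"
    "A,90\n"
)

totals = {}
# With chunksize set, read_csv returns an iterator of DataFrames
# holding at most 2 rows each, instead of one large DataFrame.
for chunk in pd.read_csv(csv_data, chunksize=2):
    # Aggregate within the chunk, then fold into the running totals.
    partial = chunk.groupby("team")["points"].sum()
    for team, points in partial.items():
        totals[team] = totals.get(team, 0) + int(points)

print(totals)
```

In practice the chunk size would be far larger (tens or hundreds of thousands of rows); the value 2 is used here only so that the small example actually spans several chunks.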

Another great tutorial on pandas is written by Brian Connelly: http://bconnelly.net/2013/10/summarizing-data-in-python-with-pandas/
