Acquiring the example data

In this chapter, we will explore the Ling-Spam email dataset (The original dataset is described at http://csmining.org/index.php/ling-spam-datasets.html). Download the dataset from http://data.scala4datascience.com/ling-spam.tar.gz (or ling-spam.zip, depending on your preferred mode of compression), and unpack the contents to the directory containing the code examples for this chapter. The archive contains two directories, spam/ and ham/, containing the spam and legitimate emails, respectively.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset