Getting ready

Let's look at an example of a term-document matrix. We are going to look at two news items about the US presidential elections.

The following are the links to the two documents:

Let's build the presidential candidate matrix out of these two news items:

Let's put this matrix in a CSV file and then put it in HDFS. We will apply SVD to this matrix and analyze the results.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset