What you need for this book

There are two ways to work with the recipes in this book:

  • The first is to use Databricks Community Cloud at https://community.cloud.databricks.com, a free notebook environment provided by Databricks. All the sample data for this book has also been uploaded to an Amazon Web Services (AWS) S3 bucket named sparkcookbook.
  • The second is to use the InfoObjects Big Data Sandbox, a virtual machine built on top of Ubuntu. It can be downloaded from http://www.infoobjects.com.