There are two ways to work with the recipes in this book:
- The first is to use Databricks Community Cloud at https://community.cloud.databricks.com. It is a free notebook provided by Databricks. All the sample data for this book has also been uploaded in the Amazon Web Service S3 bucket, namely sparkcookbook.
- The second option is to use InfoObjects Big Data Sandbox, which is a virtual machine built on top of Ubuntu. This software can be downloaded from http://www.infoobjects.com.