How it works...

In step 1, we load the dataset into a record reader object. For demonstration purposes, we use CSVRecordReader.
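As a minimal sketch of step 1, the following snippet initializes a CSVRecordReader over a CSV file and collects every record into a list. The class name RecordReaderSketch, the helper readAll, and the decision to skip one header line are assumptions made for illustration; they are not taken from the recipe itself.

```java
import org.datavec.api.records.reader.RecordReader;
import org.datavec.api.records.reader.impl.csv.CSVRecordReader;
import org.datavec.api.split.FileSplit;
import org.datavec.api.writable.Writable;

import java.io.File;
import java.util.ArrayList;
import java.util.List;

public class RecordReaderSketch {

    // Hypothetical helper: reads every record of a comma-separated file into memory.
    // linesToSkip is the number of leading lines (for example, a header row) to ignore.
    public static List<List<Writable>> readAll(File csvFile, int linesToSkip) throws Exception {
        List<List<Writable>> rows = new ArrayList<>();
        try (RecordReader reader = new CSVRecordReader(linesToSkip, ',')) {
            reader.initialize(new FileSplit(csvFile));
            while (reader.hasNext()) {
                rows.add(reader.next());
            }
        }
        return rows;
    }

    public static void main(String[] args) throws Exception {
        // args[0] is the path to a CSV file, such as the titanic.csv used in this chapter.
        List<List<Writable>> rows = readAll(new File(args[0]), 1);
        System.out.println("Loaded " + rows.size() + " records");
    }
}
```

Collecting all records in a list is only practical for datasets that fit in memory, which is the situation local execution targets.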

In step 2, the execute() method can only be used if the TransformProcess returns non-sequential data. For local execution, it is assumed that you have loaded the dataset into a RecordReader.

For the LocalTransformExecutor example, please refer to the LocalExecuteExample.java file from this source:
https://github.com/PacktPublishing/Java-Deep-Learning-Cookbook/blob/master/02_Data_Extraction_Transform_and_Loading/sourceCode/cookbook-app/src/main/java/com/javadeeplearningcookbook/app/executorexamples/LocalExecuteExample.java

To run the LocalTransformExecutor example, you need to provide the file path to titanic.csv, which is located in this chapter's GitHub directory.
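The local execution described above can be sketched as follows. This is not the LocalExecuteExample.java file from the repository; it is a reduced illustration that builds a few records in memory instead of reading titanic.csv. The schema (name, age, fare), the sample values, and the class name LocalExecuteSketch are all hypothetical. The TransformProcess only removes one column, so its output is non-sequential and execute() applies.

```java
import org.datavec.api.transform.TransformProcess;
import org.datavec.api.transform.schema.Schema;
import org.datavec.api.writable.DoubleWritable;
import org.datavec.api.writable.IntWritable;
import org.datavec.api.writable.Text;
import org.datavec.api.writable.Writable;
import org.datavec.local.transforms.LocalTransformExecutor;

import java.util.Arrays;
import java.util.List;

public class LocalExecuteSketch {

    public static List<List<Writable>> run() {
        // Hypothetical schema; the real titanic.csv has different columns.
        Schema schema = new Schema.Builder()
                .addColumnString("name")
                .addColumnInteger("age")
                .addColumnDouble("fare")
                .build();

        // A transform that keeps the data non-sequential: it only drops a column.
        TransformProcess transformProcess = new TransformProcess.Builder(schema)
                .removeColumns("name")
                .build();

        // Made-up records standing in for what a RecordReader would return.
        List<List<Writable>> input = Arrays.asList(
                Arrays.<Writable>asList(new Text("passenger-a"), new IntWritable(22), new DoubleWritable(7.25)),
                Arrays.<Writable>asList(new Text("passenger-b"), new IntWritable(38), new DoubleWritable(71.28)));

        // Runs the transform in the local JVM and returns the transformed records.
        return LocalTransformExecutor.execute(input, transformProcess);
    }

    public static void main(String[] args) {
        for (List<Writable> row : run()) {
            System.out.println(row);
        }
    }
}
```

In practice, the input list would come from iterating over an initialized RecordReader rather than from hard-coded values.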

In step 3, it is assumed that you have loaded the dataset into a JavaRDD object, since we need to execute the DataVec transform process on a Spark cluster. As in step 2, the execute() method can only be used if the TransformProcess returns non-sequential data.
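A rough sketch of the Spark variant is shown below. It assumes a local Spark master ("local[*]") purely so the snippet can run on one machine; on a real cluster the master and the way the JavaRDD is loaded would differ. As before, the schema, the sample records, and the class name SparkExecuteSketch are hypothetical.

```java
import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.JavaSparkContext;
import org.datavec.api.transform.TransformProcess;
import org.datavec.api.transform.schema.Schema;
import org.datavec.api.writable.DoubleWritable;
import org.datavec.api.writable.IntWritable;
import org.datavec.api.writable.Text;
import org.datavec.api.writable.Writable;
import org.datavec.spark.transform.SparkTransformExecutor;

import java.util.Arrays;
import java.util.List;

public class SparkExecuteSketch {

    public static List<List<Writable>> run() {
        // Hypothetical schema and non-sequential transform (drops one column).
        Schema schema = new Schema.Builder()
                .addColumnString("name")
                .addColumnInteger("age")
                .addColumnDouble("fare")
                .build();
        TransformProcess transformProcess = new TransformProcess.Builder(schema)
                .removeColumns("name")
                .build();

        // Assumption: local master for demonstration only.
        SparkConf conf = new SparkConf().setMaster("local[*]").setAppName("datavec-sketch");
        try (JavaSparkContext sc = new JavaSparkContext(conf)) {
            // Made-up records distributed as a JavaRDD.
            JavaRDD<List<Writable>> input = sc.parallelize(Arrays.asList(
                    Arrays.<Writable>asList(new Text("passenger-a"), new IntWritable(22), new DoubleWritable(7.25)),
                    Arrays.<Writable>asList(new Text("passenger-b"), new IntWritable(38), new DoubleWritable(71.28))));

            // Executes the transform on Spark and returns a transformed JavaRDD.
            JavaRDD<List<Writable>> output = SparkTransformExecutor.execute(input, transformProcess);

            // Collect while the context is still open.
            return output.collect();
        }
    }

    public static void main(String[] args) {
        for (List<Writable> row : run()) {
            System.out.println(row);
        }
    }
}
```

Calling collect() pulls every record back to the driver, which is reasonable for a small demonstration but usually not for a full dataset.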
