How to do it...

  1. Drop noise features, that is, columns that carry no predictive signal, before training the neural network. Remove them at the schema transformation stage:
     // Drop identifier-like columns that add nothing to the prediction
     TransformProcess transformProcess = new TransformProcess.Builder(schema)
         .removeColumns("RowNumber", "CustomerId", "Surname")
         .build();
  2. Identify the missing values using the DataVec analysis API:
     DataQualityAnalysis analysis = AnalyzeLocal.analyzeQuality(schema, recordReader);
     System.out.println(analysis);

  3. Replace null values with a default value (0 here) using a schema transformation:
     // Matches every null entry in the given column
     Condition condition = new NullWritableColumnCondition("columnName");
     TransformProcess transformProcess = new TransformProcess.Builder(schema)
         .conditionalReplaceValueTransform("columnName", new IntWritable(0), condition)
         .build();
  4. Replace NaN values with a default value (0 here) using a schema transformation:
     // Matches every NaN entry in the given column
     Condition condition = new NaNColumnCondition("columnName");
     TransformProcess transformProcess = new TransformProcess.Builder(schema)
         .conditionalReplaceValueTransform("columnName", new IntWritable(0), condition)
         .build();
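
To make the effect of the steps above concrete, here is a minimal, library-free Java sketch of what they do to the data: dropping columns, counting missing entries, and substituting a default value. It is not DataVec code and does not reflect how DataVec implements these transforms. The class name `MissingValueSketch`, its method names, the `CreditScore` and `Balance` column names, and the sample values are all hypothetical and chosen only for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only: plain Java, no DataVec classes involved.
class MissingValueSketch {

    // Step 1 analogue: return the header without the named noise columns.
    static List<String> removeColumns(List<String> header, String... toRemove) {
        List<String> kept = new ArrayList<>(header);
        kept.removeAll(Arrays.asList(toRemove));
        return kept;
    }

    // Step 2 analogue: count entries that are null or NaN in one column.
    static int countMissing(Double[] column) {
        int missing = 0;
        for (Double value : column) {
            if (value == null || value.isNaN()) {
                missing++;
            }
        }
        return missing;
    }

    // Steps 3 and 4 analogue: substitute a default for null and NaN entries.
    // The input array is left untouched; a new array is returned.
    static double[] replaceMissing(Double[] column, double replacement) {
        double[] cleaned = new double[column.length];
        for (int i = 0; i < column.length; i++) {
            Double value = column[i];
            cleaned[i] = (value == null || value.isNaN()) ? replacement : value;
        }
        return cleaned;
    }

    public static void main(String[] args) {
        // Made-up header and values, for demonstration only.
        List<String> header = Arrays.asList(
                "RowNumber", "CustomerId", "Surname", "CreditScore", "Balance");
        System.out.println(removeColumns(header, "RowNumber", "CustomerId", "Surname"));
        // prints [CreditScore, Balance]

        Double[] balance = {1200.5, null, Double.NaN, 300.0};
        System.out.println(countMissing(balance));
        // prints 2
        System.out.println(Arrays.toString(replaceMissing(balance, 0.0)));
        // prints [1200.5, 0.0, 0.0, 300.0]
    }
}
```

Note that the rows themselves are kept: as in steps 3 and 4, the missing entries are replaced rather than removed, so the number of records does not change.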