Tuning Hadoop configurations for cluster deployments
Running benchmarks to verify the Hadoop installation
Reusing Java VMs to improve the performance
Fault tolerance and speculative execution
Debug scripts – analyzing task failures
Setting failure percentages and skipping bad records
Shared-user Hadoop clusters – using fair and other schedulers
Hadoop security – integrating with Kerberos
Using the Hadoop Tool interface
Introduction
This chapter describes how to perform advanced administration steps for your Hadoop Cluster. This chapter assumes that you have followed Chapter 1, Getting Hadoop Up and Running in a Cluster, and have installed Hadoop in a clustered or pseudo-distributed setup.