Chapter 3. Advanced Hadoop MapReduce Administration

In this chapter, we will cover:

  • Tuning Hadoop configurations for cluster deployments
  • Running benchmarks to verify the Hadoop installation
  • Reusing Java VMs to improve the performance
  • Fault tolerance and speculative execution
  • Debug scripts – analyzing task failures
  • Setting failure percentages and skipping bad records
  • Shared-user Hadoop clusters – using fair and other schedulers
  • Hadoop security – integrating with Kerberos
  • Using the Hadoop Tool interface

Introduction

This chapter describes how to perform advanced administration steps for your Hadoop Cluster. This chapter assumes that you have followed Chapter 1, Getting Hadoop Up and Running in a Cluster, and have installed Hadoop in a clustered or pseudo-distributed setup.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset