Table of Contents for Hadoop with Python
Hadoop with Python
by Donald Miner, Zach Radtka
Hadoop with Python
Source Code
1. Hadoop Distributed File System (HDFS)
Overview of HDFS
Interacting with HDFS
Common File Operations
HDFS Command Reference
Snakebite
Installation
Client Library
CLI Client
Chapter Summary
2. MapReduce with Python
Data Flow
Map
Shuffle and Sort
Reduce
Hadoop Streaming
How It Works
A Python Example
mrjob
Installation
WordCount in mrjob
What Is Happening
Executing mrjob
Top Salaries
Chapter Summary
3. Pig and Python
WordCount in Pig
WordCount in Detail
Running Pig
Execution Modes
Interactive Mode
Batch Mode
Pig Latin
Statements
Loading Data
Transforming Data
Storing Data
Extending Pig with Python
Registering a UDF
A Simple Python UDF
String Manipulation
Most Recent Movies
Chapter Summary
4. Spark with Python
WordCount in PySpark
WordCount Described
PySpark
Interactive Shell
Self-Contained Applications
Resilient Distributed Datasets (RDDs)
Creating RDDs from Collections
Creating RDDs from External Sources
RDD Operations
Text Search with PySpark
Chapter Summary
5. Workflow Management with Python
Installation
Workflows
Tasks
Target
Parameters
An Example Workflow
Task.requires
Task.output
Task.run
Parameters
Execution
Hadoop Workflows
Configuration File
MapReduce in Luigi
Pig in Luigi
Chapter Summary