Pentaho for Big Data Analytics
by Feris Thia, Manoj R Patil

Table of Contents
Credits
About the Authors
About the Reviewers
www.PacktPub.com
Support files, eBooks, discount offers and more
Why Subscribe?
Free Access for Packt account holders
Preface
What this book covers
What you need for this book
Who this book is for
Conventions
Reader feedback
Customer support
Downloading the example code
Errata
Piracy
Questions
1. The Rise of Pentaho Analytics along with Big Data
Pentaho BI Suite – components
Data
Server applications
Thin Client Tools
Design tools
Edge over competitors
Summary
2. Setting Up the Ground
Pentaho BI Server and the development platform
Prerequisites/system requirements
Obtaining Pentaho BI Server (Community Edition)
The JAVA_HOME and JRE_HOME environment variables
Running Pentaho BI Server
Pentaho User Console (PUC)
Pentaho Action Sequence and solution
The JPivot component example
The message template component example
The embedded HSQLDB database server
Pentaho Marketplace
Saiku installation
Pentaho Administration Console (PAC)
Creating data connections
Summary
3. Churning Big Data with Pentaho
An overview of Big Data and Hadoop
Big Data
Hadoop
The Hadoop architecture
The Hadoop ecosystem
Hortonworks Sandbox
Pentaho Data Integration (PDI)
The Pentaho Big Data plugin configuration
Importing data to Hive
Putting a data file into HDFS
Loading data from HDFS into Hive (job orchestration)
Summary
4. Pentaho Business Analytics Tools
The business analytics life cycle
Preparing data
Preparing BI Server to work with Hive
Executing and monitoring a Hive MapReduce job
Pentaho Reporting
Data visualization and dashboard building
Creating a layout using a predefined template
Creating a data source
Creating a component
Summary
5. Visualization of Big Data
Data visualization
Data source preparation
Repopulating the nyse_stocks Hive table
Pentaho's data source integration
Consuming PDI as a CDA data source
Visualizing data using CTools
Visualizing trends using a line chart
Interactivity using a parameter
Multiple pie charts
Waterfall charts
CSS styling
Summary
A. Big Data Sets
Freebase
U.S. airline on-time performance
Amazon public data sets
B. Hadoop Setup
Hortonworks Sandbox
Setting up the Hortonworks Sandbox
Hortonworks Sandbox web administration
Transferring a file using secure FTP
Preparing Hive data
The nyse_stocks sample data
Index