Hour 14. The ggplot2 Package for Graphics

Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Hour 14. The ggplot2 Package for Graphics

What You’ll Learn in This Hour:

Creating simple plots

Changing plot types

Control of aesthetics

Groups and panels

Themes and legend control

In Hour 13, “Graphics,” you saw how the graphics package can be used to create highly customized graphics. However, as you have seen, the graphics package can be hard work when used as an exploratory tool. To compare levels of a variable, we typically need to use “for” loops or a clever application of factors. Items such as the legend must be added manually.

The lattice and ggplot2 packages offer alternatives to the graphics package that are much easier to use for data exploration. Each has been built using Paul Murrell’s grid package, thus enabling plots to be created as objects that are then printed when required. In this hour we start by looking at the hugely popular ggplot2 package, developed (once again) by Hadley Wickham.

The Philosophy of ggplot2

The ggplot2 package was inspired by Leland Wilkinson’s book The Grammar of Graphics. The grammar of graphics philosophy breaks a graphic into a series of layers. Different layers describe the mapping of the data to plot features, the plot type, the coordinate system, and the associated scaling of plot features. To follow the grammar of graphic using ggplot2, we need just one plot function, ggplot, to which we add the required layers. Different plot types can be achieved through geometric layers, or “geoms.”

In addition to the relatively pure implementation of the grammar of graphics via the ggplot function, ggplot2 offers an additional graphical function, qplot, designed to speed up the creation of graphics by making assumptions about the layers we want to use. The existence of qplot in ggplot2 is divisive: Several vocal supporters of the grammar of graphics concept advocate scrapping qplot. However, as passionate ggplot2 supporters that use and teach the package on a daily basis, the authors of this book cannot relate to this opinion. Our clients want to be able to create powerful visualizations as quickly and easily as possible. Why would anyone want to remove a function that makes it quicker and easier to create high quality graphics?! By the end of the hour, you can decide for yourself whether you prefer the quick-and-easy approach, the true grammar of graphics, or a combination of the two. For now let’s take a look at some ggplot2 basics using the qplot function.

Quick Plots and Basic Control

The “q” in qplot stands for “quick.” The speed mainly relates to typing; the function requires a lot less typing than its ggplot counterpart. It achieves this by making assumptions; however, the function is also far more flexible than most people realize and can be used in conjunction with a layered grammar of graphics approach.

Using qplot

We have stated that qplot is quick because it makes assumptions. Thankfully there are very few assumptions, and they are all very sensible! Indeed, most of the assumptions are no different from the assumptions made by graphics functions such as plot and hist. In addition to assumptions about the coordinate system, axes, plotting character, and so on, qplot also makes an assumption about the plot type. For example, if we provide a single variable to qplot, it is assumed that we want to draw a histogram. If we provide two variables, it is assumed that we want to draw a scatter plot.

Later, you’ll see how to easily vary the plot type using qplot, but for now we start with a simple scatter plot using the mtcars data. We specify mtcars as the data frame that we are using and refer to the wt and mpg variables directly. The output is displayed in Figure 14.1.

Table of Contents for Hour 14. The ggplot2 Package for Graphics

Create new playlist

Sign In

Sign Up

Hour 14. The ggplot2 Package for Graphics

The Philosophy of ggplot2

Quick Plots and Basic Control

Using qplot

Titles and Axes

Working with Layers

Plots as Objects

Changing Plot Types

Plot Types

Combining Plot Types

Aesthetics

Control of Aesthetics

Scales and the Legend

Working with Grouped Data

Paneling (a.k.a Faceting)

Using facet_grid

Using facet_wrap

Faceting from qplot

Custom Plots

Working with ggplot

The aes Function

Working with ggplot

Where to Specify Aesthetics

Working with Multiple Data Frames

Coordinate Systems

Themes and Layout

Tweaking Individual Plots

Global Themes

Legend Layout

The ggvis Evolution

Summary

Q&A

Workshop

Quiz

Answers

Activities

Table of Contents for
Hour 14. The ggplot2 Package for Graphics