Chapter 8

Bestsellers and Blockbusters: Movies, Music, and Books

W.D. Walls,    University of Calgary, Calgary, Alberta, Canada

Abstract

This chapter is a survey on the economics of blockbusters and bestsellers in popular entertainment and culture, with a particular focus on empirical methods and applications to movies, music, and books. We first survey various conceptual models on the economics of superstars and the winner-take-all nature of product success. We then survey the range of statistical methodologies that can be used to quantify formally the winner-take-all nature of the distribution of success while estimating the correlates of success across competing titles. In separate subsections for movies, music, and books, we survey the empirical literature that relates to the blockbuster phenomenon. The final section discusses specific current topics related to the distribution of success, focusing on issues that appear to be fruitful areas for future research.

Keywords

Superstars; Blockbusters; Movies; Music; Books

JEL Classification Codes

Z10; C18

8.1 Introduction

It is an empirical regularity that winner-take-all payoff distributions are often observed, where the most successful competitors seem to reap rewards that are orders of magnitude greater than the others (Frank and Cook, 1996). This is true in sports as well as in popular culture. It is palpable in the entertainment industries. For example, consider the distribution of box office revenue across movies released in the United States in the year 2007. Figure 8.1 plots the Lorenz curve. This corresponds to a Gini coefficient of 0.653 with an estimated standard error of 0.022. The Lorenz curve for worldwide revenue is nearly identical, with a Gini coefficient of 0.664 and associated standard error of 0.023.

image

Figure 8.1 Lorenz curve for us box office revenue in 2007.

This chapter is a survey of interesting issues surrounding the economics of bestsellers and blockbusters with special reference to the markets for books, movies, and music.1 We shall consider the now-standard superstar models as well as the extent to which the transmission of information affects demand (Mizerski, 1982; Mahajan et al., 1984; Moul, 2007) and how this may contribute to the winner-take-all property where profits are highly concentrated. When demand is characterized by information feedback, the success of a title may be leveraged into further success while unsuccessful titles fail. This can lead to a distribution of payoffs across titles which have ‘heavy tails’ where the few winners earn a disproportionately large share of the total.

Statistical models relevant in the quantification of bestsellers and blockbusters will be covered in some detail. Markets with average outcomes dominated by superstar products or performers are characterized by highly asymmetric distributions in which the probability of extreme outcomes is much larger than would be implied by a normal distribution. In some cases, these distributions may have such a high level of uncertainty attached to outcomes that measures of dispersion such as the variance may not even be defined (i.e. the integral that defines the variance does not converge to a finite value). For these reasons, a number of statistical models have been applied to market data for books, movies, and music to control explicitly for these important statistical properties of these markets.

8.2 Conceptual Models: Bestsellers and Blockbusters

Understanding the nature of demand for entertainment and cultural products is essential if one is to make sense of the distribution of success across competing titles, whether they be individual books, films, or audio recordings. Early consumers of a given title may affect the choices of potential consumers. There are several variations of this sort of behavior – called herding, contagion, network effects, bandwagons, path-dependence, momentum, and information cascades – that contain the essential feature that demand depends on revealed demand. The details differ across the various models, but the important point is that the outcome of group behavior arises from the interaction of individual decision makers (Epstein and Axtell, 1996). In these models where demand is described by recursive feedback, initial differences in consumption across titles can lead to extreme differences in final outcomes.

8.2.1 Information, Demand, and Outcome Distributions

Banerjee (1992) and Bikhchandani et al. (1992) are the seminal herding papers that show how demand can be affected by previous customers’ choices. To make the process of dynamic demand more transparent we will follow the intuition set out in De Vany and Walls (1996). They assume for simplicity a single title that can be consumed sequentially (and not simultaneously) by a population of n potential customers. The particular sequence in which consumers make their choices is random and they have a common prior belief in the quality of the title. Denote the probability that a randomly chosen demander will choose to purchase and consume the title p. Letting X denote the number of people choosing to consume the title, then X is a binomial random variable; it follows that when consumers share a common prior over quality that the distribution of a title’s sales would be binomial. However, when quality is unknown and priors over quality differ among potential customers, p is a random variable. By conditioning on p and integrating over the binomial distribution, each of the n + 1 possible outcomes is equally likely; adding uncertainty to the priors transforms the distribution of revenue from the binomial to the uniform distribution.

Now consider information sharing where potential consumers can use information revealed over time to refine their prior on a title’s quality.2 De Vany and Walls (1996) let the distribution of customers over screens be multinomial uniform, so the search problem – a search for quality with an unknown distribution – is similar to the search for price with an unknown distribution. Customers who do not know the distribution begin with a uniform prior and update the distribution sequentially. The result of this process is the Bose–Einstein distribution, which has the property that all of the possible outcome vectors are equally likely! This means the vector in which consumption of every title is equal to 0 is as likely as one in which all n trials go to only one title and every other vector is equally likely (Feller, 1957). The Bose–Einstein distribution has uniform mass over a space of possible outcome vectors, including the outcome where consumers choose to purchase nothing.

What is important about the evolution of choice probabilities under the Bose–Einstein choice logic is the way past successes are leveraged into future successes: as soon as individual differences emerge among the films, they are compounded by information feedback through the run into very large differences. Substantial promotional activity with large supply can produce large and rapidly growing sales, but it also can lead to early failure if the large number of consumers transmit negative information. As customers sequentially select titles, the probability that a given customer selects a particular title is proportional to the fraction of previous customers who selected it. This leveraging of success results when probabilities are not known and sampling reveals information that causes previous selections to attract subsequent selections.

There are several additional interesting issues that can be explored in the context of models of information sharing. For example, Chamley and Gale (1994) have an endogenous element in their model where consumers may wait to consume to see what others think first; this has a particular appeal to the extent that consuming a title is an investment rather than pure consumption, and in this context the paper is also related to the Ottaviani and Sorensen (2000) paper on herding and investment. Avery et al. (1999) also present a model in which the order of the choice sequence is endogenous. Sgroi (2002) investigates the strategy of focusing effort on early consumers. Smith and Sorensen (2000) examine confounded learning in a herding context as an alternative to the rapid success or failure in the older herding models. De Vany and Lee (2001) investigated the role of information in a simulation model, and showed that even with noisy information better movies usually win and that payoffs have the winner-take-all or heavy tails property. De Vany and Walls (1996) showed that box office revenues have a contagion-like property where the week-to-week change in demand is stochastically dependent on previous demand.

8.2.2 The Roles of Talent and Luck in Success

Rosen (1981) and Adler (1985) provide the now-classic analyses of superstardom. Adler (2006) in volume 1 of this Handbook provides a survey of superstar models in relation to talent. In these models there are a few ways that the talent of a performer can be related to whether or not the performer’s output becomes a bestseller or blockbuster.

In Rosen’s (1981) model consumer demand is characterized by substitution – so consumers will pay more for higher quality services or will substitute quantity for quality. Also, there are known differences of talent across performers and performers compete with one another on two margins: other performers may be equally talented or other performers maybe become substitutes at a given price or type of venue. Rosen simplifies things by assuming that quality does not deteriorate with quantity provided, but even in this simplified case we see that total revenue will increase with quality because both price and quantity provided increase with talent. Thus, in the Rosen model total revenue increases in talent more than proportionally and implies that revenues will be concentrated on the most talented people. Rosen further shows that if the services provided by more talented sellers are less affected by competition from other sellers, then even if there is more than one provider in the market it will still be the case that higher quality providers service larger crowds and the skewed distribution of success across performers will continue to hold.

Adler (2006) discusses competing models that relate talent to the distribution of success. In particular, he considers the model of MacDonald (1988) in which talent is defined in a different way. MacDonald, instead of focusing on the certain talent of a performer, focuses on the volatility of a performer’s talent. In his model, each performer can deliver good and bad performances, and the difference in talent is the probability that a particular performance will be a good one. Consumers value talent such that the probability of a good performance is higher for experienced performers than for inexperienced performers because only good performers will survive. Through this sort of selection, performers with a record of success will command higher ticket prices and supply larger audiences. In these circumstances, artists of equal talent do equally well.

Adler (1985) also describes another alternative model that can explain the emergence of a superstar performer. His model differs from the Rosen model because the star emerges from among several equally talented performers due to pure luck. The process is as follows. Consumers select a performer randomly in their purchase decisions, so it is only by luck that one performer supplies more consumers than another performer. Luck makes the superstar performer the most popular and because consumers have a preference for popular performers this initial advantage, which is purely the result of chance, is leveraged into further success. Adler notes that if the performers are knowledgeable of consumers’ selection mechanism, they may try to influence their perceived popularity rather than leaving it up to nature’s lottery.

8.2.3 Innovation Tournaments

De Vany and Walls (1997) propose using a tournament model as a way of understanding competition in the movie market. In their depiction of the movie tournament, each film must earn a critical level of box office revenue to survive to the next stage of competition. If films fail to earn this threshold level of revenue, exhibitors will choose to drop the film and replace it with another film that has a higher level of expected revenue. The revenue threshold will change in response to audience changes, over the course of a film’s theatrical lifetime, and as other competing films enter the tournament.

Books, movies, and music can all be modeled as if they are competing for the top positions in a tournament. In fact, weekly lists of sales rankings are reported in precisely this way, sometimes on the evening news. Suppose there are m(t) > 10 titles competing at time t for these 10 tournament positions. Rank sales revenues at time t from high to low in the m-dimensional vector r =  (r1(t), r2(t), …, rm(t)), where low numerical ranks indicate high sales revenues. The threshold revenue that a title must earn to survive from week t to t + 1 is a random variable C(t) = r10(t). A title i in release at time t survives if its revenue exceeds the critical level, ri(t) ≥ C(t), and it is eliminated from competition otherwise. Every title in the top 10 sample meets this condition by definition. When a title does not meet the condition, it is dropped from the sample to be replaced by a competing title. A title may move up or down in rank during the course of its lifetime. The length of time that a title remains within each rank is a random variable.

If a title gains in popularity its rank will increase over time, but its revenues eventually must fall as it exhausts its potential audience. As it declines in rank, a title becomes vulnerable to newly released competitors and we might expect that its occupancy time in rank becomes more brief. This intuition of an innovation tournament where new titles enter the tournament as older titles are dropped corresponds well with statistical survival models as discussed toward the end of Section 8.3.

8.3 Empirical Models: Bestsellers and Blockbusters

A variety of alternative statistical models have been used to quantify the distribution of success across competing titles. These statistical models either attempt to test directly the refutable implications (if there are any) of conceptual models or they explicitly account for the winner-take-all nature of the distribution of success while possibly quantifying the correlates of success.

8.3.1 The Pareto Model (and Departures From It)

Now we demonstrate the observable implications of various types of demand dynamics discussed above on the distribution of sales across titles. This provides a correspondence between the conceptualization of demand and the testable implications to be confronted with data in the empirical analysis. The idea here is to examine the distribution of success across products and to test competing characterizations of the demand process. The approach taken here is discussed at some length by Brock (1999), who argues that empirical regularities in the form of scaling laws, such as the Pareto law, may be useful in providing information on the process that is generating the data.3 The Pareto law as used to characterize the distribution of success – across competing books, films, or songs – is similar to recent work in econophysics (Rosser, 2008).

Many empirical regularities in economics (as well as in the physical world) follows power laws (Gabaix, 2008). For example, Steindl (1965) found that the size of a firm S is systematically related to its rank R in an industry according to the Pareto law S × Rβ = A, where β and A are constants. In the context of entertainment media consumption, titles are naturally ranked each week by their revenues so the Pareto law implies the following log-linear relationship between weekly revenue and rank: log Revenue = log A − β log Rank. Not only is the Pareto law shown above an empirical regularity for several economic quantities – income, firm size, financial returns – but Simon (1955) showed that it can be derived analytically assuming that the growth rate of revenues is independent of size and that there is a constant rate of entry for new titles. What this means is that in the absence of demand feedback the Pareto law would describe the distribution of outcomes.

The actual empirical distribution can differ from the Pareto law defined in the linear relationship between log Revenue and log Rank, and it is in this way that we can make inference on feedback in the demand process. In studying the distribution of firm sizes, Ijiri and Simon (1974) found that empirical distributions often deviated from the Pareto law: instead of Revenue and Rank following a log-linear relationship as implied by the Pareto law they found that the plot had curvature.4 In markets for competing entertainment titles we can interpret deviations from the Pareto law in the following way: an increase in revenues affects future growth through either (i) the information sharing between those individuals who consumed a title and potential purchasers, or (ii) herd- or contagion-like behavior on the part of potential purchasers. Regardless of the behavioral cause of the information feedback onto demand, the effect of the demand increase will ultimately diminish over time due to the saturation of the potential audience and the entry of new competing titles.

In practice one can test for the presence of positive or negative information feedback in the demand process by quantifying the curvature of the Pareto relationship by adding a quadratic term in rank to capture the non-linearity as suggested by Ijiri and Simon (1974): log Revenue = log A − β log Rank + γ(log Rank).2 A non-zero value of the parameter γ indicates a deviation from the Pareto law and the sign of this coefficient indicates whether the information feedback is positively or negatively associated with growth in demand. Finding curvature away from the Pareto law such that γ > 0 is an indication of autocorrelated growth in revenues, and this is the observable implication of the increasing returns that can cause some titles to become ‘hits’ and others to become ‘bombs’ through information feedback. Finding that γ < 0 would indicate negative information feedback (or negatively autocorrelated growth) or that titles that did well last period will have falling demand this period. This test for information feedback in demand does not permit us to distinguish between the competing explanations of the source of the increasing returns, but it does permit us to test the hypothesis of information feedback using data generated in actual markets. It has been employed in several recent studies of books, music, the theatrical movie market as well as the market for DVDs. We will discuss these empirical studies in Section 8.4.

8.3.2 Stretched Exponentials

Concavity in log-log plots of size against rank, also known as a parabolic power law, are interpreted as evidence of increasing returns to information in the demand for movies (De Vany and Walls, 1996; Walls, 1997; Hand, 2001). Frisch and Sornette (1997) propose a multiplicative stochastic process that can explain the deviation of the data relative to a power law distribution. LaHerrere and Sornette (1998) show the stretched exponential distribution accounts for many deviations from power law distributions in both the physical and the social sciences. Sornette (1998) provides rigorous technical details on multiplicative processes leading to power laws and stretched exponentials. Let the probability density function of independent and identically distributed (IID) random variables m be p(m). Now consider the product Xn = m1m2mn, which has probability density function image for X → ∞ and finite n. From the relation Xn+1 = Xnxn+1 we can write the equation for the density function of Xn+1 in terms of the density functions of xn+1 and Xn, and derive the probability density function by the formal application of the Laplace method (Frisch and Sornette, 1997). The tail of Pn(X) is controlled by the realizations where all terms in the product are of the same order, so that Pn(X) is to a first approximation the product of the probability density functions of the n terms in the product m1mn. When the density function p(·) is exponential, then the density function of the product is a stretched exponential for large n and follows the Weibull cumulative distribution given by image. When the exponent α in the Weibull distribution is unity, it is the exponential distribution; when the exponent is less than unity, the tails are heavier than exponential tails and the distribution is often referred to as the stretched exponential.

8.3.3 Black Swans and the Lévy–Stable Model

Taleb (2007), in his bestseller The Black Swan, emphasizes the importance of high-impact low-probability events. He is critical of the Gaussian distribution as a model to predict future events based on a sample of data. Taleb’s black swan idea is not purely statistical, but from our perspective we can view it as arguing that distributions of success outcomes have heavier-than-normal tails. We now discuss some particular models that can be useful in characterizing black swan outcomes.

8.3.3.1 Unconditional Lévy–Stable Model

Many observed quantities that are summations of a large number of terms are characterized by asymmetry and heavier-than-normal tails.5 Mandelbrot (1963a,b, 1967) and Fama (1965) generated academic interest among economics and finance scholars in the use of stable laws; interest in stable models waned in subsequent years, at least partially due to the computational complexities of using the distribution. Recently, though, there has been a resurgence of interest in the application of stable distributions in both physical and social science settings due to the important contributions of Zolotarev (1986), Samorodnitsky and Taqqu (1994), and Nolan (1998a,b, 1999, 2001, 2005, 2011).6 De Vany and Walls (2004) contend that the stable distribution is the correct model for movie industry outcomes due to its consistency with the empirical regularities of the industry’s data and its statistical foundation on the general central limit theorem.7 The theoretical reason to think a stable distribution might apply to movies is because Mandelbrot (1963a) showed that a dynamic process that is stable under choice, mixture, and aggregation converges in distribution to the stable distribution. If revenues and costs are discrete time processes with stable increments, then profit will converge to a stable distribution. The limits of a stable process form a generalized central limit theorem that states that the probability distribution of sums of IID terms is stable.

The probability density function for the stable distribution cannot in general be expressed explicitly; instead it is most commonly specified by its characteristic function. The literature contains numerous alternative parameterizations of the stable distribution as some are at times more convenient than others.8 For convenience in exposition, and with no loss in generality, we use a parameterization that is a variation of that given by Zolotarev (1986): X ∼ S(α, β, γ, δ; 0). The index of stability or characteristic exponent is α ∈  (0,2]; in financial data and in film industry data it is typical to find that 1 < α < 2, in which case the mean of the distribution is finite and the variance is infinite. Lower values of α correspond to a distribution with a higher peak around the center and more probability mass in the tails. The skewness coefficient β ∈ [−1, 1] is a measure of the asymmetry of the distribution; the sign and magnitude of β indicate the direction and extent of skewness, respectively. The scale parameter γ > 0 generalizes the standard deviation and it expands or contracts the distribution about the location parameter δ; in the stable model γα is analogous to the variance.

Stable distributions are closed under summation – a particularly useful property in financial modeling applications – so linear combinations of independent stable variates with index α also have the same index α. The stable distribution is the limiting distribution of all stable processes so that it contains the other three well-known stable distributions, as special cases: the Cauchy distribution results when α = 1 and β = 0, the Lévy distribution results when α = 0.5 and β = ±1, and finally the normal distribution is a special case of the stable distribution when α = 2. As the characteristic exponent α approaches 2, the skewness coefficient β has less impact on the shape of the distribution and when α = 2 the distribution collapses to the normal distribution, having only two parameters – location and scale – corresponding to mean and variance.

An interesting property of the stable Paretian distribution discussed above is that conditional expectation does not converge. The tails of stable distributions are Paretian and the conditional probability that x ≥ x0 is P[x > x0] =  (x0/x)α. The conditional mean, given that x > x0 equals image. Since α is a constant, the conditional expected value of profit depends linearly on x0. Conditional on having earned a profit, the expected profit continues to rise with current profit and this does not end as the movie earns more profit. This is not paradoxical because titles that make it into the upper tail of the profit distribution have been selected from among their competitors. The heavy tails of the stable distribution imply that probability does not decline rapidly enough for the conditional expectation to converge. For the Gaussian or log-Gaussian distributions, the conditional expectation converges to a constant as the conditioning event increases. The linear conditional expectation of the Paretian distribution means that blockbuster movies that have already attained high profit have an expectation of even higher profit and this prospect does not diminish as profit grows. This captures the idea of demand momentum.

8.3.3.2 Lévy–Stable Model with Covariates

The parameters of the stable distribution can be modeled in terms of covariates to provide a very flexible regression model of which classical least-squares regression is a special case. For example, if we model the location δ of the distribution in terms of a covariate, let the dispersion be a constant to be estimated, and set the skewness equal to 0 and the tail weight equal to 2, then we have the classical normal linear regression model. In this case the interpretation of the regression coefficient for the location parameter is the same as it would be for a linear regression model since the mean, median, and mode are equal. However, in general, the location of the stable distribution differs from the mean and median due to skewness. In this case the location regression parameters are essentially changing the mode of the distribution through the covariates rather than the mean or median.

Dispersion, skewness, and tail exponent parameters can also depend upon covariates. The values of these parameters is restricted as set out above, with dispersion being positive, skewness lying in the interval between image and +1, and the tail exponent lying between 0 and 2. To ensure these constraints are satisfied when estimating the parameters in terms of covariates, the estimation uses linking functions defined by Lambert and Lindsey (1999) to relate the stable parameters to the covariates. For example, if we let xi represent a covariate on observation i, the linking function for the dispersion parameter is ln(γi) = b0 + b1xi, the linking function for the skewness parameter is ln[(1 + βi)/(1 − βi)] = c0 + c1xi, and the linking function for the index of stability is ln[αi/(2 − αi)] = d0 + d1 xi. Maximum likelihood can be used to estimate the parameters.

8.3.4 Skew-Normal and Skew-t Models

Several statistical models have been proposed to account for the non-normal distribution of data from the physical and social sciences. The stable Paretian model was proposed by Mandelbrot (1963a,b) to account for the heavy tails of financial data; other authors have suggested the Student-t distribution with the appropriate degrees of freedom as a natural alternative to the normal distribution. The multivariate Student-t distribution permits regression analysis of film returns allowing for heavy tails, but it does not allow for asymmetry. Several recent empirical papers in finance have employed various statistical models to capture the skewness and heavy tails in the distribution of financial returns using non-standard distributions that make it difficult for applied researchers to condition the distribution on a vector of explanatory variables.9 The logarithmic versions of the skew-normal and skew-t distributions have been proposed as models of sales or revenue that explicitly account for skewness, and skewness and heavy tails, respectively, in a multivariate regression-like framework of analysis that is familiar to financial economists and other social scientists.

8.3.4.1 The Skew-Normal Distribution

Azzalini (1985, 1986) defines a continuous random variable Z to have a skew-normal distribution, denoted SN(0, 1, α), if it has density function 2φ(z)Φ(αz) where φ and Φ denote the density and distribution functions, respectively, of a standard normal N(0, 1) variate. The skew-normal distribution is essentially a normal distribution that has been augmented by the addition of a shape parameter α ∈ (−∞, +∞) that quantifies the skewness of the distribution; when α = 0, the skew-normal distribution simplifies to the standard normal distribution. In our empirical application we analyze the distribution of the variable y =ξ + ωz where ξ is the location parameter and ω > 0 is the scale parameter. Azzalini and Dalla Valle (1996) and Azzalini and Capitanio (1999) formally derive the statistical properties of the multivariate skew-normal distribution.

8.3.4.2 The Skew-t Distribution

The skew-normal distribution can naturally be extended to obtain the skew-t distribution to account for heavier-than-normal tails. The connection between the skew-normal and skew-t distributions is analogous to that between the normal and Student-t distributions (Hogg and Craig, 1978). The standard skew-t distribution is obtained by considering the transformed variable image, where v is distributed χ2 with df degrees of freedom and is statistically independent of z. In empirical applications, the distribution of the variable image is analyzed. Azzalini and Capitanio (2003) provide a formal detailed treatment of the properties of the skew-t distribution, including its multivariate extension.

The skew-normal and skew-t distributions can be fit using their log-transformed versions. These are referred to as log-skew-normal and log-skew-t distributions and they are related to the distributions described above in the same way that the log-normal distribution corresponds to the normal distribution.

8.3.4.3 Skew-Normal and Skew-t Regression Analysis

Using the skew-normal and skew-t distributions, the distribution of movie revenues conditional on a vector of title attributes can be fit in a regression-like framework. The distribution of sales or revenue conditional on a vector of explanatory variables can be fit in the form of a logarithmic linear regression log Revenuei = β0 + β1Xi + μi where i indexes individual titles; Xi is a vector of title-specific attributes; and the random disturbance μi follows a normal, skew-normal, or skew-t distribution depending on the model being estimated. This basic linear regression equation for cinema box office revenue has been employed by several previous researchers.10 Estimation of the log-linear specification allows the parameters to be interpreted as elasticities.

8.3.5 Non-Parametric Kernel Regression

All of the techniques listed above – even the ones that use distributions accounting for skewness, heavy tails, and infinite variance – assume a functional form for the basic regression model. The validity of each model’s statistical tests and the resulting conclusions are predicated on the regression equation being properly specified. To avoid the model specification problem entirely, one could use non-parametric kernel regression. The distinguishing feature of non-parametric regression, in contrast to a parametric model, is that little or no a priori knowledge is assumed about the form of the true function which is being estimated. The function is still modeled using an equation containing free parameters, but in a way that allows an extremely broad class of functions to be represented. There are different types of non-parametric models, including neural networks. Non-parametric kernel regression is a powerful statistical technique suited to this purpose that can readily be added to the toolkit of applied researchers. The non-parametric kernel regression model used in several movie papers is based on the earlier work of Fan and Gijbels (1992, 1996) on local linear non-parametric regression that has been supplemented by cross-validated bandwidth selection and recent advances in generalized kernel estimation (Li and Racine, 2004; Racine and Li, 2004).

8.3.6 Other Tools and Models

8.3.6.1 Quantile Regression

Instead of modeling the mean of the log Revenue distribution as is done in least-squares regressions, it may be more illuminating to model various quantiles of the Revenue distribution since the effects of title attributes may differ for the various attributes of the probability distribution. This can reveal how the explanatory variables associated with the blockbuster strategy alter the shape of the revenue distribution at various quantiles (De Vany and Walls, 2005).

8.3.6.2 Discrete Choice Models

A number of studies have chosen to use discrete choice models to quantify the chance that a particular product becomes successful.11 We have seen that in many of the statistical models heavy tails lead to infinite variance. This means that forecasts have zero precision because they have infinite variance attached to them. However, it has been suggested that one can still apply standard binary choice models, such as a logit or probit, because these models are not affected by the infinite variance of the outcome distribution. So even though one cannot model with precision the value of expected sales or revenue, one can make probability statements about whether or not it will exceed a given threshold value.

8.3.6.3 Survival Models

The analysis of product survival is important in entertainment markets. For theatrical performances, the length of a product’s lifetime in supply is the most common way to adjust supply. The other margin of supply adjustment is the number of copies of the product, such as the number of cinemas playing a film or the number of showings per day. For products that are not live performances, such as DVDs for movies, book and music sales, the relevant duration of survival can refer to the time a title appears on a top-selling list, such as Variety’s list for films, the Billboard chart for music, or the New York Times list of best-selling books. Survival models correspond intuitively to elimination tournaments as discussed in Section 8.2.3. The temporal pattern of sales or revenue, and the duration from a title’s birth until its death, will provide a great deal of useful information about the dynamics of demand and supply.

The life cycle of a title can be modeled as a pure birth-death process in a system that is time-dependent. Define the survival time of a title as the time interval from birth until death. This time interval is a random variable τ with distribution function F(t) = Prob (τ ≤ t). The survivor function is defined as the probability that a movie is still alive at time t and is denoted by R(t) = 1 − F(t). The probability that a title alive at time t will fail prior to time τ is the conditional distribution F(τentτ > t). Given that the title is alive at time t, the probability that it will die between t and t + dt is given by the product f(τentτ ≥ t)dt. At time (τ = t) the conditional density is a function of t alone and is known as the hazard rate, the instantaneous rate of failure for each time image, where r(t) = R′(t) is the density of the survivor function. The hazard rate representation of the survival model has a natural interpretation: it is the probability that a title will fall from the top-selling charts during a time interval given that it is on the charts at the beginning of the interval.

8.4 A Selective Survey of Empirical Findings

This section discusses recent empirical literature on books, movies, and music that relates specifically to the phenomenon of bestsellers and blockbusters. This is not a survey of all empirical papers. Instead, we focus on those that have something to say about the distribution of success across titles. Most empirical papers simply model the expected value of product success while treating the distribution of outcomes as symmetric with exponentially declining tails.

8.4.1 Movies

Recent surveys of the economics of the movie industry are given by Chisholm (2003), in Volume 1 of this Handbook by De Vany (2006), and Walls (2008). A survey with a more management-related focus is provided by Hadida (2009) and Eliashberg et al. (2006) provide a marketing-related survey. A very well-written and thorough survey of empirical work on the economics of the movie industry is provided in McKenzie (2012). Here, we shall focus specifically on empirical findings that relate to the bestsellers and blockbusters.

Many papers on the movie industry have focused on the types of demand processes that are consistent with the observed distribution of product success. De Vany and Walls (1996), Walls (1997), Hand (2001), and Sornette (2002) look at the importance of blockbusters when tails follow a power law. De Vany and Walls (1996) provide evidence of maximal uncertainty in movie outcomes resulting from the way that demand feeds back on itself. The empirical evidence in their paper is based primarily on departures from the Pareto size distribution as discussed in Section 8.3.1: concavity in log-log plots of size against rank, which is known as a parabolic power law, are interpreted as evidence of increasing returns to information in the demand for movies. Numerous authors, including Walls (1997), Hand (2001), Maddison (2004), Akdede and Ogus (2006), and others, have applied the empirical framework from the De Vany and Walls (1996) paper to other samples of data and have confirmed their results of positive information feedback in the movie industry. All these papers base their results on deviations from the Pareto size distribution. The linkage to information feedback is indirect instead of being structural. One can interpret that as a primary weakness to the approach of identifying the process that generated the data by examination of the outcome distribution. De Vany and Lee (2001) investigated the role of information in a simulation model and showed that even with noisy information better movies usually win and that payoffs have the winner-take-all or heavy tails property.

The stretched exponential distribution has also been applied to movie success. Walls (2005a) finds strong evidence that the distribution of movie outcomes does not follow a power law or a even a parabolic power law as proposed by the papers discussed in the paragraph above. He presents empirical evidence from North American box office revenue showing that the stretched exponential distribution provides a statistically superior fit. The stretched exponential distribution does not truncate the upper tail in its estimates of the probability of a movie earning a larger amount than previous movies. The distribution also accounts for the deviation from the strict Pareto power law in a way that does not place artificial restrictions on the possibility that a movie can earn far more than our experience suggests. This is consistent with a model of consumer demand where shocks are multiplicative and there is decay in information transmission that leads to local audience saturation.

In related work, De Vany and Walls (1999, 2002) fit the Pareto distribution – not the rank-order size distribution – and find evidence of infinite variance in the outcome distribution. Sinha and Raghavendra (2004) find evidence of long tails using weeks in top 60, opening week gross, and cumulative lifetimes in theatrical exhibition. These empirical results do not correspond directly to a behavioral model in the way that deviations from a Pareto size distribution are consistent with increasing returns to information. Instead the Pareto model allows the possibility of heavy tails and the possibility of infinite variance. The De Vany and Walls (1999, 2002) papers find evidence of a heavy upper tail in the outcome distribution as well as evidence of infinite variance. This is interpreted as evidence of the ‘nobody knows’ principle made famous by screenwriter William Goldman (1983) and set out academically by Caves (2000). This line of research on the distribution of film outcomes is further developed by De Vany and Walls (2004) and McKenzie (2008). These researchers use the unconditional Lévy-stable model set out in Section 8.3.3. These results generalize the earlier studies that fit the Pareto distribution, which only models the upper tail of film outcomes. The unconditional Lévy-stable model captures the entire distribution, including skewness, kurtosis, and heavy tails that in the limit follow a Pareto distribution. These studies further confirm the uncertainty surrounding film success in that the variance is infinite.

Many empirical studies, including Wallace et al. (1993), Ravid (1999), Albert (1998), Sedgwick and Pokorny (1999), Albert (1999), Collins et al. (2002), among others, quantify the correlates of high-grossing films in the context of standard statistical models; a thorough discussion of these models and their results relating to the use of marquee talent or stars, genres, ratings, awards, and reviews is provided in McKenzie (2012). These sort of models are useful in that they permit straightforward examination of the impact of various explanatory variables on a film’s box office revenue or rate of return. However, these models fail to capture the important statistical properties, such as heavy tails, skewness, and possibly infinite variance, that characterize the film industry. In an effort to bring covariates to the Lévy-stable analysis, Walls (2005b) applied McCulloch’s (1998) symmetric-stable regression model to film returns and the empirical results confirmed that the variance of film returns was infinite even when conditioning on a vector of covariates; however, a major shortcoming of that model is that it imposes symmetry. Walls (2005c) applies the skew-Student-t regression model to film returns to accommodate skewness and heavy tails; although the model is ad hoc, it does accommodate skewness and heavy tails.

Walls (2010a) estimates the generalized skew-stable model set out in Section 8.3.3. That model quantifies the effects of budgets and stars on film outcomes while allowing for skewness, heavy tails, and infinite variance. The empirical results indicate that the effect of large budgets and marquee stars is to reduce the risk that a film earns very low rates of return while increasing the most probable rate of return. This is consistent with Ravid's (2004) and Basuroy and Ravid's (2004) hypothesis that the objective of studio executives is not to maximize expected returns, but instead to make projects that are unlikely to fail. The findings are also consistent with quantile regressions reported by De Vany and Walls (2005) showing that big budgets and stars increased the lower quantiles of film revenues, but that they did not increase the upper quantiles. The general stable regression model is revealing because it identifies how the shape of the distribution is changed. The skew-stable regression analysis also confirms that the distribution of film returns – even when conditioning on a film’s attributes – has infinite variance so point predictions are not very useful. Predictions of the conditional expectation of film returns have infinite variance even though one can estimate a least-squares regression and obtain estimates that ignore this (e.g. Litman, 1983; Litman, 1990, Litman and Ahn, 1998). A pragmatic approach suggested by De Vany and Walls (1999) and applied by Collins et al. (2002) is to estimate the probability that returns exceed a given threshold using discrete choice analysis. While this approach can be used, it is preferable to make use of all the sample information by modeling the entire distribution of returns using the skew-stable model rather than modeling a discrete probability. The general stable model does permit the conditional distribution of film returns to be quantified and it can be used to make probability statements that may be useful in practice (e.g. pricing contingent claims on a film’s stream of future returns).

An alternative to the skew-stable model is non-parametric regression in which the precise functional form need not be specified explicitly. Walls (2009a,b) applies non-parametric regression with data-driven bandwidth selection to model movie revenues and movie profitability, respectively. It is reported that the non-parametric regression model fits the data far better than the logarithmic regression model employed by most applied researchers and that one can reject the hypothesis that the log-linear model is correctly specified. The non-parametric regression estimates of revenue response to production budget differ from those obtained by other researchers. For example, Walls (2005c) using the skew-Student-t model reports an elasticity of box office revenue with respect to budget of 0.75; Walls (2010a) using the symmetric stable regression model reports a budget elasticity of 0.81; De Vany and Walls (2005) use a log-linear regression and quantile regression to report budget elasticities of 0.55 and 0.56, respectively; Basuroy and Ravid (2004) use a log-linear model to estimate budget elasticities of 1.44; Ravid (1999) estimates a budget elasticity of 1.35 and reports further that it is the most significant determinant of domestic box office revenue; and the Litman and Ahn (1998) results imply an elasticity of 0.23.12 The estimates of revenue response to opening screens obtained from the non-parametric kernel regression also differ from those obtained by other researchers. Walls (2005c) using the skew-Student-t model reports an elasticity of box office revenue with respect to opening screens of 0.273; Walls (2010a) using the symmetric stable regression model reports a budget elasticity of 0.313; De Vany and Walls (2005) use a log-linear regression and quantile regression to report budget elasticities of 0.205 and 0.234, respectively; Litman and Ahn (1998) estimates imply an elasticity of box office gross with respect to opening screens of 0.65.13

McKenzie (2010) and Walls (2010b) have in independent studies using different datasets examined the distribution of DVD sales. McKenzie (2010) found that the DVD revenue distribution had thicker tails than the theatrical revenue distribution; this implies that the top-ranked DVDs earned a greater share of revenues than their theatrical counterparts. His comparison of revenues shows not only a high degree of correlation between the two markets, but a relationship that is non-linear and increasing at higher theatrical revenue levels. This is interpreted as being consistent with a word-of-mouth momentum effect and more institutional flexibility in the DVD market. Walls (2010b) finds that the DVD market differs markedly from the theatrical market in the dynamics of demand and in the distribution of payoffs.14 Where the theatrical market has strong information feedback as evidenced by autocorrelated growth in weekly revenues, Walls (2010b) finds no such evidence of information feedback in the analysis of weekly DVD revenue. While the theatrical movie market has a clear winner-take-all character where the few high-grossing films are improbably large to have been drawn from a (log) normal probability model, he finds no evidence of heavier-than-normal tails in the distribution of cumulative revenues over DVD titles. The DVD market appears to be much less uncertain than the theatrical market for movies and it is populated by a more diverse set of competitors, including not only theatrically released movies, but also films released directly to DVD in addition to a wide variety of repackaged programming originally broadcast on television and cable TV. Competition at the box office leads to a winner-take-all distribution of prizes while competition for DVD sales has a much less skewed, though not entirely normal, distribution of payoffs. These findings are complementary to those of Jozefowicz et al. (2008) who found that word-of-mouth, critical reviews, and other structural factors that would create positive information feedback did not have any systematic impact on DVD rentals.

De Vany and Walls (1997) model film survival on the Variety’s top 50 chart. Each film in their sample was tracked from its birth to its death, where birth is defined as the beginning of a film’s theatrical run and death was defined as falling off the top 50 chart. Modeling film lifetimes is important because lengthening the theatrical run of a film is the most common way to adjust supply to evolving demand and because long lifetimes are very important in creating high-grossing films. They estimated the survival model set out in Section 8.3.6, with the vector of explanatory variables including the number of first run bookings, the week’s revenue, the number of weeks the film previously had been in the top 50, the film’s rank, and the number of screens in which the film played in its initial theatrical release. The empirical results indicated that the number of first-run screens, revenue, and the number of weeks in which the film previously was in the top 50 increase significantly the survival time of a movie. For a re-released film, the number of weeks it ran in its previous release is a measure of its entertainment value that carries over to the subsequent theatrical runs. This can be interpreted as further evidence of information transmission and memory. Walls (1998) models movie lifetimes using a sample of data from Hong Kong. McKenzie (2009b) also uses a survival analysis to analyze life length on Australian box office charts. He finds, contrary to several earlier studies, that production budget, opening screens, and the use of marquee star talent are not important determinants of life length. Simonoff and Ma (2003) model the hazard rate of Broadway shows to quantify the correlates of survival, and find that the type of show, whether a show is a revival or not, and opening-week attendance are strongly related to show survival. They also find mixed evidence regarding critic reviews: winning major Tony Awards is associated with longer runs, but being nominated and then losing is associated with shorter post-award runs. Maddison (2004) and Akdede and Ogus (2006) also examine the lifetimes of live performances and find results consistent with those for movie lifetimes.

8.4.2 Music

Connolly and Krueger (2006) in Volume 1 of this Handbook provide a thorough overview of the institutions and empirics on the economics of the popular music business. Throsby (2002) provides an empirical overview of the global music industry.

Hamlen (1991, 1994) tests superstar models using voice quality and finds some evidence that greater talent received greater rewards, but that differences in talent far exceed differences in reward. Chung and Cox (1994) model sales as a dynamic demand process where customers’ demand is related to prior-period demand and this provides some support for the type of model discussed in Section 8.2.2 where previous success is leveraged into future success; this supports Adler’s (1985) model to the extent that initial advantages may determine future success more than differences in talent. Giles (2006) and Spierdijk and Voorneveld (2009) both provide updates to the Chung and Cox (1994) paper, finding that the statistical model – the Yule distribution – does not provide a very good fit for the upper tail of the distribution, which implies that the model does not provide a good description of superstardom.

Giles (2007b) models durations of #1 Billboard chart hits and Giles (2007a) uses data from the Billboard Hot 100 chart and finds evidence of increasing returns to information in the US market for popular music. These results are consistent with the large number of results reported for the movie industry in Section 8.4.1. Davies (2002) examines the lifetime total success of bands measured by the total number of weeks they were in the weekly top 75 list of best-selling recordings and finds evidence that the individual success of musicians follows a stretched exponential distribution. Strobl and Tucker (2000) find that chart success is substantially skewed to the right for UK pop music using data from the 1980–1993 British rankings charts. Success is substantially skewed to the right, whether measured by total weeks per artist, average weeks per album, or the total number of albums per artist. The determinants of chart survival are also examined, and these indicate that the type of album and initial popularity are important in album survival.

Pitt (2010) examines the distribution of royalty income, explicitly modeling skewness and heavy tails using the skew-normal and skew-t distributions in a regression framework. Pitt finds strong evidence of superstar effects, in that average royalty payments are dominated by the values found in the upper tail and a small number of performers earned a very large share of the total royalty payments from blockbuster hits.

8.4.3 Books

General background on the economics of books, including institutions and empirics for the book publishing industry, are provided in Canoy et al. (2006) and also in van der Ploeg (2004). The empirical literature on the economics of bestsellers and blockbusters in relation to books is much less extensive than the literature on music or movies. There are studies, such as Clerides (2002), that focus on industrial organization issues related to the pricing of books, but few studies examine the distribution of outcomes across competing titles.

One paper that does examine the market for books in a way similar to the studies for movies and music is Gaffeo et al. (2008). This paper uses a form of the rank–revenue model set out in Section 8.3.1 to study the dynamics of the demand for books in Italy. Gaffeo et al. (2008) disaggregated the Italian book publishing industry into three broad submarkets: Italian novels, foreign novels, and non-fiction. Within each submarket the authors find evidence that the distribution of sales can be well-fitted by a power law distribution. However, in contrast to the studies of music and movie markets, a significant departure from the Pareto law was not found, so the authors conclude that the information transmission in the Italian market for books is characterized by constant returns. However, their results are consistent with the nobody-knows principle, the winner-take-all property, and the success-breeds-success principle.

Chevalier and Mayzlin (2006) investigate word-of-mouth impact on book sales using book reviews at two popular online booksellers: Amazon.com and Barnesandnoble.com. Book reviews are found to be positively associated with book sales, but the marginal impact of bad reviews is greater than the impact of good reviews. They also find evidence to indicate that customers actually read the text of online reviews rather than relying solely on the numerical review. This final result is probably the most interesting of their study in that it provides some evidence for models that are predicated on the sharing of quality information rather than models that are based solely on quantity signals; in this sense the Chevalier and Mayzlin (2006) paper is consistent with the De Vany and Lee (2001) paper, which in a simulation compares models of quantity signals to models of quality signals. Beck (2007) also estimates the impact of word-of-mouth on sales of novels. In an interesting empirical study, Sorensen (2007) finds that appearing on the bestseller list leads to a modest increase in sales for the average book and that the effect is bigger for first-time authors. Sorensen finds that the market expansion effect of bestseller lists appears to dominate any business stealing from non-best-selling titles.

8.5 Conclusions: Some Interesting Issues for Further Research

8.5.1 Heavy Tails or the Long Tail?

Changes in the way goods are sold may affect the distribution of success. Anderson (2006) made popular the hypothesis that firms in the Internet age may be able to profitably sell small volumes of less popular titles to many customers instead of only selling large volumes of very popular titles. The total sales of a large number of non-hit titles is called the long tail. Does the evidence allow us to distinguish between these competing hypotheses? Are the results different across industries? Leskovec et al. (2007) analyze a person-to-person recommendation network, consisting of 4 million people who made 16 million recommendations on half a million products. They study the propagation of recommendations and the cascade sizes, and analyze how user behavior varies within user communities defined by a recommendation network. Product purchases are found to follow a ‘long tail’ where a significant share of purchases belongs to rarely sold items.

Tucker and Zhang (2011) find evidence of a steep tail effect where customers tend to herd to the most popular items, though there is a business creation effect that may stimulate demand for less popular products. This is not unlike the business creation effect of best-selling titles found by Sorensen (2007) in the book market. Elberse and Oberholzer-Gee (2007) examine data from music and movie Internet sales, and find no evidence to support the long tail theory. A few studies have examined the market for DVDs in comparison to the theatrical market for movies. McKenzie (2010) using Australian data found that the DVD revenue distribution had thicker tails than the theatrical revenue distribution so that top-ranked DVDs earned a greater share of revenues than their theatrical counterparts; thus success was even more highly concentrated on a small number of DVDs than it was in the theatrical market. Walls (2010b) using North American data finds that the DVD market differs markedly from the theatrical market in the opposite way: the theatrical movie market was found to have a clear winner-take-all character where the few high-grossing films are improbably large to have been drawn from a (log) normal probability model, but he finds no evidence of heavier-than-normal tails in the distribution of cumulative revenues over DVD titles. The North American sample of DVDs is populated by a more diverse set of competitors, including not only theatrically released movies, but also films released directly to DVD in addition to a wide variety of repackaged programming originally broadcast on television and cable TV.

8.5.2 Piracy and the Distribution of Success

Piracy is one of the most challenging problems faced by the movie, book, and music industries. One of the biggest problems that has hampered studies of piracy is the counterfactual methods by which effects are usually estimated. De Vany and Walls (2007) develop and estimate a statistical model of the effects of piracy on the box office performance of a widely released movie. The model discredits the argument that piracy increases sales, showing unambiguously that Internet piracy diminished the box office revenues of a widely released movie. The model overcomes a major weakness of counterfactual or ‘but for piracy’ methods widely used to estimate damages. These counterfactual methods violate the ‘nobody-knows’ principle because they forecast what the product would have earned in the absence of piracy. The model does not violate this basic principle of movie uncertainty. To summarize the economics, piracy adversely affects the dynamics of demand and supply in the theatrical market. The immediate effect on the dynamics is to increase the rate at which revenue and the number of screens showing the movie decline. The longer-term effect is to alter the information dynamics in such a way that demand in later weeks is also reduced. Thus, a pirated movie will play off more rapidly and lose revenue at an accelerated rate during its run. They test this economic hypothesis by empirically modeling the rate of decline in weekly revenues in relation to the number of pirated prints available for Internet download.

Are illegal downloads responsible for declining music/book/film sales? One theory is that instead of making a purchase a customer obtains the title online for free. However, sharing may lead to learning about performers and this extra information may increase legitimate sales. Oberholzer-Gee and Strumpf (2007) conclude that file sharing has a nearly insignificant impact on CD sales. Hong (2013) examines a panel of household expenditures at the time of the introduction of Napster. He concludes that file sharing accounts for 20% of the decline in music sales. Moreover, he concludes that this is primarily driven by households with children ranging between 6 and 17 years of age. Rob and Waldfogel (2006) asked a panel of undergraduate students at the University of Pennsylvania how much music they obtained from file sharing and from purchases. They estimate that for albums that sold more than 2 million copies from 1999 to 2003, there is no relationship between downloads and CD sales. However, for lower-sales albums, five album downloads correspond to one less album purchased. Rob and Waldfogel (2007) conduct a similar analysis for films. Liebowitz (2006, 2008) finds that file sharing accounts for at least all of the decline in US music sales. McKenzie (2009a) explores illegal music file-sharing activity and its effect on Australian sales of singles in the physical and digital retail markets. Using 15 weeks of Australian weekly chart rankings of physical and digital sales, combined with a proxy for download activity, his evidence suggests no significant impact of download activity on legitimate sales. Liebowitz (2011) standardizes the metric of sales decline across most of the studies of music sales mentioned above and concludes that in most cases the entire decline in music sales can be attributed to file sharing.

Mortimer and Sorensen (2007) study the relation between file sharing and concert attendance. They conclude that while sales of recorded music declined after the introduction of file sharing, concert revenues and the number of artists performing concerts increased dramatically. Overall, the patterns in the data suggest that while file sharing may have eroded profits from CD sales, it also increased the profitability of live performances. Hendricks and Sorensen (2009) investigate the role of information in the distribution of success across products with an empirical application to recorded music. Specifically, their paper models the role of information discovery in demand. Their results show that the release of a new album is associated with an increase in the sales of a performer’s previously released albums. Hendricks and Sorensen call these effects ‘backward spillovers’, and they support a structure of information flows where demanders discover the artist from the newly released album and subsequently make purchases of the performer’s previously released albums. They conclude that the distribution of sales across performers is more skewed than it would be if consumers were more fully informed. Examining the role of piracy on the dynamics of demand and the distribution of product success appears to be a fruitful area for future research.

8.5.3 Customer Reviews, Word of Mouth, and Social Media

How do customer reviews affect customer choice? How do rankings lists affect customer choice? Studies such as Chevalier and Mayzlin (2006) and Liu (2006) used online product reviews to predict product sales. Most studies such as Chevalier and Mayzlin (2006) show that online product reviews have a significant effect on product sales. However, other studies such as Liu (2006) show that it is the volume of product reviews that significantly influences sales while the valence does not show any significant effects. Duan et al. (2008) provide a preliminary analysis of online word-of-mouth impact on film sales. The way in which online reviews impact the entire distribution of sales is not clear. This appears to be a fertile area for research.

A dynamic component of online communication that has been rapidly increasing in volume are blogs and other forms of social media. Social media are a fast-growing form of online communication and they represent a widely used method of personal expression. Asur and Huberman (2010) use the chatter from Twitter.com to forecast box office revenues for movies. They show that a simple model built from the rate at which tweets are created about particular topics can outperform market-based predictors. They also show that sentiments extracted from Twitter can be further utilized to improve the forecasting power of social media. A number of interesting questions arise. Do the conceptual or theoretical models yield refutable implications that allow us to distinguish between competing models? Does the empirical evidence permit us to test one model against another? Do changes in electronic social interaction have implications for herding or learning in the process that leads to bestsellers and blockbusters?

References

1. Adler M. Stardom and talent. American Economic Review. 1985;75:208–212.

2. Adler M. Stardom and talent. In: Amsterdam: North-Holland; 2006;895–906. Ginsburgh V, Throsby D, eds. Handbook of the Economics of Art and Culture. vol. 1.

3. Adler RJ, Feldman RE, Taqqu MS, eds. A Practical Guide to Heavy Tails: Statistical Techniques and Applications. Berlin: Birkhäuser; 1998.

4. Akdede SH, Ogus A. Increasing returns to information and the survival of Turkish public theatre plays. Applied Economics Letters. 2006;13:785–788.

5. Albert S. Movie stars and the distribution of financially successful films in the motion picture industry. Journal of Cultural Economics. 1998;22:249–270.

6. Albert S. Reply: Movie stars and the distribution of financially successful films in the motion picture industry. Journal of Cultural Economics. 1999;23:325–329.

7. Anderson C. The Long Tail: Why the Future of Business is Selling Less of More. New York: Hyperion; 2006.

8. Asur, S., Huberman, B.A. 2010. Predicting the future with social media. Research Paper. Hewlett-Packard, Palo Alto,CA. Available at: arvix.org/abs/1003.5699.

9. Avery C, Resnick P, Zeckhauser R. The market for evaluations. American Economic Review. 1999;89:564–584.

10. Azzalini A. A class of distribution which includes the normal ones. Scandinavian Journal of Statistics. 1985;12:171–178.

11. Azzalini A. Further results on a class of distribution which includes the normal ones. Statistica. 1986;46:199–208.

12. Azzalini A, Capitanio A. Statistical applications of the multivariate skew-normal distribution. Journal of the Royal Statistical Society. 1999;B61:579–602.

13. Azzalini A, Capitanio A. Distributions generated by perturbation of symmetry with emphasis on a multivariate skew-t distribution. Journal of the Royal Statistical Society. 2003;B65:367–389.

14. Azzalini A, Dalla Valle A. The multivariate skew-normal distribution. Biometrika. 1996;83:715–726.

15. Banerjee AV. A simple model of herd behavior. Quarterly Journal of Economics. 1992;107:797–817.

16. Basuroy S, Ravid SA. Managerial objectives, the R-rating puzzle, and the production of violent films. Journal of Business. 2004;77(Suppl.2):S155–S192.

17. Beck J. The sales effect of word of mouth: a model for creative goods and estimates for novels. Journal of Cultural Economics. 2007;31:5–23.

18. Bikhchandani S, Hirshleifer D, Welch I. A theory of fads, fashion, custom, and cultural change as informational cascades. Journal of Political Economy. 1992;100:992–1026.

19. Brannas K, Nordman N. Conditional skewness modelling for stock returns. Applied Economics Letters. 2003;10:725–728.

20. Brock WA. Scaling in economics: a reader’s guide. Industrial and Corporate Change. 1999;8:409–446.

21. Canoy M, van Ours JC, van der Ploeg F. The economics of books. In: Amsterdam: North-Holland; 2006;721–761. Ginsburgh V, Throsby D, eds. Handbook of the Economics of Art and Culture. Vol. 1.

22. Caves RE. Creative Industries: Contracts between Art and Commerce. Cambridge, MA: Harvard University Press; 2000.

23. Chamley C, Gale D. Information revelation and strategic delay in a model of investment. Econometrica. 1994;62:1065–1085.

24. Chevalier JA, Mayzlin D. The effect of word of mouth on sales: online book reviews. Journal of Marketing Research. 2006;XLIII:345–354.

25. Chisholm D. Motion pictures. In: Towse R, ed. A Handbook of Cultural Economics. Cheltenham: Edward Elgar; 2003;306–314.

26. Chung KH, Cox RAK. A stochastic model of superstardom: an application of the Yule distribution. Review of Economics and Statistics. 1994;76:771–775.

27. Clerides SK. Book value: intertemporal pricing and quality discrimination in the US market for books. International Journal of Industrial Organization. 2002;20:1385–1408.

28. Collins A, Hand C, Snell MC. What makes a blockbuster? Economic analysis of film success in the United Kingdom. Managerial and Decision Economics. 2002;23:343–354.

29. Connolly M, Krueger AB. Rockonomics: the economics of popular music. In: Amsterdam: North-Holland; 2006;667–719. Ginsburgh V, Throsby D, eds. Handbook of the Economics of Art and Culture. Vol. 1.

30. Davies J. The individual success of musicians, like that of physicists, follows a stretched exponential distribution. European Physical Journal B: - Condensed Matter and Complex Systems. 2002;27:445–447.

31. De Vany A. The movies. In: Amsterdam: North-Holland; 2006;615–665. Ginsburgh V, Throsby D, eds. Handbook of the Economics of Art and Culture. vol. 1.

32. De Vany AS, Lee C. Quality signals in information cascades and the dynamics of the distribution of motion picture box office revenues. Journal of Economic Dynamics and Control. 2001;25:593–614.

33. De Vany AS, Walls WD. Bose–Einstein dynamics and adaptive contracting in the motion picture industry. The Economic Journal. 1996;439:1493–1514.

34. De Vany AS, Walls WD. The market for motion pictures: rank, revenue and survival. Economic Inquiry. 1997;4:783–797.

35. De Vany AS, Walls WD. Uncertainty in the movie industry: does star power reduce the terror of the box office? Journal of Cultural Economics. 1999;23:285–318.

36. De Vany AS, Walls WD. Does Hollywood make too many R-rated movies? Risk, stochastic dominance, and the illusion of expectation. Journal of Business. 2002;75:425–451.

37. De Vany AS, Walls WD. Motion picture profit, the stable Paretian hypothesis, and the curse of the superstar. Journal of Economic Dynamics and Control. 2004;28:1035–1057.

38. De Vany AS, Walls WD. Big budgets, big openings and legs: analysis of the blockbuster strategy. Asian Economic Review. 2005;47:308–328.

39. De Vany AS, Walls WD. Estimating the effects of movie piracy on box-office revenue. Review of Industrial Organization. 2007;30:291–301.

40. Duan W, Gu B, Whinston AB. The dynamics of online word-of-mouth and product sales – an empirical investigation of the movie industry. Journal of Retailing. 2008;84:233–242.

41. Elberse A, Oberholzer-Gee F. Superstars and underdogs: an examination of the long-tail phenomenon in video sales. Marketing Science Institute. 2007;4:49–72.

42. Eliashberg J, Elberse A, Leenders MAAM. The motion picture industry: critical issues in practice, current research, and new research directions. Marketing Science. 2006;25:638–661.

43. Epstein JM, Axtell RL. Growing Artificial Societies: Social Science from the Bottom Up. Cambridge, MA: Brookings/MIT Press; 1996.

44. Fama E. The behavior of stock market prices. Journal of Business. 1965;38:34–105.

45. Fan J, Gijbels I. Variable bandwidth and local linear regression smoothers. Annals of Statistics. 1992;20:2008–2036.

46. Fan J, Gijbels I. Local Polynomial Modeling and its Applications. London: Chapman & Hall; 1996.

47. Feller W. An Introduction to Probability Theory and Its Applications. New York: Wiley; 1957.

48. Frank R, Cook P. The Winner-Take-All Society: Why the Few at the Top Get So Much More Than the Rest of Us. London: Penguin Books; 1996.

49. Frisch U, Sornette D. Extreme deviations and applications. Journal de Physique I. 1997;7:1155–1171.

50. Gabaix, X. 2008. Power laws In Durlauf, S.N., Blume, L.E. (Eds.), The New Palgrave Dictionary of Economics, 2nd edn. Palgrave Macmillan, New York [Online]. Available at: http://www.palgraveconnect.com/pc/doifinder/10.1057/9781137336583. (Accessed: 2 July 2013).

51. Gaffeo E, Scorcu AE, Vici L. Demand distribution dynamics in creative industries: the market for books in Italy. Information Economics and Policy. 2008;20:257–268.

52. Giles DE. Superstardom in the US popular music industry revisited. Economics Letters. 2006;92:68–74.

53. Giles DE. Increasing returns to information in the US popular music industry. Applied Economics Letters. 2007a;14:327–331.

54. Giles DE. Survival of the hippest: life at the top of the hot 100 Applied Economics. 2007b;39:1877–1887.

55. Goldman W. Adventures in the Screen Trade. New York: Warner Books; 1983.

56. Hadida AL. Motion picture performance: a review and research agenda. International Journal of Management Reviews. 2009;11:297–335.

57. Hamlen WAJ. Superstardom in popular music: empirical evidence. Review of Economics and Statistics. 1991;73:729–733.

58. Hamlen WAJ. Variety and superstardom in popular music. Economic Inquiry. 1994;32:395–406.

59. Hand C. Increasing returns to information: further evidence from the UK film market. Applied Economics Letters. 2001;8:419–421.

60. Harris RDF, Kucukozmen CC. The empirical distribution of stock returns: evidence from an emerging European market. Applied Economics Letters. 2001;8:367–371.

61. Hendricks K, Sorensen A. Information and the skewness of music sales. Journal of Political Economy. 2009;117:324–369.

62. Ho, J., Ho, K., Mortimer, J., 2008. Welfare effects of full-line forcing contracts in the video rental industry. Working Paper. Harvard Business School, Cambridge, MA.

63. Hogg RV, Craig AT. Introduction to Mathematical Statistics. 4th edn. New York: Macmillan; 1978.

64. Hong SH. Measuring the effect of Napster on recorded music sales: difference-in-differences estimates under compositional changes. Journal of Applied Econometrics. 2013;28:297–324.

65. Ijiri Y, Simon HA. Interpretations of departures from the Pareto curve firm-size distributions. Journal of Political Economy. 1974;82:315–332.

66. Jozefowicz JJ, Kelley JM, Brewer SM. New release: an empirical analysis of VHS/DVD rental success. Atlantic Economic Journal. 2008;36:139–151.

67. LaHerrere J, Sornette D. Stretched exponential distributions in nature and economy: ‘Fat tails’ with characteristic scales. European Physical Journal B. 1998;2:525–539.

68. Lambert P, Lindsey JK. Analysing financial returns using regression models based on non-symmetric stable distributions. Applied Statistics. 1999;48:409–424.

69. Leskovec J, Adamic LA, Huberman BA. The dynamics of viral marketing. ACM Trans Web. 2007;1 art. 5.

70. Li Q, Racine JS. Cross-validated local linear nonparametric regression. Statistica Sinica. 2004;14:485–512.

71. Liebowitz SJ. File sharing: creative destruction or just plain destruction? Journal of Law and Economics. 2006;49:1–28.

72. Liebowitz SJ. Research note–testing file sharing’s impact on music album sales in cities. Management Science. 2008;54:852–859.

73. Liebowitz, S.J., 2011. The metric is the message: how much of the decline in sound recording sales is due to file-sharing? Working Paper. School of Management, University of Texas, Dallas,TX.

74. Litman BR. Predicting the success of theatrical movies: an empirical study. Journal of Popular Culture. 1983;16:159–175.

75. Litman B. The motion picture entertainment industry. In: Adams W, ed. The Structure of American Industry. New York: Macmillan; 1990;183–216.

76. Litman BR, Ahn H. Predicting financial success of motion pictures: the early ’90s experience. In: The Motion Picture Mega-Industry. MA: Allyn and Bacon, Needham Heights; 1998;172–197.

77. Liu Y. Word of mouth for movies: its dynamics and impact on box office revenue. Journal of Marketing. 2006;70:74–89.

78. MacDonald GM. The economics of rising stars. American Economic Review. 1988;78:155–166.

79. Maddison D. Increasing returns to information and the survival of Broadway theatre productions. Applied Economics Letters. 2004;11:639–643.

80. Mahajan V, Muller E, Kerin RA. Introduction strategy for new products with positive and negative word-of-mouth. Management Science. 1984;30:1389–1404.

81. Mandelbrot B. New methods in statistical economics. Journal of Political Economy. 1963a;71:421–440.

82. Mandelbrot B. The variation of certain speculative prices. Journal of Business. 1963b;36:394–419.

83. Mandelbrot B. The variation of some other speculative prices. Journal of Business. 1967;40:393–413.

84. Mandelbrot B. Fractals and Scaling in Finance: Discontinuity, Concentration, Risk. New York: Springer; 1997.

85. McCulloch, J.H., 1996. Financial applications of stable distributions. In: Maddala, G.S., Rao, C.R. (Eds.), Statistical Methods in Finance. North-Holland, New York, pp. 393–425.

86. McCulloch JH. Linear regression with stable disturbances. In: Adler RJ, Feldman RE, Taqqu MS, eds. A Practical Guide to Heavy Tails: Statistical Techniques and Applications. Berlin: Birkhäuser; 1998;359–378.

87. McKenzie J. Bayesian information transmission and stable distributions: motion picture revenues at the Australian box office. Economic Record. 2008;84:338–353.

88. McKenzie J. Illegal music downloading and its impact on legitimate sales: Australian empirical evidence. Australian Economic Papers. 2009a;48:296–307.

89. McKenzie J. Revealed word-of-mouth demand and adaptive supply: survival of motion pictures at the Australian box office. Journal of Cultural Economics. 2009b;33:279–299.

90. McKenzie J. How do theatrical box office revenues affect DVD sales? Australian empirical evidence Journal of Cultural Economics. 2010;34:159–180.

91. McKenzie J. The economics of movies: a literature survey. Journal of Economic Surveys. 2012;26:42–70.

92. Mizerski RW. An attribution explanation of the disproportionate influence of unfavorable information. Journal of Consumer Research. 1982;9:301–310.

93. Mortimer, J., Sorensen, A., 2007. Supply responses to digital distribution: recorded music and live performances. Working Paper. Stanford Graduate School of Business, Stanford, CA.

94. Moul C. Measuring word-of-mouth’s impact on theatrical movie admissions. Journal of Economics and Management Strategy. 2007;16:859–892.

95. Nolan JP. Parameterizations and modes of stable distributions. Statistics and Probability Letters. 1998a;38:187–195.

96. Nolan JP. Univariate stable distributions: parameterizations and software. In: Adler RJ, Feldman RE, Taqqu MS, eds. A Practical Guide to Heavy Tails: Statistical Techniques and Applications. Berlin: Birkhäuser; 1998b;527–533.

97. Nolan, J.P., 1999. Fitting data and assessing goodness-of-fit with stable distributions. In: Nolan, J.P., Swami, A. (Eds.), Proceedings of the ASA-IMS Conference on Heavy Tailed Distributions. Washington, DC, pp. 1–52.

98. Nolan JP. Maximum likelihood estimation of stable parameters. In: Barndor-Nielsen OE, Mikosch T, Resnick SI, eds. Lévy Processes: Theory and Applications. Boston, MA: Birkhäuser; 2001;379–400.

99. Nolan, J.P., 2005. STABLE computer program. Available at: <http://academic2.american.edu/simjpnolan/stable/stable.html>.

100. Nolan, J.P., 2008. Bibliography on stable distributions, processes and related topics. Mimeograph, Mathematics Department, American University, Washington, DC.

101. Nolan JP. Stable Distributions – Models for Heavy Tailed Data. Boston, MA: Birkhauser; 2011.

102. Oberholzer-Gee F, Strumpf K. The effect of file sharing on record sales: an empirical analysis. Journal of Political Economy. 2007;115:1–42.

103. Ottaviani M, Sorensen P. Herd behavior and investment: comment. American Economic Review. 2000;90:695–704.

104. Pardoe I, Simonton DK. Applying discrete choice models to predict Academy Award winners. Journal of the Royal Statistical Society Statistics in Society. 2008;171:375–394.

105. Pitt I. Superstar effects on royalty income in a performing rights organization. Journal of Cultural Economics. 2010;34:219–236.

106. Prag J, Casavant J. An empirical study of determinants of revenues and marketing expenditures in the motion picture industry. Journal of Cultural Economics. 1994;18:217–235.

107. Rachev ST, Mittnik S. Stable Paretian Models in Finance. New York: Wiley; 2000.

108. Racine J, Li Q. Nonparametric estimation of regression functions with both categorical and continuous data. Journal of Econometrics. 2004;119:99–130.

109. Ravid SA. Information, blockbusters and stars: a study of the film industry. Journal of Business. 1999;72:463–486.

110. Ravid SA. Are they all crazy or just risk averse? Some movie puzzles and possible solutions. In: Ginsburgh V, ed. The Economics of Art and Culture. Amsterdam: North-Holland; 2004;33–47.

111. Rob R, Waldfogel J. Piracy on the high C’s: music downloading, sales displacement, and social welfare. Journal of Law and Economics. 2006;XLIX:29–62.

112. Rob R, Waldfogel J. Piracy on the silver screen. Journal of Industrial Economics. 2007;55:379–395.

113. Rosen S. The economics of superstars. American Economic Review. 1981;71:167–183.

114. Rosser Jr JB. Econophysics. In: Durlauf SN, Blume LE, eds. The New Palgrave Dictionary of Economics. 2nd edn. New York, pp. 729–732: Palgrave Macmillan; 2008.

115. Samorodnitsky G, Taqqu MS. Stable Non-Gaussian Random Processes. New York: Chapman & Hall; 1994.

116. Sedgwick J, Pokorny M. Comment: Movie stars and the distribution of financially successful films in the motion picture industry. Journal of Cultural Economics. 1999;23:319–323.

117. Sgroi D. Optimizing information in the herd: guinea pigs, profits, and welfare. Games and Economic Behavior. 2002;39:137–166.

118. Simon HA. On a class of skew distribution functions. Biometrika. 1955;52:425–440.

119. Simonoff JS, Ma L. An empirical study of factors relating to the success of Broadway shows. Journal of Business. 2003;76:135–150.

120. Sinha S, Raghavendra S. Hollywood blockbusters and long-tailed distributions. European Physical Journal B: Condensed Matter and Complex Systems. 2004;42:293–296.

121. Smith SP, Smith VK. Successful movies: a preliminary empirical analysis. Applied Economics. 1986;18:501–507.

122. Smith L, Sorensen P. Pathological outcomes of observational learning. Econometrica. 2000;68:371–398.

123. Sorensen AT. Bestseller lists and product variety. Journal of Industrial Economics. 2007;55:715–738.

124. Sornette D. Multiplicative processes and power laws. Physical Review E. 1998;57:4811–4813.

125. Sornette D. Economy of scales in R&D with block-busters. Quantitative Finance. 2002;2:224–227.

126. Spierdijk L, Voorneveld M. Superstars without talent? the Yule distribution controversy. The Review of Economics and Statistics. 2009;91:648–652.

127. Steindl J. Random Processes and the Growth of Firms: a Study of the Pareto Law. New York: Hafner; 1965.

128. Strobl E, Tucker C. The dynamics of chart success in the UK pre-recorded popular music industry. Journal of Cultural Economics. 2000;24:113–134.

129. Taleb NN. The Black Swan: The Impact of the Highly Improbable. New York: Random House; 2007.

130. Throsby, D., 2002. The Music Industry in the New Millennium: Global and Local Perspectives. UNESCO, Paris.

131. Tucker C, Zhang J. How does popularity information affect choices? A field experiment. Management Science. 2011;57:828–842.

132. Uchaikin VV, Zolotarev VM. Chance and Stability: Stable Distributions and their Applications. Utrecht: VSP; 1999.

133. van der Ploeg F. Beyond the dogma of the fixed book price agreement. Journal of Cultural Economics. 2004;28:1–20.

134. Vining D. Autocorrelated growth rates and the Pareto law: a further analysis. Journal of Political Economy. 1976;84:369–380.

135. Wallace WT, Seigerman A, Holbrook MB. The role of actors and actresses in the success of films: how much is a movie star worth? Journal of Cultural Economics. 1993;17:1–24.

136. Walls WD. Increasing returns to information: evidence from the Hong Kong movie market. Applied Economics Letters. 1997;4:187–190.

137. Walls WD. Product survival at the cinema: evidence from Hong Kong. Applied Economics Letters. 1998;5:215–219.

138. Walls WD. Demand stochastics, supply adaptation, and the distribution of film earnings. Applied Economics Letters. 2005a;12:619–623.

139. Walls WD. Modeling movie success when ‘nobody knows anything’: conditional stable-distribution analysis of film returns. Journal of Cultural Economics. 2005b;29:177–190.

140. Walls WD. Modeling skewness and heavy tails in film returns. Applied Financial Economics. 2005c;15:1181–1188.

141. Walls WD. Economics of motion pictures. In: Durlauf SN, Blume LE, eds. The New Palgrave Dictionary of Economics. 2nd edn. New York, Palgrave Macmillan; 2008.

142. Walls WD. Robust analysis of movie earnings. Journal of Media Economics. 2009a;22:20–35.

143. Walls WD. Screen wars, star wars, and sequels: nonparametric reanalysis of movie profitability. Empirical Economics. 2009b;37:447–461.

144. Walls WD. General stable models of the rate of return to Hollywood films. Advances and Applications in Statistical Sciences. 2010a;2:19–36.

145. Walls WD. Superstars and heavy tails in recorded entertainment: Empirical analysis of the market for DVDs. Journal of Cultural Economics. 2010b;34:261–280.

146. Zolotarev, V.M., 1986. One-Dimensional Stable Distributions. American Mathematical Society, Providence. (Translation of the original 1983 Russian edition).


1For simplicity in the exposition, I will refer to all audio recordings generically as music, even though these conceptually cover the full range of titles from an Anthony Robbins motivational lecture to a ZZ Top song.

2This information could be from the opinions of expert reviewers (or any other individual), advertising, or sales rank or volume.

3The Pareto law is a special case of a power law. See Gabaix (2008) for a brief discussion of the forces that generate power laws and empirical applications in economics. A more detailed exposition of particular applications of power laws is contained in the works of Mandelbrot (1997).

4Vining (1976) provides further analysis and refinement of the Ijiri and Simon (1974) analytical framework. Vining’s (1976) insights will become useful in the interpretation of our empirical results, but to focus on the methodological issues raised in his research at this point in this paper would overly complicate the exposition of this section while not adding anything fundamental to the reader’s understanding.

5This is discussed at some length in Uchaikin and Zolotarev (1999).

6See, for example, the volume edited by Adler et al. (1998) for many papers that extend or apply the stable distribution model, and Rachev and Mittnik (2000) and McCulloch (1996) for specific applications in finance. Nolan (2008) has compiled an exhaustive bibliography relating to nearly all aspects of stable modeling.

7Competing candidate distributions, such as the Student-t, can be used to model heavy tails, and the skewed-Student-t (Azzalini and Capitanio, 2003) can accommodate skewness and heavy tails, but they are ad hoc. Only distributions in the stable class can appeal to the general central limit theorem.

8See Nolan (1998a,b) and Nolan (2011, chapter 1) for useful discussions of the various parameterizations of the stable distribution.

9For example, Harris and Kucukozmen (2001) use the exponential generalized image and skew generalized t distributions, and Brannas and Nordman (2003) apply the log-generalized image and Pearson type IV specifications.

10In the context of the film industry, this log-linear regression equation or one similar to it has been used by researchers including Smith and Smith (1986), Prag and Casavant (1994), Litman and Ahn (1998), Ravid (1999), Walls (2005b, 2009a), and others. For a more complete listing, see McKenzie (2012).

11For example, De Vany and Walls (1999), Collins et al. (2002), and Pardoe and Simonton (2008) use discrete choice analysis to model various metrics of film success.

12Litman and Ahn (1998) do not report elasticities in their paper since. Since their model is estimated in levels, I have calculated the elasticity using their estimate regression coefficient on budget of 0.38254, their reported average budget of 31.38 million and average box office revenue of 51.24.

13Again, Litman and Ahn (1998) do not explicitly report elasticity estimates in their paper. I have calculated the point elasticity using their estimated regression coefficient of 0.01982, and their reported average screens of 1669.9 and average box office gross of 31.38 million.

14Structural analysis of the DVD market – as opposed to the approach based on scaling laws and winner-take-all distributions – is contained in recent papers such as those of Ho et al. (2008) and Elberse and Oberholzer-Gee (2007).

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset