Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Chapter 3

Portfolio Management

3.1 Expected Utility Functions

3.1.1 Utility Functions

Suppose we have different investment opportunities to choose from. These investments may affect our future wealth. For example, the task is to allocate an initial capital among several risky assets to form an investment portfolio. Once we decide on such a risky investment, the future wealth becomes uncertain so it follows some probability distribution. The investment selection procedure can be reduced to the optimization of the probability distribution of the uncertain future wealth. If the outcomes from all alternatives were certain, then we would select the investment that produces the largest return. For the case with uncertain outcomes, we may be interested in minimizing the variance of the respective probability distribution but other criteria can be applied. So we need a systematic way to rank random wealth levels. In the case of alternatives with uncertain outcomes, we introduce some score function that is calculated as an expected value of a so-called utility function.

Suppose we are given a function u: ℝ → ℝ so that each possible investment can be assessed by computing the expected utility value E[u(V)] of the future wealth V . In other words, the value of an investment can be measured by the expected value of the utility of its consequences. To compare possible alternatives, we first compute E[u(V)] for each possible wealth function V and then choose an alternative with the greatest expected utility value. The specific utility function used depends on individual investment preferences, risk tolerance, and individual financial environment. The simplest example of a utility is the linear function u(x)= x. Whoever uses this utility function ranks uncertain wealths by their expected values.

Here are some of the most commonly used utility functions (see Figure 3.1):

the logarithmic (log) utility function u(x) = ln x;
the exponential utility function u(x) = −e−ax with a > 0;
the power utility function u(x)= xa with 0 < a < 1;
the quadratic utility function u(x)= x − ax2 with 0 < a defined for $x < \frac{1}{2 a}$ .

Figure 3.1

Figure showing sample plots of four commonly used utility functions.

Sample plots of four commonly used utility functions.

As you can see, some utility functions can take negative values. This negativity does not matter since an investor ranks investments using relative values. Moreover, the addition of a constant to a utility function and the multiplication of a utility function by a positive constant do not affect the rankings. If for some utility function u and two investments V1 and V2 we have that E[u(V1)] ≤ E[u(V2)], then for any a ∊ ℝ and b > 0 we obtain

$E [a + b u (V_{1})] = a + b E [u (V_{1})] \leq a + b E [V_{2}] = E [a + b u (V_{2})] .$

In general, given a utility function u, we can define another utility function $\tilde{u}$ of the form

$\tilde{u} (x) = a + b u (x)$

with b > 0. This new utility function $\tilde{u}$ is said to be equivalent to u. Equivalent utility functions give identical rankings of investment opportunities.

Another example of a utility function is the linear utility u(x)= a + bx, b > 0. This function reflects expectations of a risk-indifferent investor. For any random wealth V we have

$E [u (V)] = E [a + b V] = a + b E [V] = u (E [V]),$

hence the linear utility function has no preference for a deterministic wealth or for a random wealth provided that expected wealths are the same.

Example 3.1.

An investor with total capital W can invest any amount between 0 and W . If an amount is invested, then the same amount is either won or lost with respective probabilities p and 1 − p. In other words, with probability p, the investor doubles the initial investment; with probability 1 − p, the investor loses all the invested money. What amount should be invested if the log utility function u(V) = ln V is utilized for ranking alternatives?

Solution. Let the amount of xW for some 0 ≤ x ≤ 1 be invested. The investor's final fortune V (x) is either W + xW or W − xW with respective probabilities p and 1 − p. Hence the expected utility of the final wealth is

$\begin{array}{l} E [u (V (x))] & = & \ln (W + x W) p + \ln (W - x W) (1 - p) \\ = & \ln ((1 + x) W) p + \ln ((1 - x) W) (1 - p) \\ = & \ln (1 + x) p + \ln (1 - x) (1 - p) + \ln W . \end{array}$

To find the optimal value of x, let us differentiate E[u(V (x))] with respect to x and then find zeros of the derivative obtained:

$\begin{array}{l} \frac{d}{d x} E [u (V (x))] & = & \frac{d}{d x} (p \ln (1 + x) + (1 - p) \ln (1 - x)) \\ = & \frac{d}{1+ x} - \frac{1 - p}{1 - x} = \frac{2 p - (1 + x)}{1 - x^{2}} . \end{array}$

If $p \in (0, \frac{1}{2}]$ , then the derivative is strictly negative for all x ∊ (0, 1) and the expected utility attains its maximum value at x = 0. In this case, the risk to lose the invested amount is too high, and it is reasonable to invest nothing. If $p \in (\frac{1}{2}, 1)$ , then the derivative is zero at x* = 2p − 1 ∊ (0, 1). The second derivative of the expected utility function is negative at x* , hence x = 2p − 1 ∊ (0, 1) is the point of maximum of V (x). Therefore, 100(2p − 1)% of the initial capital is to be invested. For example, for p = 70%, the investor shall invest 40% of the fortune.

Example 3.2.

The Saint Petersburg Paradox, originally proposed by Nicolaus Bernoulli, is a classical example of how utility functions are used in the decision making process. Consider a game of chance where a fixed fee is paid to enter, and then a fair coin will be tossed repeatedly until a head first appear ending the game. The payoff starts at $1 and then is doubled every time a tail appears. As a result, the player wins $2k−1 if a head first appears on the kth toss (k = 1, 2, 3,...). How much should the player be willing to pay to enter such a game?

Solution. First, let us find the expected value of the payoff. We deal with a sequence of independent trials where the probability of success (i.e., a head occurs) is $\frac{1}{2}$ . With probability $p_{1} = \frac{1}{2}$ , a head first appears on the first toss and the player wins $1; with = probability $p_{2} = \frac{1}{4}$ , a head first appears on the second toss and the player wins $2; with = probability $p_{3} = \frac{1}{8}$ , a head first appears on the third toss and the player wins $4, etc. The probability that a head first appears on the kth toss is pk = 2−k; the payoff is then $2k−1. Therefore, the expected payoff is then

$E = \frac{1}{2} \cdot 1 + \frac{1}{4} \cdot 2 + \frac{1}{8} \cdot 4 + \dots + \frac{1}{2^{k}} 2^{k - 1} + \dots = \frac{1}{2} + \frac{1}{2} + \frac{1}{2} + \dots = \infty .$

The expected win for the player of this game is an infinite amount of money. So no matter how large is the fee paid to enter this game, the player will eventually make a profit in the long run repeatedly playing this game. The classical solution to this “paradox” is to assume that one's valuation of money is different from its face value and depends on his or her wealth. Let us apply the logarithmic utility model to find a reasonable price c charged to enter the game. Let the initial wealth of the player be denoted V0. The expected log utility function of the total wealth V = V (c) after playing the game is

$E [\ln V] = \sum_{k = 1}^{\infty} \ln (V_{0} + 2^{k - 1} - c) \frac{1}{2^{k}} < \infty .$

Figure 3.2

Figure showing an outcome tree for the St. Petersburg game. The game consists of a series of coin tosses offering a 50% chance of winning $1, a 25% chance of $2, a 12.5% chance of $4, and so on. The gamble may continue indefinitely.

An outcome tree for the St. Petersburg game. The game consists of a series of coin tosses offering a 50% chance of winning $1, a 25% chance of $2, a 12.5% chance of $4, and so on. The gamble may continue indefinitely.

A rational player is willing to play the game only if the game will not decrease the expected utility of the wealth:

$E [\ln V] \geq E [\ln V_{0}] .$

After plotting the expected change in utility,

$E [\ln V] - E [\ln V_{0}] = E [\ln V - \ln V_{0}] = E [\ln (V / V_{0})],$

as a function of the cost c, we observe that E[ln(V (c)/V0)] is a strictly decreasing function of c and there exists a maximum cost c* so that any price c<c gives a positive expected change in utility. Such a cost c* depends on the initial capital V0 and can be found by solving E[ln(V (c)/V0)]=0 for c. For example, a person with $2 in his pocket is willing to pay up to $2, a person with $1000 is willing to pay up to $5.97, and a millionaire is willing to pay up to $10.94.

3.1.1.1 Risk Aversion

Utility functions are constructed based on the following principles.

Principle 1. Investors prefer more to less. If there are two certain wealths V1 and V2, then an investor prefers the larger one, i.e., V1 <V2 implies u(V1) <u(V2). Hence, u is an increasing function.

Principle 2. Investors are averse to risk. Positive deviations ΔV from average wealth V cannot compensate for equally large and equally probable negative deviations −ΔV from average wealth, i.e., u(V) − u(V − ΔV) > u(V +ΔV) − u(V). Therefore,

$u (V) > \frac{u (V + Δ V) + u (V - Δ V)}{2} . (3.1)$

The inequality in (3.1) holds true for all V if u is a concave function. The left-hand side, u(x) − u(V − ΔV), is “the pain of losing ΔV dollars,” and the right-hand side, u(V + ΔV) − u(V), is “the joy of winning ΔV dollars.” The inequality says that the pain of losing outweighs the joy of winning, alternatively that we react more severely to a loss then we do to a gain of the same magnitude.

Suppose that there are two alternatives for future wealth: the first provides either x or y each with a probability of $\frac{1}{2}$ , the second gives $\frac{1}{2} x + \frac{1}{2} y$ with certainty. Although both alternatives have the same expected value, a risk-averse investor prefers the certain wealth of $\frac{1}{2} x + \frac{1}{2} y$ to a 50-50 chance of x and y: $u (\frac{1}{2} x + \frac{1}{2} y) \geq \frac{1}{2} u (x) + \frac{1}{2} u (y)$ .

Recall that a function u defined on an interval [a, b] is said to be concave if for any α with 0 ≤ α ≤ 1 and any x, y ∊ ℝ there holds

$u (α x + (1 - α) y) \geq α u (x) + (1 - α) u (y) .$

A function u is said to be convex on [a, b] if the function −u is concave on [a, b]. That is, if for any α with 0 ≤ α ≤ 1 and any x, y ∊ ℝ there holds

$u (α x + (1 - α) y) \leq α u (x) + (1 - α) u (y) .$

A twice differentiable function u is concave (convex) on an interval [a, b] if its second derivative u″ is nonpositive (nonnegative) on [a, b]. A utility function is said to be risk-averse (on an interval [a, b]) if it is concave (on the interval [a, b]). A concave function is depicted in Figure 3.3. For a twice-differentiable utility function, the risk-averse condition means that the second derivative of the utility function is nonpositive.

Figure 3.3

Figure showing the concave (or convex-upward) plot of a typical risk-averse utility function. As is seen, every curve segment of a concave plot lies above a chord connecting the endpoints of the segment.

The concave (or convex-upward) plot of a typical risk-averse utility function. As is seen, every curve segment of a concave plot lies above a chord connecting the endpoints of the segment.

Recall Jensen's inequality: let u be a concave function, then for any random variable V,

$E [u (V)] \leq u (E [V]) . (3.2)$

This means that a risk-averse investor prefers a certain wealth of W to an uncertain wealth V with the same expected value E[V]= W . This observation relates to the notion of the certainty equivalent.

The certainty equivalent of an uncertain wealth V is defined as the amount of a constant wealth C that has the utility level equal to the expected utility of V :

$u (C) = E [u (V)] . (3.3)$

Clearly, the certainty equivalent is the same for all equivalent functions. Combining (3.2) and (3.3) gives that the certainty equivalent is always less than the expected value of the wealth for a risk-averse investor with a concave utility function:

$u (C) \leq u (E [V]) \Rightarrow C \leq E [V] .$

Let us represent the uncertain return V in the form V = W + ∈ where W is the initial capital and ∈ is a zero-mean risk. A natural way to measure risk aversion is to ask how much an investor is ready to pay to get rid of the zero-mean risk ∈. This price, called a risk premium and denoted π, is defined implicitly by

$E [u (W + \in)] = u (W - π) . (3.4)$

Let us consider a small risk ∈. Expanding the left-and right-hand sides of (3.4) in Taylor's approximations gives

$\begin{array}{l} E [u (W + \in)] & \approx E [u (W) + \in u^{'} (W) + \frac{\in^{2}}{2} u^{″} (W)] \\ = u (W) + E [\in] u^{'} (W) + \frac{E [\in^{2}]}{2} u^{″} (W) = u (W) + \frac{σ_{\in}^{2}}{2} u^{″} (W) \end{array}$

and

$u (W - π) \approx u (W) - π u^{'} (W),$

respectively, where $σ_{\in}^{2} := E [\in^{2}]$ is the variance of ∈. Substituting these back into (3.4), we obtain

$π \approx \frac{σ_{\in}^{2}}{2} A_{u} (W),$

where

$A_{u} (W) := - \frac{u^{″} (W)}{u^{'} (W)}$

is the Arrow–Pratt absolute risk aversion coefficient. We say that investor 1 (with utility function u1) is more risk averse than investor 2 (with utility function u2) if for the same initial wealth W and zero-mean risk ∈, the risk premium π1 paid by investor 1 is larger than the risk premium π2 of investor 2, or, equivalently, $A_{u_{1}} (W) > A_{u_{2}} (W)$ .

The degree of risk aversion can be viewed as a measure of the magnitude of concavity of the utility function: the stronger the bend in the function, the larger the risk aversion coefficient A. For example, the risk-aversion coefficient for a linear utility function, u(V)= a + bV, is zero. The coefficient A is normalized by the derivative u′ that appears in the denominator. This makes A independent of linear transformations of the utility function u. Indeed, for any reals a and b ≠ 0 we have

$- \frac{(a + b u (x))''}{(a + b u (x))'} = - \frac{b u^{″} (x)}{b u^{'} (x)} = - \frac{u^{″} (x)}{u^{'} (x)} .$

The coefficient function A(W) shows how risk aversion changes with the wealth level. It is usually argued that absolute risk aversion should be a decreasing function of wealth. That is, many investors are willing to take more risk when they are financially secure. For example, a lottery to gain or lose $100 is potentially life-threatening for an investor with initial wealth W = 101, whereas it is negligible for an investor with wealth W = 100,000. The former individual should be willing to pay more than the latter for the elimination of such a risk. Thus, we may require that the risk premium associated with any risk is decreasing in wealth. It can be shown that this holds if and only if the Arrow–Pratt absolute risk aversion coefficient is decreasing in wealth. This requirement means that

$A^{'} (W) = - \frac{u^{‴} (W) u^{'} (W) - u^{″} {(W)}^{2}}{u^{'} {(W)}^{2}} < 0.$

A necessary condition for this to hold is u′″ (W) > 0.

As a specific example, consider the exponential utility function u(x)= −e−ax. Differentiate it to obtain

$u^{'} (x) = a e^{- a x} and u^{″} (x) = - a^{2} e^{- a x} .$

Therefore, we have A(x)= −u″ (x)/u′ (x)= a. In this case, the risk aversion remains constant as wealth increases. As another example, consider the power utility function u(x)= xa with 0 < a < 1. We have u′ (x)= axa−1 and u″ (x)= a(a−1)xa−2. Thus, A(x)= (1−a)/x. So risk aversion decreases as wealth increases. Similarly, for the logarithmic utility function u(x) = ln x, we have u′ (x)=1/x, u″ (x)= −1/x2, and A(x)=1/x.

3.1.2 Mean-Variance Criterion

Suppose that the optimal investment opportunity is chosen by maximizing the expected utility of the wealth. Let us show how the utility maximization method reduces to the mean-variance criterion when an optimal investment is selected by maximizing the expected wealth and minimizing the variance of the wealth. Suppose that the final wealth follows a normal probability distribution and the investor uses an exponential utility function u(x)= −e−ax with a > 0. Recall that the mathematical expectation of an exponential function of a normal random variable Z can be expressed in terms of the expected value and variance of Z as follows:

$E [e^{Z}] = e^{E [Z] + Var (Z) / 2} .$

If the wealth V is normal, then −aV is normal as well with mean E[−aV]= −aE[V] and variance Var(−aV)= a2 Var(V). Therefore, the expected utility of the wealth V is

$E [u (V)] = - \exp (- a E [V] + a^{2} Var (V) / 2) = - \exp (- a (E [V] - a Var (V) / 2)) .$

The exponential function is increasing. Thus, the expected utility is maximized by choosing an investment that maximizes E[V] − a Var(V)/2. This means that alternative investments can be ranked by comparing their means and variances. If there are two investments so that E[V1] ≥ E[V2] and Var(V1) ≤ Var(V2), then the first investment results in a larger expected utility than does the second: E[u(V1)] ≥ E[u(V2)].

One can arrive at the same conclusion for the case of a quadratic utility function u(x)= x − ax2 with a > 0. Assuming that the wealth V satisfies $V < \frac{1}{2 a}$ , the expected utility E[u(V)] is maximized by selecting an investment with a larger expected wealth and smaller variance Var(V).

To deal with a general utility function u, let us consider the Taylor expansion of u about the point E[V]:

$u (V) \approx u (E [V]) + u^{'} (E [V]) (V - E [V]) + \frac{1}{2} u^{″} (E [V]) {(V - E [V])}^{2} .$

Taking expectations gives

$\begin{array}{l} E [u (V)] & \approx u (E [V]) + u^{'} (E [V]) E [V - E [V]] + \frac{1}{2} u^{″} (E [V]) E [{(V - E [V])}^{2}] \\ = u (E [V]) + u^{″} (E [V]) Var [V] / 2. \end{array} (3.5)$

Here we use that E[V − E[V]] = E[V] − E[V] = 0 and E (V − E[V])2 = Var(V). Therefore, a reasonable approximation to the optimal investment is given by an investment that maximizes

$u (E [V]) + u^{″} (E [V]) Var [V] / 2.$

Suppose that u″ (x) is a nondecreasing function in x. Then, since u″ (x) ≤ 0, an optimal investment V can be again selected by both maximizing the expected value E[V] and minimizing the variance Var(V). Recall that the standard deviation $σ_{V} = \sqrt{Var (V)}$ characterizes the risk associated with the investment V. Therefore, the mean-variance criterion tells us that the optimal investment is attained by maximizing the expected value of the wealth and minimizing the risk.

3.2 Portfolio Optimization for Two Assets

3.2.1 Portfolio of Two Assets

In a one-period setting, let us consider a model with two risky assets $A_{t}^{1}$ and $A_{t}^{2}$ , where t ∊{0,T}. Each asset, labelled by i = 1, 2, is characterized by its initial value $A_{0}^{i}$ and the respective single-period return $r_{i} = \frac{A_{T}^{i} - A_{0}^{i}}{A_{0}^{i}}$ . At that we have $A_{T}^{i} = A_{0}^{i} (1 + r_{i})$ . The risky returns r1 and r2 (as well as the terminal asset prices $A_{T}^{1}$ and $A_{T}^{2}$ ) are random variables defined on a common probability space with state space and probability function ℙ.

Let us form a portfolio [x1,x2]Τ by purchasing x1 shares of asset 1 and x2 shares of asset 2. The initial wealth of such a portfolio is $V_{0} = x_{1} A_{0}^{1} + x_{2} A_{0}^{2}$ . The (one-period) rate of return rV is then given by

$\begin{matrix} r_{V} & = & \frac{V_{T} - V_{0}}{V_{0}} = \frac{x_{1} (A_{T}^{1} - A_{0}^{1}) + x_{2} (A_{T}^{2} - A_{0}^{2})}{V_{0}} \\ = & \frac{x_{1} (A_{T}^{1} - A_{0}^{1})}{A_{0}^{1}} \frac{A_{0}^{1}}{V_{0}} + \frac{x_{2} (A_{T}^{2} - A_{0}^{2})}{A_{0}^{2}} \frac{A_{0}^{2}}{V_{0}} \\ = & \frac{x_{1} A_{0}^{1}}{V_{0}} r_{1} + \frac{x_{2} A_{0}^{2}}{V_{0}} r_{2} . \end{matrix}$

Introduce the following weights:

$w_{1} = \frac{x_{1} A_{0}^{1}}{V_{0}} and w_{2} = \frac{x_{2} A_{0}^{2}}{V_{0}}$

which are called the allocation weights of funds between the two underlying assets. In other words, 100wi% of the initial wealth is invested in asset i = 1, 2. By the definition of a wealth function, the weights add up to one: w1 + w2 = 1. If short selling is allowed, then one of the weights may be negative and, hence, the other weight is greater than one. For a portfolio without short selling, both weights are between zero and one. Being given the values of returns ri and weights wi, the total wealth at the end of the period is

$V_{T} = (1 + r_{V}) V_{0} = (1 + w_{1} r_{1} + w_{2} r_{2}) V_{0} = (w_{1} (1 + r_{1}) + w_{2} (1 + r_{2})) V_{0} .$

A portfolio with weights [w1,w2]Τ can be characterized by the expected return and the variance of the return. Since rV = w1r1 + w2r2, we have that

$\begin{array}{l} E [r_{V}] & = E [w_{1} r_{1}] + E [w_{2} r_{2}] \\ = w_{1} E [r_{1}] + w_{2} E [r_{2}] \end{array} (3.6)$

$\begin{array}{l} Var (r_{V}) & = Var (w_{1} r_{1}) + Var (w_{2} r_{2}) + 2 Cov (w_{1} r_{1}, w_{2} r_{2}) \\ = w_{1}^{2} Var (r_{1}) + w_{2}^{2} Var (r_{2}) + 2 w_{1} w_{2} Cov (r_{1}, r_{2}) \\ = w_{1}^{2} Var (r_{1}) + w_{2}^{2} Var (r_{2}) + 2 w_{1} w_{2} Corr (r_{1}, r_{2}) \sqrt{Var (r_{1})} \sqrt{Var (r_{2})} . \end{array} (3.7)$

Here, we define the coefficient of correlation between two random variables as follows:

$Corr (r_{1}, r_{2}) = \frac{Cov (r_{1}, r_{2})}{\sqrt{Var (r_{1}) Var (r_{2})}} \in [- 1, 1] .$

Note that if the variance of one of the random variables is zero then the correlation coefficient is undefined.

Proposition 3.1.

The variance of the return on a portfolio without short selling (i.e., both w1 and w2 are nonnegative) cannot exceed the greater of the variances of the underlying asset returns:

$0 \leq Var (r_{V}) \leq \max {Var (r_{1}), Var (r_{2})} .$

Proof. Since the value of the correlation coefficient is always between −1 and 1, from (3.7) we obtain that

$\begin{array}{l} Var (r_{V}) & \leq w_{1}^{2} Var (r_{1}) + w_{2}^{2} Var (r_{2}) + 2 w_{1} w_{2} \sqrt{Var (r_{1})} \sqrt{Var (r_{2})} \\ \leq {(w_{1} \sqrt{Var (r_{1})} + w_{2} \sqrt{Var (r_{2})})}^{2} \\ \leq {(w_{1} + w_{2})}^{2} \max {Var (r_{1}), Var (r_{2})} = \max {Var (r_{1}), Var (r_{2})} . \end{array}$

On the other hand, the variance is always a nonnegative quantity.

Introduce the following notation for the expected returns, the variances of returns, and the correlation coefficient:

$μ_{i} = E [r_{i}], σ_{i}^{2} = Var (r_{i}); (i = 1, 2); ρ_{12} = Corr (r_{1}, r_{2}) .$

Moreover, denote $μ_{V} = E [r_{V}] and σ_{V}^{2} = Var (r_{V})$ . In this notation, Equations (3.6) and (3.7) take the respective forms:

$μ_{V} = w_{1} μ_{1} + w_{2} μ_{2} and σ_{V}^{2} = w_{1}^{2} σ_{1}^{2} + w_{2}^{2} σ_{2}^{2} + 2 ρ_{12} w_{1} w_{2} σ_{1} σ_{2} . (3.8)$

Example 3.3.

Consider two risky assets with the following probability distributions of their returns:

Scenario ω	Probability ℙ(ω)	Return r1	Return r2
ω1	0.1	−20%	30%
ω2	0.6	5%	10%
ω3	0.3	10%	−20%

Calculate the expected returns, μi, standard deviations, σi, and correlation coefficient of returns, ρ12.

Solution. To compute the mathematical expectation of a random variable X on a finite sample space Ω, we use the formula

$E [X] = \sum_{ω \in Ω} X (ω) ℙ (ω) .$

The expected returns are

$\begin{array}{l} μ_{1} = E [r_{1}] = \sum_{i = 1}^{3} r_{1} (ω^{i}) ℙ (ω^{i}) = (- 0.2) \cdot 0.1 + 0.05 \cdot 0.6 + 0.1 \cdot 0.3 = 0.04 = 4 %, \\ μ_{2} = E [r_{2}] = \sum_{i = 1}^{3} r_{2} (ω^{i}) ℙ (ω^{i}) = 0.3 \cdot 0.1 + 0.1 \cdot 0.6 + (- 0.2) \cdot 0.3 = 0.03 = 3 % . \end{array}$

Using the fact that Var(X) = E[(X − E[X])2] and Cov(X, Y) = E[(X − E[X])(Y − E[Y])], we similarly obtain:

$\begin{array}{l} Var (r_{1}) = {(- 0.2 - 0.04)}^{2} \cdot 0.1 + {(0.05 - 0.04)}^{2} \cdot 0.6 + {(0.1 - 0.04)}^{2} \cdot 0.3 = 0.0069, \\ Var (r_{2}) = {(0.3 - 0.03)}^{2} \cdot 0.1 + {(0.1 - 0.03)}^{2} \cdot 0.6 + {(- 0.2 - 0.03)}^{2} \cdot 0.3 = 0.02610, \\ Cov (r_{1}, r_{2}) = (- 0.2 - 0.04) \cdot (0.3 - 0.03) \cdot 0.1 + (0.05 - 0.04) \cdot (0.1 - 0.03) \cdot 0.6 \\ + (0.1 - 0.04) \cdot (- 0.2 - 0.03) \cdot 0.3 = - 0.0102. \end{array}$

The standard deviations are

$σ_{1} = \sqrt{Var (r_{1})} = \sqrt{0.0069} ≅ 0.08307, σ_{2} = \sqrt{Var (r_{2})} = \sqrt{0.02610} ≅ 0.16156.$

The correlation coefficient ρ12 is

$ρ_{12} = \frac{Cov (r_{1}, r_{2})}{\sqrt{Var (r_{1}) Var (r_{2})}} ≅ \frac{- 0.0102}{0.08307 \cdot 0.16156} ≅ - 0.76007 ≅ - 76 % .$

Example 3.4.

Find an optimal allocation of the initial wealth V0 = 1000 between two risky assets from Example 3.3 when attempting to maximize the expected value, E[u(VT)], of an exponential utility function u(x) = 1 − e−0.01x of the wealth VT.

Solution. Let the weights of a portfolio V in the two assets be w1 = x and w2 = 1 − x, respectively. The return on such a portfolio is rV (x)= xr1 + (1 − x)r2. At the end of the period, the portfolio value is VT = V0(1 + rV). Now we can find the optimal allocation by solving the following maximization problem:

$E [u (V_{T})] = E [1 - e^{- 0.01 V_{T}}] = E [1 - e^{- 0.01 V_{0} (1 + r_{V})}] = 1 - E [e^{- 10 (1 + x r_{1} + (1 - x) r_{2})}] \to \max_{x} .$

It is equivalent to minimizing E[e−10(1+xr1+(1−x)r2)] w.r.t. x. Evaluate the mathematical expectation:

$\begin{matrix} E [e^{- 10 (1 + x r_{1} + (1 - x) r_{2})}] & = & \sum_{i = 1}^{3} p_{i} e^{- 10 (1 + x r_{1} (ω^{i}) + (1 - x) r_{2} (ω^{i}))} \\ = & 0.1 e^{- 13+5 x} + 0.6 e^{- 11 + 0.5 x} + 0.3 e^{- 8 - 3 x} . \end{matrix}$

Differentiate the expected value w.r.t. x and equate the obtained derivative to zero:

$0.5 e^{- 13 + 5 x} + 0.3 e^{- 11+0 .5 x} - 0.9 e^{- 8 - 3 x} = 0.$

The resulting equation can be solved numerically to yield the optimal value $x * ≅ 0.67431$ , where the expected utility function attains its maximum value. Therefore, the optimal allocation weights are w1 ≅ 67.431% and w2 ≅ 32.569%.

3.2.2 Portfolio Lines

Consider two risky assets with respective returns r1 and r2. It is a typical situation when the joint probability distribution of the returns is unknown. However, it may be possible to estimate the moments of the returns from historical data. Suppose we only know the expected returns and variances of the returns, $μ_{i}, σ_{i}^{2}, i = 1, 2$ , and the correlation coefficient ρ12. Every portfolio in these assets can be characterized by its expected return and variance of its return.

On the (σ, μ)-plane, a portfolio V with allocation weights [w1,w2]Τ is represented by a point whose coordinates (σV, μV) are calculated by (3.8). Let us find a set of points on the (σ, μ)-plane that describes all possible portfolios in the two underlying assets. Since w1 + w2 = 1, all portfolios can be parameterized by a single variable x ∊ ℝ: w1 = x and w2 = 1 − x. Therefore, the set of all possible portfolios can be represented by a portfolio line (which can shrink to a single point in some extreme cases). Equations (3.8) can be rewritten as follows:

$μ_{V} (x) = x μ_{1} + (1 - x) μ_{2}, σ_{V}^{2} (x) = x^{2} σ_{1}^{2} + {(1 - x)}^{2} σ_{2}^{2} + 2 x (1 - x) σ_{1} σ_{2} ρ_{12} (3.9)$

with x ∊ (−∞, ∞). For portfolios without short selling (i.e., w1 ≥ 0 and w2 ≥ 0), we have that 0 ≤ x ≤ 1.

3.2.2.1 Case with |ρ12| = 1

First, let ρ12 = 1. Then from (3.9) we obtain that the variance of return on portfolio V is given by $σ_{V}^{2} (x) = {(x σ_{1} + (1 - x) σ_{2})}^{2}$ , and hence σV (x)= |xσ1 + (1 − x)σ2|. The portfolio line is described by

$σ_{V} (x) = | x (σ_{1} - σ_{2}) + σ_{2} | and μ_{V} (x) = x μ_{1} + (1 - x) μ_{2}, x \in ℝ .$

Let us assume that μ1 ≠ μ2 and σ1 ≠ σ2 (we leave the other cases as exercises for the reader). We can solve the second equation for x to obtain $x = \frac{μ_{V} - μ_{2}}{μ_{1} - μ_{2}}$ . Substituting this expression in the formula for σV gives us the following relationship:

$σ_{V} = | σ_{2} + (σ_{1} - σ_{2}) \frac{μ_{V} - μ_{2}}{μ_{1} - μ_{2}} | \Rightarrow σ_{V} = | \frac{σ_{1} - σ_{2}}{σ_{1} - μ_{2}} μ_{V} + \frac{σ_{2} μ_{1} - σ_{1} μ_{2}}{μ_{1} - μ_{2}} | .$

As we can see, the standard deviation σV is a piecewise-linear function of μV :

$σ_{V} = {\begin{array}{l} a μ_{V} + b if μ_{V} \geq - \frac{b}{a}, \\ - (a μ_{V} + b) if μ_{V} < - \frac{b}{a}, \end{array} where a := \frac{σ_{1} - σ_{2}}{μ_{1} - μ_{2}} and b := \frac{σ_{2} μ_{1} - σ_{1} μ_{2}}{μ_{1} - μ_{2}} .$

The plot of σV as a function of μV is a broken line with two half-lines. It is interesting to observe that there is a portfolio with zero variance (i.e., a risk-free portfolio). Indeed, $σ_{V} = 0 iff μ_{V} = \frac{σ_{2} μ_{1} - σ_{1} μ_{2}}{σ_{1} - σ_{2}}$ . The weights w1 = x and w2 = 1 − x can be obtained by solving the equation xσ1 + (1 − x)σ2 = 0 for x. Hence, the weights of a risk-free portfolio are

${\hat{w}}_{1} = \frac{σ_{2}}{σ_{2} - σ_{1}} and {\hat{w}}_{2} = \frac{σ_{1}}{σ_{1} - σ_{2}} . (3.10)$

One of the weights is negative, hence short selling is necessary to construct a risk-free portfolio.

Now let us find what part of the portfolio line corresponds to portfolios without short selling. The portfolios with weights (0, 1) and (1, 0) are the endpoints of such a set. By changing x from 0 to 1, we continuously move the point along the line of portfolios without short selling from one endpoint to the other. Since the portfolio with σV = 0 has a negative weight, the no-short-selling line is a segment lying on one of two rays. The final result of our analysis is presented in Figure 3.4a.

Figure 3.4

Figure showing a typical portfolio line for the case with |ρ12| = 1. The bold part indicates portfolios without short selling.

A typical portfolio line for the case with |ρ12| = 1. The bold part indicates portfolios without short selling.

Similarly, we can construct the portfolio line for the case with ρ12 = −1. The portfolio line is described by

$σ_{V} (x) = | x (σ_{1} + σ_{2}) - σ_{2} | and μ_{V} (x) = x μ_{1} + (1 - x) μ_{2} with x \in ℝ .$

By excluding x from the above equations, we obtain

$σ_{V} = | \frac{σ_{1} + σ_{2}}{μ_{1} - μ_{2}} μ_{V} - \frac{σ_{2} μ_{1} + σ_{1} μ_{2}}{μ_{1} - μ_{2}} | .$

Again, the plot of σV as a function of μV is a broken line. Now σV = 0 iff $μ_{V} = \frac{σ_{2} μ_{1} + σ_{1} μ_{2}}{σ_{1} + σ_{2}}$ .

The weights of the risk-free portfolio are

${\hat{w}}_{1} = \frac{σ_{2}}{σ_{1} + σ_{2}} and {\hat{w}}_{2} = \frac{σ_{1}}{σ_{1} + σ_{2}} . (3.11)$

Both weights are positive, so no short selling is required to construct a portfolio with a zero variance of return. The no-short-selling line is a broken line segment lying on both half-lines. The result of our analysis is given in Figure 3.4b.

3.2.2.2 Case with |ρ12| < 1

Excluding x from (3.9) and expressing σ2 as a function of μ gives

$σ^{2} = \frac{{(μ - μ_{2})}^{2}}{{(μ_{1} - μ_{2})}^{2}} σ_{1}^{2} + \frac{{(μ - μ_{1})}^{2}}{{(μ_{1} - μ_{2})}^{2}} σ_{2}^{2} - 2 \frac{(μ - μ_{1}) (μ - μ_{2})}{{(μ_{1} - μ_{2})}^{2}} σ_{12} σ_{1} σ_{2} .$

After doing some algebra, we can bring this equation to the form:

$\begin{matrix} σ^{2} & = & A μ^{2} - 2 B μ + C, where \\ A & = & \frac{σ_{1}^{2} - 2 ρ_{12} σ_{1} σ_{2} + σ_{2}^{2}}{{(μ_{1} - μ_{2})}^{2}}, \\ B & = & \frac{μ_{1} σ_{2}^{2} + μ_{2} σ_{1}^{2} - 2 ρ_{12} σ_{1} σ_{2} (μ_{1} + μ_{2})}{{(μ_{1} - μ_{2})}^{2}}, \\ C & = & \frac{{(μ_{1} σ_{2})}^{2} + {(μ_{2} σ_{1})}^{2} - 2 ρ_{12} σ_{1} σ_{2} μ_{1} μ_{2}}{{(μ_{1} - μ_{2})}^{2}} . \end{matrix}$

The curve defined by the above equation is a hyperbola. Indeed, let us rewrite the equation as follows: $σ^{2} = A {(μ - \frac{B}{A})}^{2} + D$ , where $D = C - \frac{B^{2}}{A}$ . By changing variables from (σ, μ) to $(x = \frac{σ}{\sqrt{D}}, y = \frac{\sqrt{A}}{\sqrt{D}} μ - \frac{B}{\sqrt{A D}})$ , one can easily obtain the canonical equation of a hyperbola: x2 − y2 = 1. A typical portfolio line is given in Figure 3.5.

Figure 3.5

Figure showing a typical portfolio line for the case with −1 < ρ12 < 1. The bold part indicates portfolios without short selling.

A typical portfolio line for the case with −1 < ρ12 < 1. The bold part indicates portfolios without short selling.

As is seen from Figures 3.4 and 3.5, the plot of a portfolio line is a hyperbola for the case with ρ12 ∊ (−1, 1) and a broken line for the extreme case with |ρ12| = 1. The evolution of a portfolio line when μ1,2 and σ1,2 are fixed and ρ12 is changing from −1 to 1 is represented in Figure 3.6.

Figure 3.6

Figure showing portfolio lines for varying ρ12 and fixed μ's and σ's. The parameter ρ12 is changing from −1 to 1 with the step size of 0:5. The bold parts indicate portfolios without short selling.

Portfolio lines for varying ρ12 and fixed μ's and σ's. The parameter ρ12 is changing from −1 to 1 with the step size of 0:5. The bold parts indicate portfolios without short selling.

3.2.2.3 Case with a Risk-Free Asset

Suppose that one of two assets (say, asset 2) in our portfolio is risk-free, that is, the variance of its return is zero: $σ_{2}^{2} = Var (r_{2}) = 0$ . Hence, the return r2 is constant: r2 ≡ r.

The formula of the variance in (3.9) reduces to $σ_{V}^{2} (x) = x^{2} σ_{1}^{2}$ or just $σ_{V} (x) = | x | σ_{1}$ . So the standard deviation σV of such a portfolio depends on the weight w1 of the risky asset as follows: σV = |w1|σ1. The portfolio line is described by a piecewise linear function: $σ_{V} = σ_{1} | \frac{μ_{V} - r}{μ_{1} - r} |$ . Thus, the portfolio plot is a broken line with its vertex at the point that corresponds to the risk-free asset (see Figure 3.7).

Figure 3.7

Figure showing portfolio line for one risky and one risk-free asset. The risk-free rate of return is r. The bold part indicates portfolios without short selling.

Portfolio line for one risky and one risk-free asset. The risk-free rate of return is r. The bold part indicates portfolios without short selling.

3.2.3 The Minimum Variance Portfolio

As is seen in Figures 3.4 and 3.6, there is always a portfolio with the smallest possible variance $σ_{V}^{2}$ . We already found risk-free portfolios with zero variance for the case with |ρ12| = 1. Let us now find the general solution to this problem.

Theorem 3.2.

Suppose that |ρ12| < 1 or σ1 ≠ σ2 holds. The portfolio with the minimum variance is attained at

${\hat{w}}_{1} = \frac{σ_{2}^{2} - ρ_{12} σ_{1} σ_{2}}{σ_{1}^{2} + σ_{2}^{2} - ρ_{12} σ_{1} σ_{2}} a n d {\hat{w}}_{2} = \frac{σ_{1}^{2} - ρ_{12} σ_{1} σ_{2}}{σ_{1}^{2} + σ_{2}^{2} - ρ_{12} σ_{1} σ_{2}} . (3.12)$

The variance of the portfolio is

$σ_{mv}^{2} = \frac{(1 - ρ_{12}^{2}) σ_{1}^{2} σ_{2}^{2}}{σ_{1}^{2} + σ_{2}^{2} - 2 ρ_{12} σ_{1} σ_{2}} . (3.13)$

Proof. By differentiating the variance $σ_{V}^{2}$ given by (3.9) w.r.t. x and equating the derivative to zero, we obtain the following linear equation for x:

$\frac{d σ_{V}^{2}}{d x} = 2 (σ_{1}^{2} + σ_{2}^{2} + 2 ρ_{12} σ_{1} σ_{2}) x - 2 (σ_{2}^{2} - ρ_{12} σ_{1} σ_{2}) = 0.$

The solution is

$x_{0} = \frac{σ_{2}^{2} - ρ_{12} σ_{1} σ_{2}}{σ_{1}^{2} + σ_{2}^{2} - 2 ρ_{12} σ_{1} σ_{2}} . (3.14)$

Thus, for the weights ${\hat{w}}_{1} = x_{0}$ and ${\hat{w}}_{2} = 1 - x_{0}$ , we immediately obtain (3.12). Since the second derivative of σV2 w.r.t. x is positive everywhere:

$\frac{d^{2} σ_{V}^{2}}{d x^{2}} = 2 (σ_{1}^{2} + σ_{2}^{2} - 2 ρ_{12} σ_{1} σ_{2}) > 0,$

the variance σV2 attains its smallest value at x0. Substituting x0 in (3.8) gives us Equation (3.13) for the minimum variance.

Clearly, the formulae in (3.12) and (3.13) work for both cases when |ρ12| = 1 or |ρ12| < 1. If |ρ12| = 1, then σmv2 = 0 in (3.13) and the formulae in (3.12) reduce to (3.10) or (3.11) depending on the sign of ρ12.

3.2.3.1 Case without Short Selling

While proving Theorem 3.2, we did not take into account whether short sells are allowed. Let us find the minimum variance portfolio without short sells, i.e., with nonnegative weights. The function σV2 in (3.9) attains its minimum value on [0, 1] either at one of the boundary points x ∊{0, 1} or at the point x0 given by (3.14) provided 0 <x0 < 1. Both weights ${\hat{w}}_{1}$ and ${\hat{w}}_{2}$ in (3.12) are positive iff ρ12σ2 < σ1 and ρ12σ1 < σ2, or, equivalently, if $ρ_{12} < \min {\frac{σ_{1}}{σ_{2}}, \frac{σ_{2}}{σ_{1}}}$ . If that is the case, then it is possible to construct a portfolio without short selling with risk lower than that of any of the individual assets.

Otherwise, when $ρ_{12} \geq \min {\frac{σ_{1}}{σ_{2}}, \frac{σ_{2}}{σ_{1}}}$ , the minimum variance portfolio without short selling is composed of shares of only one of the assets. If σ1 < σ2 (hence x0 > 1), then the portfolio has only shares of asset 1 and its variance is σ12. If σ2 <σ1 (hence x0 < 0), then the portfolio has only shares of asset 2 and its variance is σ22. For the special case with σ1 = σ2 and ρ12 = 1, the variance is the same for any portfolio: $σ_{V}^{2} = σ_{1}^{2} = σ_{2}^{2}$ .

3.2.4 Selection of Optimal Portfolios

A typical problem of a risk manager is the selection of an optimal portfolio. Let us consider a portfolio with two risky assets. Suppose that the returns r1 and r2 follow a bivariate normal distribution, which is characterized by five parameters, namely, the expected returns, μ1 and μ2, the variances of returns, σ12 and σ22 and the correlation coefficient ρ12 = Corr(r1,r2).

There are many criteria that can be used to select an optimal portfolio. We consider three examples: minimization of the risk, maximization of an expected utility function of the return, and minimization of the probability of loss. All examples will be illustrated with the following data:

$μ_{1} = 10 %, σ_{1} = 20 %, μ_{2} = 15 %, σ_{2} = 40 %, and ρ_{12} = - 20 % . (3.15)$

3.2.4.1 Minimum Variance Portfolio

The variance of the terminal portfolio value is

$Var (V_{T}) = Var (V_{0} (1 + r_{V})) = V_{0} (1 + Var (r_{V})) .$

So, minimization of Var(VT) is equivalent to minimization of $σ_{V}^{2} = Var (r_{V})$ . Let us find the weights ${\hat{w}}_{1} = x_{0}$ and ${\hat{w}}_{2} = 1 - x_{0}$ that minimize the variance of the portfolio return:

$x_{0} = \frac{σ_{2}^{2} - ρ_{12} σ_{1} σ_{2}}{σ_{1}^{2} + σ_{2}^{2} - 2 ρ_{12} σ_{1} σ_{2}} = \frac{{0.4}^{2} - (- 0.2) \cdot 0.2 \cdot 0.4}{{0.2}^{2} + {0.4}^{2} - 2 \cdot (- 0.2) \cdot 0.2 \cdot 0.4} ≅ 0.7586.$

Thus, ${\hat{w}}_{1} ≅ 75.86 %$ and ${\hat{w}}_{2} ≅ 24.14 %$ . The expected return and variance of return are, respectively,

$\begin{matrix} μ_{V} & = & 0.1 \cdot 0.7586 + 0.15 \cdot 0.2414 ≅ 0.1121 = 11.21 %, \\ σ_{V}^{2} & = & {0.2}^{2} \cdot {0.7586}^{2} + {0.4}^{2} \cdot {0.2414}^{2} + 2 \cdot (- 0.2) \cdot 0.2 \cdot 0.4 \cdot 0.7586 \cdot 0.2414 ≅ 0.02648. \end{matrix}$

Thus, σV ≅ 0.1627 = 16.27%. Notice that μ1 <μV <μ2 but σV < min{σ1,σ2} = min{0.2, 0.4} = 0.2. We managed to decrease the risk by diversifying the portfolio.

3.2.4.2 Maximum Expected Utility Portfolio

Let us find a portfolio that maximizes the mathematical expectation of the exponential utility function, u(V) = 1 − eαV with α > 0, of the wealth VT = (1+ rV)V0 with some initial capital V0 > 0:

$E [u (V_{T})] = E [1 - e^{- α V_{0} (1 + r_{V})}] \to \max . (3.16)$

Choosing the fraction that maximizes utility is slightly more complicated. If we invest fraction x in the high-risk asset, then the terminal wealth will be VT (x) = (1+ rV (x))V0, where we recall that the rate of return rV (x) is normally distributed with mean μV (x) and variance $σ_{V}^{2} (x)$ . Using (3.9) gives μV (x)= μ2 + x(μ1 − μ2) and $σ_{V}^{2} (x) = A x^{2} + 2 B x + C$ , where

$A = σ_{1}^{2} + σ_{2}^{2} - 2 ρ_{12} σ_{1} σ_{2}, B = (ρ_{12} σ_{1} σ_{2} - σ_{2}^{2}), C = σ_{2}^{2} .$

The expected utility is

$\begin{matrix} E [1 - e^{- α V_{T} (x)}] & = & 1 - E [e^{- α V_{T} (x)}] = 1 - e^{- α V_{0}} E [e^{- α V_{0 r V} (x)}] \\ = & 1 - e^{- α V_{0}} e^{- α V_{0 μ V (x) + α^{2} V_{0}^{2} σ_{V}^{2} (x) / 2}} . \end{matrix}$

It therefore suffices to maximize the function

$μ_{V} (x) - \frac{α V_{0} σ_{V}^{2} (x)}{2} .$

Substituting the above expressions for μV and σV gives us the following target function to be maximized:

$f (x) := μ_{2} + x (μ_{1} - μ_{2}) - \frac{α V_{0}}{2} (A x^{2} + 2 B x + C) .$

Differentiate f with respect to x and equate the derivative to zero to obtain

$f^{'} (x) = (μ_{1} - μ_{2}) - α V_{0} (A x + B) = 0.$

The solution is

$x^{*} = - \frac{B}{A} + \frac{1}{A} \frac{μ_{1} - μ_{2}}{α V_{0}} = \frac{(μ_{1} - μ_{2}) / (α V_{0}) + σ_{2}^{2} - ρ_{12} σ_{1} σ_{2}}{σ_{1}^{2} + σ_{2}^{2} - 2 ρ_{12} σ_{1} σ_{2}} .$

Since f″ (x)= −αV0A < 0, the function f is a concave function. Therefore, f attains its maximum at x*. Suppose that αV0 = 1. Now we can do computations for our problem with the data in (3.15). The optimal weights are

${\hat{w}}_{1} = x^{*} ≅ 0.5431 and {\hat{w}}_{2} = 1 - x^{*} ≅ 0.4569.$

The expected return is μV (x0) ≅ 12.28%; the volatility of return is σV(x0) ≅ 19.30%.

One can consider other utility functions. However, in most cases we need to use a computational method to find the maximum of an expected utility function. The Taylor series approximation (3.5) can also be applied, as is demonstrated in the next example. Let us find an optimal portfolio when attempting to maximize the expected value of the square-root utility function of VT = (1+ rV) V0:

$E [u (V_{T})] = V_{0} E [\sqrt{1 + r_{V}}] \to \max . (3.17)$

Expand $\sqrt{1 + r}$ in a Taylor series about the point r = μV (x) to obtain

$E [\sqrt{1 + r_{V} (x)}] \approx \sqrt{1 + μ_{V} (x)} - \frac{1}{8} {(1 + μ_{V} (x))}^{- 3 / 2} σ_{V}^{2} (x),$

where rV (x)= xr1 + (1 − x)r2, and μV (x) and $σ_{V}^{2} (x)$ are given by (3.9). Now the maximization problem (3.17) reduces to

$f (x) := \sqrt{1 + x μ_{1} + (1 - x) μ_{2}} - \frac{x^{2} σ_{1}^{2} + {(1 - x)}^{2} σ_{2}^{2} + 2 x (1 - x) σ_{1} σ_{2} ρ_{12}}{8 {(1 + x μ_{1} + (1 - x) μ_{2})}^{3 / 2}} \to \max_{x} \cdot$

Equating the derivative of the function f to zero and solving numerically the equation obtained give the optimal weights: ${\hat{w}}_{1} ≅ 25.64 %$ and ${\hat{w}}_{2} ≅ 74.36 %$ . The resulting portfolio has the following expected return and volatility of return: μV = 13.72% and σV ≅ 29.16%.

Remark. Let us assume without loss of generality that σ1 <σ2. We would then expect that μ1 <μ2, otherwise no risk-averse investor would ever want to purchase the high-risk asset. To minimize the volatility σV we should invest $x_{0} = - \frac{B}{A}$ in the low-risk asset. To maximize the expected exponential utility, we invest $x * = - \frac{B}{A} + \frac{1}{A} \frac{μ_{1} - μ_{2}}{α V_{0}}$ . Clearly, we have

$x^{*} = x_{0} + \frac{1}{A} \frac{μ_{1} - μ_{2}}{α V_{0}} < x_{0},$

since μ1 < μ2 holds. There are several important insights here. First, we see that maximizing expected utility is not the same as minimizing volatility, even for a risk-averse investor. Risk-averse investors are willing to take on a certain amount of risk provided they are adequately compensated. To see this, note that the difference x0 − x* is increasing in μ2 − μ1; the greater the compensation being offered the more the risk-averse investor will allocate to the riskier asset. Risk aversion is therefore not the same as complete risk avoidance. Finally, if V0 is large, then x0 − x* will be small; the increase in wealth is simply not worth the possibility of losing large sums when marginal returns to wealth are small (as they are for wealthy individuals). Finally, we can observe that if σ2 is large, then A is large, and the difference x0 − x* is small.

3.2.4.3 Minimum Loss-Probability Portfolio

Suppose we wish to find the allocation weights when attempting to minimize the probability that the return on the portfolio is less than a certain threshold r0:

$ℙ (r_{V} \leq r_{0}) \to \min .$

Given that r1 and r2 follow a bivariate normal distribution, the probability distribution of rV is $N o r m (μ_{V} (x), σ_{V}^{2} (x))$ . Therefore,

$\begin{matrix} ℙ (r_{V} (x) \leq r_{0}) & = & ℙ (\frac{r_{V} (x) - μ_{V} (x)}{σ_{V} (x)} \leq \frac{r_{0} - μ_{V} (x)}{σ_{V} (x)}) \\ = & ℙ (Z \leq \frac{r_{0} - μ_{V} (x)}{σ_{V} (x)}) = N (\frac{r_{0} - μ_{V} (x)}{σ_{V} (x)}), \end{matrix}$

where Z denotes a standard normal random variable and $N$ is a standard normal CDF. Since a normal CDF is a strictly increasing function of its argument, it is sufficient to solve

$\frac{μ_{V} (x) - r_{0}}{σ_{V} (x)} \to \max_{x} . (3.18)$

In fact, Equation (3.18) relates to the so-called Sharpe ratio. The Sharpe ratio is a measure of the excess return (or risk premium) per unit of risk in an investment portfolio. It is named after William Forsyth Sharpe. The Sharpe ratio is defined as

$\frac{E [r_{V} - r_{0}]}{\sqrt{Var (r_{V} - r_{0})}}, (3.19)$

where r0 is the return on a benchmark asset, such as the risk-free rate of return, E[rV − r0] is the expected value of the excess of the portfolio return rV over the benchmark return, and Var(rV − r0) is the variance of the excess return. Since r0 is constant, we have

$E [r_{V} - r_{0}] = E [r_{V}] - r_{0} = μ_{V} - r_{0} and Var (r_{V} - r_{0}) = Var (r_{V}) = σ_{V}^{2} .$

The Sharpe ratio is used to characterize how well the return of a portfolio compensates the investor for the risk taken. When comparing two portfolios against the same benchmark asset, the portfolio with the higher Sharpe ratio gives more return for the same level of risk. Investors are often advised to pick investments with high Sharpe ratios.

The solution to (3.18) can be obtained by using standard methods of calculus: differentiate the left-hand side of (3.18) w.r.t. x, equate the derivative to zero, and then solve the obtained equation for x. As a result, we obtain the following allocation weights:

$\begin{matrix} {\hat{w}}_{1} = \frac{(μ_{1} - r_{0}) σ_{2}^{2} - (μ_{2} - r_{0}) ρ_{12} σ_{1} σ_{2}}{(μ_{1} - r_{0}) σ_{2}^{2} + (μ_{2} - r_{0}) σ_{1}^{2} - (μ_{1} + μ_{2} - 2 r_{0}) ρ_{12} σ_{1} σ_{2}}, \\ {\hat{w}}_{2} = \frac{(μ_{2} - r_{0}) σ_{1}^{2} - (μ_{1} - r_{0}) σ_{1} σ_{2} ρ_{12}}{(μ_{1} - r_{0}) σ_{2}^{2} + (μ_{2} - r_{0}) σ_{1}^{2} - (μ_{1} + μ_{2} - 2 r_{0}) σ_{1} σ_{2} ρ_{12}} . \end{matrix} (3.20)$

Let the risk-free rate be r0 = 5%. Substituting (3.15) into (3.20) gives us the following solution: ${\hat{w}}_{1} = \frac{2}{3}$ and ${\hat{w}}_{2} = \frac{1}{3}$ . The expected return and volatility of the portfolio return are μV ≅ 11.67% and σV ≅ 16.87%, respectively.

3.3 Portfolio Optimization for N Assets

3.3.1 Portfolios of Several Assets

Consider a market model with N different assets $A_{t}^{1}, A_{t}^{2}, ..., A_{t}^{N}$ , where t ∊{0,T}. The return on the ith asset is $r_{i} = \frac{A_{T}^{i} - A_{0}^{i}}{A_{0}^{i}}$ . Suppose a portfolio is constructed from these base assets. Let xi be the number of shares of asset i with i = 1, 2,...,N. The time-t portfolio value is $V_{t} = \sum_{i = 1}^{N} x_{i} A_{t}^{i}$ for t ∊{0,T}. The return on the portfolio is a linear combination of the returns on the assets:

$\begin{array}{l} r_{V} & = \frac{V_{T} - V_{0}}{V_{0}} = \sum_{i = 1}^{N} \frac{x_{i} (A_{T}^{i} - A_{0}^{i})}{V_{0}} \\ = \sum_{i = 1}^{N} \frac{x_{i} A_{0}^{i}}{V_{0}} \frac{A_{T}^{i} - A_{0}^{i}}{A_{0}^{i}} = \sum_{i = 1}^{N} \frac{x_{i} A_{0}^{i}}{V_{0}} r_{i} . \end{array}$

Define the allocation weights $w_{i} = \frac{x_{i} A_{0}^{i}}{V_{0}}$ with i = 1, 2,...,N of funds between the N base assets. The formula for the return rV takes the following compact form:

$r_{V} = w_{1} r_{1} + w_{2} r_{2} + \dots + w_{N} r_{N} = \sum_{i = 1}^{N} w_{i} r_{i} .$

Let us denote

$W := {[w_{1} w_{2} ... w_{N}]}^{Τ} \in ℝ^{N} .$

Clearly, the sum of the weights is one. This fact can be written in vector form:

$u^{Τ} w = 1, where u := {[11 ... 1]}^{Τ} \in ℝ^{N} . (3.21)$

Here xΤ denotes the transpose of a vector x. Here, we operate with column vectors.

We denote μi = E[ri]—the expected return on asset i, $σ_{i}^{2} = Var (r_{i})$ —the variance of the return on asset i, and cij = Cov(ri, rj)—the covariance between returns ri and rj for i, j = 1, 2,...,N. The expected returns and covariances between returns can be respectively arranged into an N × 1 column vector and an N × N matrix:

$m := [\begin{matrix} μ_{1} \\ μ_{2} \\ ⋮ \\ μ_{N} \end{matrix}] and C := [\begin{array}{l} c_{11} & c_{12} & ... & c_{1 N} \\ c_{21} & c_{22} & ... & c_{2 N} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ c_{N 1} & C_{N 2} & ... & c_{n n} \end{array}] .$

The matrix C is called a covariance matrix. The covariance σXY ≡ Cov(X, Y) of two random variables X and Y can be factorized into a product of the standard deviations, σX and σY , and the coefficient of correlation between X and Y denoted by Corr(X, Y) ≡ ρXY as follows:

$σ_{X Y} = σ_{X} ρ_{X Y} σ_{Y} .$

Therefore, the covariance matrix C can be represented as a product of a diagonal matrix filled with standard deviations of returns, $σ_{i} := \sqrt{Var (r_{i})}$ , and a correlation matrix whose entries are coefficients of correlation between returns, ρij ≡ Corr(ri, rj), i, j = 1, 2....,N:

$C = [\begin{array}{l} σ_{1} & 0 & ... & 0 \\ 0 & σ_{2} & ... & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & ... & σ_{N} \end{array}] [\begin{array}{l} 1 & ρ_{12} & ... & ρ_{1 N} \\ ρ_{21} & 1 & \dots & ρ_{2 N} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ ρ_{N 1} & ρ_{N 2} & ... & 1 \end{array}] [\begin{array}{l} σ_{1} & 0 & \dots & 0 \\ 0 & σ_{2} & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & ... & σ_{N} \end{array}] .$

Here, we use the fact that Corr(X, X) = 1 for every random variable X, hence ρii = 1 for all i = 1, 2,..., N.

The covariance matrix is symmetric (i.e., C = CΤ) and positive definite, i.e., wΤ Cw > 0 for every nonzero vector w ∊ ℝN. Since C is positive definite, it is a nonsingular matrix and hence its inverse matrix C−1 exists. There exist several necessary and sufficient criteria to determine if a symmetric real matrix C is positive definite, including the following.

All eigenvalues of C are positive.
All the leading principal minors are positive. The kth leading principal minor of C is the determinant of the upper-left k-by-k corner of C, where k = 1, 2,...,N. This criterion is known as Sylvester's criterion.
There exists a unique lower triangular matrix L, with strictly positive diagonal elements, that allows the factorization of C into C = LLΤ. Such a factorization is called the Cholesky factorization.

Note that in general C can be a semi-positive definite matrix, meaning that wΤ Cw ≥ 0 for all w ∊ ℝN.

Let us find the mathematical expectation and variance of rV by applying well-known equations for the mathematical expectation and variance of a sum of (dependent) random variables. The expected return on the portfolio V with weights w is

$μ_{V} = E [r_{V}] = E [\sum_{i = 1}^{N} w_{i} r_{i}] = \sum_{i = 1}^{N} E [w_{i} r_{i}] = \sum_{i = 1}^{N} w_{i} μ_{i}; (3.22)$

the variance of rV is

$\begin{array}{l} σ_{V}^{2} & = Var (r_{V}) = Var (\sum_{i = 1}^{N} w_{i} r_{i}) \\ = Cov (\sum_{i = 1}^{N} w_{i} r_{i}, \sum_{j = 1}^{N} w_{j} r_{j}) = \sum_{i = 1}^{N} \sum_{j = 1}^{N} Cov (w_{i} r_{i}, w_{j} r_{j}) \\ = \sum_{i = 1}^{N} \sum_{j = 1}^{N} w_{i} w_{j} c_{i j} . \end{array} (3.23)$

The above equations can be written in matrix-vector form:

$μ_{V} = m^{Τ} w; (3.24)$

$σ_{V}^{2} = w^{Τ} C w . (3.25)$

Note that we do not assume any probability distribution for the vector of returns. Our analysis of portfolios is entirely based on the knowledge of the vector of expected returns m and covariance matrix C.

In the next sections, we shall solve the following two problems.

Find a portfolio with the minimum variance. It will be called the minimum variance portfolio.
Find a portfolio with the minimum variance among all portfolios whose expected return is fixed and equal to a given number. We may obtain different solutions for portfolios with or without short sells. The set of such portfolios parameterized by the expected return is called the minimum variance (portfolio) line.

3.3.2 The Minimum Variance Portfolio

To find the minimum variance portfolio, we need to solve

$f (w) := w^{Τ} C w \to_{w}^{\min} (3.26)$

subject to the constraint

$u^{Τ} w = 1. (3.27)$

Let us use the method of Lagrange multipliers. First, we find the critical points of the function

$F (w, λ) := w^{Τ} C w - λ (u^{Τ} w - 1) .$

The partial derivatives of F with respect to wi for i = 1, 2,...,N are

$\begin{array}{l} \frac{\partial F}{\partial w_{i}} (W, λ) & = \frac{\partial}{\partial w_{i}} (\sum_{i = 1}^{N} \sum_{j = 1}^{N} w_{i} w_{j} c_{i j} - λ \sum_{i = 1}^{N} w_{i} + λ) \\ = 2 \sum_{j = 1}^{N} w_{j} c_{i j} - λ . \end{array}$

Equating them to zero gives us the following linear equations:

$2 \sum_{j = 1}^{N} w_{j} c_{i j} - λ = 0 for all i = 1, 2, ..., N .$

Let cΤ j denote the jth row of matrix C. Then the above equations can be rewritten in vector form:

$2 c_{j}^{Τ} w - λ = 0 for i = 1, 2, ..., N .$

Finally, we have 2Cw − λu = 0. Multiplying both parts by the inverse matrix C−1 from the left gives

$2 C^{- 1} C w - λ C^{- 1} u = 2 w - λ C^{- 1} u and C^{- 1} 0 = 0 \Rightarrow 2 w - λ C^{- 1} u = 0 .$

Solve this equation for w to obtain that $w = \frac{λ}{2} C^{- 1} u$ . The only missing variable is λ. Substitute the expression for w in the constraint (3.27) to obtain

$1 = \frac{λ}{2} u^{Τ} C^{- 1} u \Rightarrow λ = \frac{2}{u^{Τ} C^{- 1} u} .$

Finally, we obtain the weight vector for the minimum variance portfolio:

$w_{mv} = \frac{C^{- 1} u}{u^{Τ} C^{- 1} u} . (3.28)$

Since the matrix of second derivatives of the function f(w) is 2C (which is positive definite), the function F (w, λ) is a concave function of w for every value of λ. Therefore, the function f(w) has a minimum at wmv. The minimum variance can be computed by putting the weights wmv in (3.23):

$σ_{mv}^{2} = w_{mv}^{Τ} C w_{mv} = \frac{1}{u^{Τ} C^{- 1} u} .$

As an example, let us consider the case of two assets. The covariance matrix C and its inverse can be written using $\sqrt{Var (r_{i})}, i = 1, 2,$ and ρ12 = Corr(r1,r2):

$C = [\begin{matrix} σ_{1}^{2} & ρ_{12} σ_{1} σ_{2} \\ ρ_{12} σ_{1} σ_{2} & σ_{2}^{2} \end{matrix}] and C^{- 1} = \frac{1}{1 - ρ_{12}^{2}} [\begin{matrix} \frac{1}{σ_{1}^{2}} & - \frac{ρ_{12}}{σ_{1} σ_{2}} \\ - \frac{ρ_{12}}{σ_{1} σ_{2}} & \frac{1}{σ_{2}^{2}} \end{matrix}] .$

The weight vector is then

$\begin{array}{l} W_{mv}^{T} = [w_{1}, w_{2}] & = \frac{1}{\frac{1}{σ_{1}^{2}} - \frac{2 ρ_{12}}{σ_{1} σ_{2}} + \frac{1}{σ_{2}^{2}}} [\frac{1}{σ_{1}^{2}} - \frac{ρ_{12}}{σ_{1} σ_{2}}, \frac{1}{σ_{2}^{2}} - \frac{ρ_{12}}{σ_{1} σ_{2}}] \\ = [\frac{σ_{2}^{2} - ρ_{12} σ_{1} σ_{2}}{σ_{1}^{2} - 2 ρ_{12} σ_{1} σ_{2} + σ_{2}^{2}}, \frac{σ_{1}^{2} - ρ_{12} σ_{1} σ_{2}}{σ_{1}^{2} - 2 ρ_{12} σ_{1} σ_{2} + σ_{2}^{2}}] . \end{array}$

The resulting expression is identical to that of (3.12).

3.3.3 The Minimum Variance Portfolio Line

Now, we consider a set of portfolios with fixed expected return μ, i.e., mΤ w = μ. To find the minimum variance portfolio in such a set, we need to minimize f(w) := wΤ Cw subject to the constraints

$u^{Τ} w = 1 and m^{Τ} w = μ . (3.29)$

As a result, we obtain a family of minimum variance portfolios $w = \hat{w} (μ)$ parameterized by μ. On the risk-return plot, such a family is represented by a continuous line called the minimum variance line.

Again, to find the equation of the minimum variance line we apply the method of Lagrange multipliers. Introduce the function

$G (w, λ_{1}, λ_{2}) := w^{Τ} C w - λ_{1} (u^{Τ} w - 1) - λ_{2} (m^{Τ} w - μ),$

where λ1 and λ2 are Lagrange multipliers. Differentiate G w.r.t. weight wi and equate the derivative to zero:

$\frac{\partial G}{\partial w_{i}} = 2 \sum_{j = 1}^{N} w_{j} c_{i j} - λ_{1} - λ_{2} μ_{i} = 0 for i = 1, 2, ..., N .$

The above simultaneous linear equations can be expressed in matrix-vector form:

$2 C w - λ_{1} u - λ_{2} m = 0.$

By solving for the weights w, we obtain:

$w = C^{- 1} (\frac{λ_{1}}{2} u + \frac{λ_{2}}{2} m) = \frac{λ_{1}}{2} C^{- 1} u + \frac{λ_{2}}{2} C^{- 1} m . (3.30)$

The constraints (3.29) are revealed from equations $\frac{\partial G}{\partial λ_{i}} = 0$ for i = 1, 2. Now substitute this expression for w into the constraints (3.29) to obtain the following system of equations:

${\begin{array}{l} \frac{1}{2} u^{Τ} C^{- 1} u λ_{1} + \frac{1}{2} u^{Τ} C^{- 1} m λ_{2} = 1, \\ \frac{1}{2} m^{Τ} C^{- 1} u λ_{1} + \frac{1}{2} m^{Τ} C^{- 1} m λ_{2} = μ . \end{array} (3.31)$

Recall that a 2-by-2 system of linear equations

${\begin{array}{l} a_{11} x_{1} + a_{12} x_{2} = b 1, \\ a_{21} x_{1} + a_{22} x_{2} = b 2 \end{array}$

admits a unique solution $x_{1} = \frac{1}{D} | \begin{array}{l} b_{1} & a_{12} \\ b_{2} & a_{22} \end{array} | and x_{2} = \frac{1}{D} | \begin{array}{l} a_{11} & b_{1} \\ a_{21} & b_{2} \end{array} | with D := | \begin{array}{l} a_{11} & a_{12} \\ a_{21} & a_{22} \end{array} |$ provided D ≠ 0. Here, $| \begin{array}{l} a & b \\ c & d \end{array} | = a d - b c$ denotes the determinant of a 2-by-2 matrix. Solve the system (3.31) for λ1 and λ2 and then plug the solution into (3.30) to obtain the final formula for the portfolio weights:

$\hat{w} = \frac{1}{D} | \begin{array}{l} 1 & u^{Τ} C^{- 1} m \\ μ & m^{Τ} C^{- 1} m \end{array} | C^{- 1} u + \frac{1}{D} | \begin{array}{l} u^{Τ} C^{- 1} u & 1 \\ m^{Τ} C^{- 1} u & μ \end{array} | C^{- 1} m (3.32)$

with $D := | \begin{array}{l} u^{Τ} C^{- 1} u & u^{Τ} C^{- 1} m \\ m^{Τ} C^{- 1} u & m^{Τ} C^{- 1} m \end{array} | \neq 0$ . The determinants in (3.32) are linear functions of μ. Therefore, the weights of portfolios on the minimum variance line depend on μ linearly as well: $\hat{w} = μ a + b$ with

$\begin{array}{l} a := \frac{u^{Τ} C^{- 1} u C^{- 1} m - u^{Τ} C^{- 1} m C^{- 1} u}{D}, \\ b := \frac{m^{Τ} C^{- 1} m C^{- 1} u - m^{Τ} C^{- 1} u C^{- 1} m}{D} . \end{array}$

This observation allows us to describe the shape of the minimum variance line. Let us select two different portfolios with respective weights w′ and w″ on the line. Then the minimum variance line consists of portfolios with weights w = xw′ + (1 − x)w″ for x ∊ ℝ. Indeed, the weights of the two chosen portfolios satisfy w′ = μ′a + b and w″ = μ″a + b for some μ′ ≠ μ″. Every linear combination of the weights w′ and w″ satisfies the same equation:

$w = x w^{'} + (1 - x) w^{″} = (x μ^{'} + (1 - x) μ^{″}) a + (x + (1 - x)) b = μ_{x} a + b$

with μx = xμ′ + (1 − x)μ″. Conversely, for every μ ∊ ℝ there exist x ∊ ℝ so that μ = xμ′ + (1 − x)μ″. Therefore, the portfolios with weights xw′ + (1 − x)w, x ∊ ℝ, exhaust the whole minimum variance line. This result means that the minimum variance line has the same shape as that describing a set of portfolios constructed from two assets. The shape of the line (which is a hyperbola) does not depend on the number of assets. The set of admissible portfolios is represented by a planar domain bounded by the minimum variance line. The shape of this domain is known as the Markowitz bullet. All elementary portfolios consisting of individual assets lie inside the bullet, as shown in Figure 3.8.

Figure 3.8

Figure showing the set of admissible portfolios (the Markowitz bullet) in four underlying assets (which are marked by solid circles) bounded by the minimum variance line. The minimum variance portfolio is marked by a diamond.

The set of admissible portfolios (the Markowitz bullet) in four underlying assets (which are marked by solid circles) bounded by the minimum variance line. The minimum variance portfolio is marked by a diamond.

Example 3.5.

Let us consider a portfolio in three underlying assets whose expected returns, standard deviations of returns, and correlations between returns are as follows:

μ1 = 0.1,	σ1 = 0.2,	ρ12 = ρ21 = −0.2,
μ2 = 0.15,	σ2 = 0.3,	ρ23 = ρ32 = 0.2,
μ3 = 0.3,	σ3 = 0.4,	ρ31 = ρ13 = −0.4.

(a) Find the minimum variance portfolio.
(b) Find the minimum variance portfolio line.

Solution. First, to apply the formulae in (3.28) and (3.32), we arrange the expected returns μi in a vector m and construct the covariance matrix C with entries Cij = σiσj ρij:

$m = [\begin{matrix} 0.10 \\ 0.15 \\ 0.30 \end{matrix}], C = [\begin{array}{l} 0.040 & - 0.012 & - 0.032 \\ - 0.012 & 0.090 & 0.024 \\ - 0.032 & 0.024 & 0.160 \end{array}] .$

The matrix C is positive definite, hence the inverse matrix C−1 exists:

$C^{- 1} ≅ [\begin{array}{l} 30.3030 & 2.5252 & 5.6818 \\ 2.5252 & 11.7845 & - 1.2626 \\ 5.6818 & - 1.2626 & 7.5758 \end{array}] .$

The weights of the minimum variance portfolio are

$w_{mv} = \frac{u^{Τ} C^{- 1}}{u^{Τ} C^{- 1} u} ≅ {[\begin{matrix} 0.6060 & 0.2053 & 0.1887 \end{matrix}]}^{Τ} .$

The expected return μmv and standard deviation (the risk) of the return σmv of the minimum variance portfolio are

$μ_{mv} = m^{Τ} w_{mv} ≅ 0.1480 and σ_{mv} = \sqrt{w_{mv}^{Τ} C w_{mv}} ≅ 0.1254.$

To describe the minimum variance line, we need to find the weight vectors for two portfolios on the line. We found one of them—the minimum variance portfolio. Since μmv ≠ 0, the other portfolio on the minimum variance line to be selected can be a portfolio with zero expected return. To find its weights, apply E quation (3.32) where we put μ = 0 to obtain

$w_{0} = \frac{m^{Τ} C^{- 1} m}{D} C^{- 1} u - \frac{m^{Τ} C^{- 1} u}{D} C^{- 1} m ≅ {[\begin{matrix} 1.1459 & 0.4721 & - 0.6180 \end{matrix}]}^{Τ} .$

Now the portfolios with weights xwmv + (1 − x)w0, x ∊ ℝ, exhaust the minimum variance line.

Since w3 = 1 − w1 − w2, all portfolios in three basis assets from the above example can be described by the weights w1 and w2. On the (w1, w2)-plane, every portfolio line is represented by a straight line. For example, the line given by the equation w1 = 0 represents all portfolios in the basis assets 2 and 3 only; the line w1 + w2 = 1 represents all portfolios in the basis assets 1 and 2 only, etc. Figure 3.9 visualizes the set of admissible portfolios from Example 3.5 on the (w1,w2)-plane (the left plot) and on the (σ, μ)-plane (the right plot). The bold line represents the minimum variance line; the minimum variance portfolio is marked by a diamond.

Figure 3.9

Figure showing the minimum variance line from Example 3.5 is plotted as a bold line. The minimum variance portfolio is marked by a diamond. The basis assets are represented by solid circles. The dashed lines represent portfolio lines.

The minimum variance line from Example 3.5 is plotted as a bold line. The minimum variance portfolio is marked by a diamond. The basis assets are represented by solid circles. The dashed lines represent portfolio lines.

3.3.4 Case without Short Selling

The case without short selling is very similar to that considered in the previous section. No short selling means that all positions in an investment portfolio have to be nonnegative. We can find the minimum variance portfolio line and the minimum variance portfolio by solving respective quadratic programming problems that have one additional condition: the weights wi are now nonnegative. To find the minimum variance portfolio, we need to solve (3.26):

$f (w) := w^{Τ} C w \to_{w}^{\min}$

subject to the constraints

$u^{Τ} w = 1, w \geq 0 . (3.33)$

Here, the meaning of w ≥ 0 is that all wi ≥ 0. To obtain a family of minimum variance portfolios parameterized by the expected return μ, we need to minimize f(w) subject to the constraints

$u^{Τ} w = 1, m^{Τ} w = μ, and w \geq 0 . (3.34)$

The constraints in (3.33) and (3.34) are almost the same as are in (3.27) and (3.29), respectively. The quadratic problems can be solved numerically. Computer systems such as MAPLETM, MATHEMATICATM, and MATLABTM can be applied to solve the minimization problems (3.26)–(3.33) and (3.26)–(3.34).

Let us consider the case with three assets from Example 3.5. The weights can be parameterized by two real variables w1,w2 ∊ [0, 1] with w1 + w2 ≤ 1 and hence w3 = 1 − w1 − w2 ≥ 0. On the (w1,w2)-plane, the set of admissible portfolios is represented by a triangle with vertices (0, 0), (0, 1), and (1, 0). Clearly, the expected return on a portfolio with nonnegative weights w is bounded above and below by max μi and min μi, respectively. Therefore, the minimum variance line is a bounded curve on the (σ, μ)-plane. It connects two points corresponding to the assets with the lowest μ and highest μ, respectively. The set of admissible portfolios (with nonnegative weights) is represented by a planar domain bounded by the minimum variance line and portfolio lines (without short selling) corresponding to different pairs of the underlying assets (see Figure 3.10).

Figure 3.10

Figure showing the set of admissible portfolios in three underlying assets without short selling.

The set of admissible portfolios in three underlying assets without short selling.

3.3.5 Efficient Frontier and Capital Market Line

Given the choice between two risky assets, a rational investor will choose an asset with higher expected return μ and lower risk σ.

Definition 3.1.

An asset with (μ1,σ1) is said to dominate another asset with (μ2,σ2) whenever μ1 ≥ μ2 and σ1 ≤ σ2. A portfolio in risky assets is called efficient if there is no other portfolio, except itself, that dominates it. The set of efficient portfolios among all attainable portfolios is called the efficient frontier.

In particular, an efficient portfolio has the highest expected return μ among all attainable portfolios with the same level of risk σ and has the lowest σ among all attainable portfolios with the same μ.

Let us consider the case with two risky assets. The set of admissible portfolios is represented by a portfolio line on the (σ, μ)-plane. The line is passing through the two base assets (σ1,μ2) and (σ2,μ2). As was proved in the previous section, there is a portfolio with the minimum possible variance σmv2 given in (3.13). For every σ>σmv, there are two portfolios on the portfolio line, (σ, μ1) and (σ, μ2), with μ1 <μ2. A rational investor would choose the portfolio (σ, μ2) with a higher expected return. Therefore, in the case of two risky assets, the efficient frontier is the upper half of the portfolio line with the minimum variance portfolio (σmv,μmv) as an endpoint. If one of the two assets is risk-free, then the portfolio line is a broken line with its vertex at the risk-free asset. The efficient frontier is the upper half-line. Both cases are represented in Figure 3.11.

Figure 3.11

Figure showing the efficient frontier (the bold line) for two assets.

The efficient frontier (the bold line) for two assets.

In the situation with multiple risky assets (N > 2), the set of admissible portfolios (the Markowitz bullet) is a planar domain on the (σ, μ)-plane bounded by the minimum variance line. Fix the value of σ ≥ σmv and consider all admissible portfolios V with the standard deviation σV = σ. On the (σ, μ)-plane, this set is a segment enclosed by the minimum variance line. By maximizing the expected return, we find that the efficient portfolios are all lying on the upper half of the minimum variance line (see Figure 3.12a).

Figure 3.12

Figure showing the efficient frontier (the bold line) constructed from multiple risky assets.

The efficient frontier (the bold line) constructed from multiple risky assets.

Finally, let us assume that one risk-free asset labelled B with the rate of return r is available in addition to N risky assets. Let 100α% of the capital be allocated in the risk-free asset and 100(1 − α)% is a risky portfolio:

$\begin{matrix} \begin{array}{l} r_{V} = α r + (1 - α) \sum_{\underset{= : r_{ℳ}}{\underset{︸}{i = 1}}}^{N} w_{i} r_{i} = α r + (1 - α) r_{ℳ}, \end{array} \end{matrix}$

where w1,...,wN are the allocation weights for the risky assets (so that w1 + ... + wN = 1 holds).

As is shown in Subsection 3.2.2, all portfolios with rV = αr + (1 − α)rℳ consisting of one risk-free and one risky asset (the risky portfolio Vℳ of an investment portfolio can be considered as a new asset) form a broken line having upper and lower half-lines with the common vertex at the point with coordinates (0,r). The efficient frontier constructed from such portfolios is the upper half-line like that in Figure 3.11. By taking a risky portfolio Vℳ anywhere in the Markowitz bullet, we can construct the set of admissible portfolios that is represented on the (σ, μ)-plane by a cone bounded by two half-lines, as is shown in Figure 3.13.

Figure 3.13

Figure showing the set of admissible portfolios constructed from four risky assets and one risk-free asset.

The set of admissible portfolios constructed from four risky assets and one risk-free asset.

The efficient frontier of the portfolios containing a risk-free asset in addition to N risky ones is the upper half-line which is passing through the point representing the risk-free asset and tangent to the minimum variance line. Indeed, to minimize the risk, the portfolio Vℳ in risky assets has to be selected on the minimum variance line. To maximize the return, the portfolio Vℳ has to be selected so that the upper half-line has the largest possible slope. If the risk-free return r is not too high, the largest possible slope is achieved when the upper half-line is tangent to the Markowitz bullet. If r is too high, then the efficient frontier is obtained in the limiting case as the portfolio Vℳ selected on the upper half of the minimum variance line goes to infinity. The efficient frontier is no longer tangent to the Markowitz bullet, but is parallel to its asymptote (recall that the shape of the bullet is a hyperbola). The tangency point with coordinates (σℳ,μℳ) is the so-called market portfolio. The efficient frontier is called the capital market line. Every rational investor forming her portfolio with a risk-free asset with return r and risky assets available on the market selects the portfolio on this line. Figure 3.12b shows the market line for portfolios with one risk-free and several risky assets.

The weights of the market portfolio are

$w_{ℳ} = \frac{(m^{T} - r u^{T}) C^{- 1}}{u^{T} C^{- 1} (m - r u)} .$

The expected return, $μ_{ℳ}$ , and variance of return, $σ_{ℳ}^{2}$ , of the market portfolio can be found by using (3.24) and (3.25). The capital market line that starts at the risk-free asset (represented by the point (0,r) on the (σ, μ)-plane) and passes through the market portfolio with expected return μM and standard deviation of return σℳ satisfies the equation

$\frac{μ - r}{μ_{ℳ} - r} = \frac{σ - 0}{σ_{ℳ} - 0} \Leftrightarrow μ = r + \frac{μ_{ℳ} - r}{σ_{ℳ}} σ .$

3.4 The Capital Asset Pricing Model

The Capital Asset Pricing Model (CAPM) attempts to relate ri, the return on asset i, to rM, the return of the entire market, which can be measured by some index such as Standard and Poor's index of 500 stocks (S&P500). In the Markowitz portfolio model, the market portfolio can be used as a good approximation to such a market index. Indeed, every rational investor will select a portfolio on the capital market line since it is the efficient frontier constructed from a risk-free asset and several risky assets. Therefore, every investor will be holding a portfolio with the same relative proportions of risky assets. This means that for each risky asset its weight in the market portfolio is equal to the relative share of the asset in the whole market.

The CAPM assumes that the dependence between ri and rℳ takes the following form:

$r_{i} = r + β_{i} (r_{ℳ} - r) + \in_{i}, (3.35)$

where βi is a constant called the beta factor for asset i, r is a risk-free rate of return, and ∈i is a residual random variable having a normal distribution with mean zero. The residual ∈i is assumed to be independent of rℳ.

There are several ways to compute beta factors.

(1) Suppose that the joint probability distribution of ri and rℳ is given. Compute the covariance of ri and rℳ by employing (3.35) and using the independence of rℳ and ∈i:

$Cov (r_{i}, r_{ℳ}) = \underset{= 0}{\underset{︸}{Cov (r, r_{ℳ})}} + β_{i} \underset{= Var (r_{ℳ})}{\underset{︸}{Cov (r_{ℳ}, r_{ℳ})}} - β_{i} \underset{= 0}{\underset{︸}{Cov (r, r_{ℳ})}} + \underset{= 0}{\underset{︸}{Cov (\in_{i}, r_{ℳ})}} = β_{i} Var (r_{ℳ}) .$

Therefore, the beta factor of asset i is given by

$β_{i} = \frac{Cov (r_{i}, r_{ℳ})}{Var (r_{ℳ})} . (3.36)$

(2) Consider a market model with a set of market scenarios Ω. Suppose that for each market scenario ω ∊ Ω, the values of returns on asset i and the market portfolio ℳ are given. We can plot the value of ri(ωj) against rℳ(ω) for each ω ∊ Ω and then find the line of best fit, also known as the regression line. Employ the model ri = α + βrℳ + ∈i. So the residual random variable ∈i :Ω → ℝ is the difference between the actual return ri and the predicted return α + βrℳ. The line of best fit is defined by

$E [\in_{i}^{2}] \to min_{α, β .}$

The expected value of $\in_{i}^{2}$ is given by

$E [\in_{i}^{2}] = E [r_{i}^{2}] - 2 β E [r_{i} r_{ℳ}] + β^{2} E [r_{ℳ}^{2}] + α^{2} - 2 α E [r_{i}] + 2 α β E [r_{ℳ}] .$

A necessary condition for a minimum of $E [\in_{i}^{2}]$ as a function of α and β is that the partial derivatives w.r.t. α and β should be zero at the point of minimum, (αi,βi):

$\begin{array}{l} \frac{\partial}{\partial α} E [\in_{i}^{2}] = 0 \Leftrightarrow α + β E [r_{ℳ}] = E [r_{i}], \\ \frac{\partial}{\partial β} E [\in_{i}^{2}] = 0 \Leftrightarrow α E [r_{ℳ}] + β E [r_{ℳ}^{2}] = E [r_{i} r_{ℳ}] . \end{array}$

As a result, we obtain a system of linear equations that can be solved to find

$β_{i} = \frac{Cov (r_{i}, r_{ℳ})}{Var (r_{ℳ})}, α_{i} = E [r_{i}] - β_{i} E [r_{ℳ}] .$

Note that for the beta factor we obtained the same expression as that in (3.36).

(3) Suppose that historical data of returns on some portfolio V and the market portfolio $M, {r_{V}^{(j)}, r_{ℳ}^{(j)}}_{j = 1, 2, ..., N}$ , are available. Let us find the line of best fit by minimizing the sum of squared residuals:

$\sum_{j = 1}^{N} {(r_{V}^{(j)} - (α + β r_{ℳ}^{(j)}))}^{2} \to \min_{α, β} \cdot$

The solution to this minimization problem is

$β_{i} = \frac{N \sum_{j} r_{V}^{(j)} r_{ℳ}^{(j)} - (\sum_{j} r_{V}^{(j)}) (\sum_{j} r_{ℳ}^{(j)})}{N \sum_{j} {(r_{ℳ}^{(j)})}^{2} - {(\sum_{j} r_{ℳ}^{(j)})}^{2}}, α_{i} = \frac{\sum_{j} r_{V}^{(j)} - β_{i} \sum_{j} r_{ℳ}^{(j)}}{N} .$

The beta factors for individual assets can be computed by (3.36) or from historical data. The beta factor of a portfolio V in N assets with weights w1,...,wN is given by

$β_{V} = w_{1} β_{1} + \cdot \cdot \cdot + w_{N} β_{N} .$

Indeed, the covariance function is bilinear; therefore

$\begin{array}{l} β_{V} & = \frac{Cov (r_{V}, r_{ℳ})}{Var (r_{V})} = \frac{Cov (w_{1} r_{1} + \dots + w_{n} r_{N}, r_{ℳ})}{Var (r_{V})} \\ = \frac{w_{1} Cov (r_{1}, r_{ℳ}) + \dots + w_{N} Cov (r_{N}, r_{ℳ})}{Var (r_{V})} = w_{1} β_{1} + \dots + w_{N} β_{N} . \end{array}$

Clearly, the beta factor of the market portfolio is equal to one.

By taking the mathematical expectation of both parts of (3.35), we obtain

$μ_{i} = r + β_{i} (μ_{ℳ} - r),$

where μi = E[ri] and μℳ = E[rℳ]. The expected return plotted against the beta factor of any portfolio will form a straight line on the (β, μ)-plane, called the asset market line.

3.5 Exercises

Exercise 3.1. Show that the functions u1(x) = ln x and u2(x) = 1 − e−ax with a > 0 both satisfy the definition of a utility function, i.e., each of them is an increasing, concave function.
Exercise 3.2. Show that the functions u3(x) = xawith 0 <a < 1 and u4(x)= x − bx2 with b > 0 and $x < \frac{1}{2 b}$ both satisfy the definition of a utility function, i.e., each of them is an increasing, convex-upward function.

Exercise 3.3. An investor with capital W can invest an amount V = aW for some 0 ≤ a ≤ 1. If V is invested, then after one year the invested amount is doubled with probability p or lost with probability 1 − p. Suppose that the remaining capital W − aW can be put in a risk-free bank account to earn interest at an annual rate of interest r. How much should be invested by an investor using:
1. (a) a log utility function u(V) = ln V,
2. (b) an exponential utility function u(V) = 1 − e−0.1v?
Exercise 3.4. Consider an investment of $1000 in two risky assets whose returns follow a bivariate normal distribution with the following expected values and standard deviations:

$μ_{1} = 0.1, σ_{1} = 0.2, μ_{2} = 0.15, σ_{1} = 0.3.$

The correlation coefficient between the returns is ρ = −0.5.
1. (a) Suppose that the allocation weights of an investment portfolio for assets 1 and 2 are, respectively, w1 = x and w2 = 1 − x for some x ∊ ℝ. Show that the terminal value VT of a portfolio is normal. Find the expected value and variance of VT.
2. (b) Find the optimal portfolio when employing the utility function
$u (V) = 1 - e^{- 0 .01 V} .$
Exercise 3.5. Consider a market model with three scenarios {ω1,ω2,ω3} and two risky assets with returns r1 and r2. Let the probabilities of the scenarios and values of the returns be as follows:

ω

ℙ(ω)

r1(ω)

r2(ω)

ω1

0.5

10%

5%

ω2

0.3

5%

10%

ω3

0.2

15%

−5%

Find the expected values and standard deviations of the returns. Find the coefficient of correlation between r1 and r2.
Exercise 3.6. Show that the optimal allocation weights {wi} of one's investment portfolio $V_{t} = \sum_{i = 1}^{N} w_{i} A_{t}^{i}, t \in {0, T}$ , that correspond to amounts invested in each asset do not depend on the initial capital V0 when attempting to maximize the mathematical expectation of:
1. (a) a log utility function u(VT) = ln VT,
2. (b) a power utility function u(VT)=(VT)a with 0 < a < 1.
In other words, the maximization of E[u(VT)] reduces to the maximization of $E [u (\frac{V_{T}}{V_{0}})]$ .
Exercise 3.7. Plot portfolio lines with and without short selling for the case with two assets if
1. (a) |ρ12| = 1, μ1 = μ2, and σ1 ≠ σ2,
2. (b) |ρ12| = 1, μ1 ≠ μ2, and σ1 = σ2,
3. (c) μ1 = μ2, and σ1 = σ2.
Exercise 3.8. Consider three assets whose returns have the following standard deviation and correlation coefficients:

$σ_{1} = 0.2, σ_{2} = 0.25, σ_{3} = 0.15, ρ_{12} = - 0.4, ρ_{13} = 0.3, ρ_{23} = 0.7.$

Obtain the covariance matrix C.
Exercise 3.9. Compute the weights in the minimum variance portfolio constructed using the assets in Exercise 3.7. Also compute the expected return and standard deviation of the minimum variance portfolio.
Exercise 3.10. Show that

$C = [\begin{matrix} 1 & 0.75 & - 0.3 \\ 0.75 & 1 & 0.5 \\ - 0.3 & 0.5 & 1 \end{matrix}]$

cannot be a covariance matrix.
Exercise 3.11. Suppose that the risk-free return is r = 3%. Find the weights in the market portfolio constructed from the three assets in Example 3.5. Compute the expected return and standard deviation of the return of the market portfolio.

ω	ℙ(ω)	r1(ω)	r2(ω)
ω1	0.5	10%	5%
ω2	0.3	5%	10%
ω3	0.2	15%	−5%

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for Chapter 3 Portfolio Management

Create new playlist

Sign In

Sign Up