Appendix B
This appendix gives a brief overview of probability theory. Some of the concepts introduced are used widely in the lecture notes. It is not necessary to understand all the technical details, but an intuitive understanding of the concepts introduced is important.
Let Ω denote a finite sample space which contains all the elementary outcomes ωi for i = 1, 2, ..., N. In a two-period binomial model the elementary outcome is the state of the world at time t = 2, which determines the stock price at that time.
Definition B.1 (σ-algebra).
Let Ω be a set of points ω. A family ℱ of subsets of Ω is called a σ-algebra if
1. ∅ ∈ ℱ.
2. If A ∈ ℱ, then the complement Aᶜ ∈ ℱ.
3. If A1, A2, ... ∈ ℱ, then ⋃_{i=1}^∞ Ai ∈ ℱ.
The definition says that (1) the empty set is an element of ℱ. (2) If A ∈ ℱ, then the complement of A is in ℱ as well; in particular the entire set Ω ∈ ℱ, since the empty set is in ℱ. (3) Countable unions of elements of ℱ are elements of ℱ as well.
Example B.1.
The family of all subsets of Ω is an example of a σ-algebra, and it is denoted by 2^Ω. In the two-period binomial model with Ω = {ω1, ω2, ω3, ω4} we have
2^Ω = {∅, {ω1}, {ω2}, {ω3}, {ω4}, {ω1, ω2}, {ω1, ω3}, {ω1, ω4}, {ω2, ω3}, {ω2, ω4}, {ω3, ω4}, {ω1, ω2, ω3}, {ω1, ω2, ω4}, {ω1, ω3, ω4}, {ω2, ω3, ω4}, Ω}.    (B.1)
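The 16 subsets in (B.1) can be checked mechanically. The following Python sketch (the string labels "w1", ..., "w4" are illustrative stand-ins for ω1, ..., ω4) enumerates the power set of a four-point sample space:

```python
from itertools import combinations

# Sample space of the two-period binomial model (Example B.1).
omega = ["w1", "w2", "w3", "w4"]

# Enumerate every subset of omega: this is the power set 2^Omega.
power_set = [frozenset(c) for r in range(len(omega) + 1)
             for c in combinations(omega, r)]

print(len(power_set))  # 2^4 = 16 subsets, from the empty set up to Omega
```

The count 2^4 = 16 agrees with (B.1): one empty set, four singletons, six pairs, four triples, and Ω itself.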
Definition B.2 (Measurable space).
A pair (Ω, ℱ), where Ω is a set and ℱ is a σ-algebra on Ω, is called a measurable space, and the subsets of Ω which are in ℱ are called ℱ-measurable sets.
Definition B.3.
A probability measure ℙ on a measurable space (Ω, ℱ) is a function ℙ: ℱ → [0, 1] such that
1. ℙ(∅) = 0 and ℙ(Ω) = 1.
2. If A1, A2, ... ∈ ℱ and {Ai}_{i=1}^∞ are disjoint, then

ℙ(⋃_{i=1}^∞ Ai) = ∑_{i=1}^∞ ℙ(Ai).    (B.2)
The triple (Ω, ℱ, ℙ) is called a probability space.
Definition B.4 (Partition).
A partition 𝒫 of a set Ω is a finite family {Ai, i = 1, 2, ..., K} of subsets of Ω, such that
1. Ai ∩ Aj = ∅ for i ≠ j.
2. ⋃_{i=1}^K Ai = Ω.
Consider a sample space Ω and a given partition 𝒫 = {Ai; i = 1, 2, ..., K} of Ω. We can then interpret 𝒫 intuitively in terms of “information” in the following way.
Example B.2.
Let Ω = {ω1, ω2, ω3, ω4} denote the sample space, and define two partitions 𝒫1 = {{ω1, ω2}, {ω3, ω4}} and 𝒫2 = {{ω1, ω2}, {ω3}, {ω4}}. Then, intuitively speaking, the partition 𝒫2 contains more information than partition 𝒫1, since one of the elements of 𝒫1, namely {ω3, ω4}, is partitioned into “smaller” elements in partition 𝒫2.
This leads to the following definition:
Definition B.5.
A partition 𝒫 is said to be “richer” than a partition 𝒬 if 𝒫 and 𝒬 are partitions of the same sample space Ω, and each component of 𝒬 is a union of components of 𝒫.
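For finite sample spaces, Definition B.5 is easy to test directly. A minimal Python sketch (the helper name is_richer and the outcome labels are illustrative, not from the notes) checks that every component of the coarser partition is a union of components of the finer one:

```python
def is_richer(P, Q):
    """Return True if partition P is richer (finer) than partition Q:
    every component of Q must be a union of components of P."""
    return all(B == set().union(*[A for A in P if A <= B]) for B in Q)

P1 = [{"w1", "w2"}, {"w3", "w4"}]            # partitions from Example B.2
P2 = [{"w1", "w2"}, {"w3"}, {"w4"}]

print(is_richer(P2, P1))  # True: P2 splits {w3, w4} further
print(is_richer(P1, P2))  # False: P1 cannot rebuild the singleton {w3}
```

This reproduces Example B.2: 𝒫2 is richer than 𝒫1, but not the other way around.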
Although the more general concept of σ-algebras is used to denote the “information set,” it might help to think of it as a partition. In the next section we need the following definition:
Definition B.6 (Generated σ-algebra).
A σ-algebra G is said to be generated by a partition 𝒫 if G is the smallest σ-algebra that contains every component of 𝒫. The generated σ-algebra is denoted G = σ(𝒫).
Example B.3.
Consider a sample space Ω = {ω1, ω2, ω3, ω4} and a partition 𝒫 = {{ω1, ω2}, {ω3, ω4}}. The σ-algebra generated by that partition is then given by
G = {∅, {ω1, ω2}, {ω3, ω4}, Ω}.    (B.3)
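For a finite partition, the generated σ-algebra is exactly the family of all unions of its cells (including the empty union). A Python sketch of this construction (the function name sigma_from_partition is an illustrative choice):

```python
from itertools import combinations

def sigma_from_partition(partition):
    """Sigma-algebra generated by a finite partition: all unions of cells."""
    cells = [frozenset(c) for c in partition]
    return {frozenset().union(*comb) for r in range(len(cells) + 1)
            for comb in combinations(cells, r)}

P = [{"w1", "w2"}, {"w3", "w4"}]             # partition from Example B.3
G = sigma_from_partition(P)
print(len(G))                                # 4 sets, matching (B.3)
print(frozenset({"w1", "w2"}) in G)          # True
```

With n cells this yields 2^n sets, which anticipates the counting argument used for Z-atoms later in this appendix.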
Example B.4.
Let the sample space Ω consist of the real numbers in the interval [0, 1]. Define the partitions

𝒫1 = {A1, A2, A3, A4},  𝒫2 = {B1, B2, B3}

where

A1 = [0, 1/3[, A2 = [1/3, 1/2[, A3 = [1/2, 3/4[, A4 = [3/4, 1],
B1 = [0, 1/3[, B2 = [1/3, 3/4[, B3 = [3/4, 1].
It is intuitively appealing to state that 𝒫1 contains more information than 𝒫2, because 𝒫1 divides Ω into smaller parts.
The objective of this section is to define the conditional expectation E[X|G] where G is a σ-algebra, which should be interpreted as the expectation of X given the information represented by the σ-algebra. However, we begin with the elementary definition of conditional expectation, given the probability space (Ω, ℱ, ℙ) and two stochastic variables X and Z.
Definition B.7 (Conditional probability).
The probability of X conditioned on Z is given by
ℙ(X = xi | Z = zj) = ℙ(X = xi ∩ Z = zj) / ℙ(Z = zj).    (B.4)
The intuition behind this definition is as follows.
The probability of a given event xi is the fraction of the total probability mass that is assigned to that event, i.e.,

ℙ(xi) = ℙ(xi) / ℙ(Ω),    (B.5)

since ℙ(Ω) = 1. Conditioning on Z = zj means that the mass ℙ(Z = zj) plays the role of the total mass in the denominator.
The definition of the conditional expectation for discrete stochastic variables is

E[X | Z = zj] = ∑_i xi ℙ(X = xi | Z = zj).    (B.6)
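Definitions (B.4) and (B.6) can be combined into a few lines of Python. In the sketch below the joint probability mass function of (X, Z) is a hypothetical example, stored as a dictionary mapping pairs (x, z) to probabilities:

```python
def cond_expectation(joint, z):
    """E[X | Z = z] from a joint pmf given as {(x, z): probability},
    following (B.4) and (B.6): sum over i of x_i * P(X = x_i | Z = z)."""
    p_z = sum(p for (x, zz), p in joint.items() if zz == z)   # P(Z = z)
    return sum(x * p / p_z for (x, zz), p in joint.items() if zz == z)

# Hypothetical joint pmf of (X, Z) on a four-point sample space.
joint = {(1, 1): 0.25, (2, 2): 0.25, (3, 1): 0.25, (4, 2): 0.25}
print(cond_expectation(joint, 1))  # (1 + 3) / 2 = 2.0
```

Dividing each joint probability by p_z is exactly the conditioning step (B.4), and the weighted sum is (B.6).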
The (unconditional) expectation of a stochastic variable X is given by
E[X] = ∫_Ω X(ω) dℙ(ω)    (B.7)
where the integration is taken over the entire sample space, with respect to the measure (distribution) ℙ. This covers the case where no prior knowledge of the outcome ω is available. Now assume that we know that ω ∈ B, where B ∈ ℱ and ℙ(B) > 0. As a preliminary definition of conditional expectation we have the following:
Definition B.8 (Conditional expectation given a single event).
Given a probability space (Ω, ℱ, ℙ) assume that B ∊ ℱ with ℙ(B) > 0. The conditional expectation of X given B is defined by
E[X | B] = (1 / ℙ(B)) ∫_B X(ω) dℙ(ω).    (B.8)
Note that this definition is very similar to the definition of conditional probability given in (B.4), and it has a similar interpretation. This definition is now generalized to the case where the conditioning argument is a partition. Let 𝒫 = {A1, ..., AK} be a partition of Ω with ℙ(Ai) > 0 for all i. From Section B.2 we know that conditioning on 𝒫 can be interpreted as knowing in which set Ai the true ω lies. This leads to the following preliminary definition of conditional expectation:
Definition B.9.
Let 𝒫 = {A1, ..., AK} be a partition of Ω with ℙ(Ai) > 0 for all i. Then the conditional expectation of X given 𝒫 is

E[X | 𝒫] = ∑_{n=1}^K I{ω ∈ An} E[X | An],    (B.9)
where I{·} denotes the indicator function.
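Definition B.9 is readily implemented on a finite sample space: locate the cell containing ω and return the cell average from (B.8). The following Python sketch does this (the function name, outcome labels and uniform probabilities are illustrative assumptions):

```python
def cond_exp_partition(X, P, prob, omega):
    """E[X | P](omega) following (B.9): find the cell A_n containing
    omega and return the cell average E[X | A_n] from (B.8)."""
    cell = next(A for A in P if omega in A)
    p_cell = sum(prob[w] for w in cell)          # P(A_n), assumed > 0
    return sum(X[w] * prob[w] for w in cell) / p_cell

X = {"w1": 1, "w2": 2, "w3": 3, "w4": 4}
prob = {w: 0.25 for w in X}                      # uniform probabilities
P = [{"w1", "w3"}, {"w2", "w4"}]
print(cond_exp_partition(X, P, prob, "w1"))      # (1 + 3) / 2 = 2.0
```

The indicator functions in (B.9) correspond to the lookup of the unique cell containing ω.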
The problem with this definition is that it assumes that each set must have positive probability, which is an unnecessary restriction, as we shall see. To give an idea of the interpretation of the final definition of conditional expectation, based on σ-algebras, consider the following.
Let 𝒫 be a partition of Ω into Z-atoms,1 i.e., sets on which the random variable Z is constant. The σ-algebra G = σ(𝒫) generated by this partition consists of exactly 2^n possible unions of the n Z-atoms. It is clear from the elementary definition of conditional expectation that the conditional expectation Y = E[X | 𝒫] is constant on the Z-atoms, or, to be more precise,
Y is G-measurable.(B.10)
Since Y takes the constant value yj on the Z-atom {Z = zj}, we have

∫_{Z=zj} Y dℙ = yj ℙ(Z = zj).    (B.11)
Applying the elementary definitions of conditional probability and expectation, (B.4) and (B.6), we get

∫_{Z=zj} Y dℙ = ∑_i xi ℙ(X = xi | Z = zj) ℙ(Z = zj) = ∑_i xi ℙ(X = xi ∩ Z = zj) = ∫_{Z=zj} X dℙ.    (B.12)
If we write Gj = {Z = zj}, this says that E[Y I_{Gj}] = E[X I_{Gj}], where I denotes the indicator function. Since every G ∈ G is a union of Z-atoms, I_G is a sum of the corresponding I_{Gj}, and therefore E[Y I_G] = E[X I_G], or
∫_G Y dℙ = ∫_G X dℙ  for all G ∈ G.    (B.13)
This leads us to the final definition of conditional expectation.
Definition B.10 (Conditional expectation).
Let (Ω, ℱ, ℙ) be a probability space, X a stochastic variable on this space, and let G ⊆ ℱ be a σ-algebra on Ω. If Y is a stochastic variable such that
1. Y is G-measurable,
2. ∫_G Y(ω) dℙ(ω) = ∫_G X(ω) dℙ(ω)  for all G ∈ G,    (B.14)
then Y = E[X|G] is the conditional expectation of X given G.
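On a finite sample space condition 2 of Definition B.10 can be verified by brute force: compute both integrals for every set in G. The sketch below does this for a hypothetical candidate Y (the numbers are chosen for illustration; the σ-algebra is generated by the partition {{ω1, ω3}, {ω2, ω4}}):

```python
from itertools import combinations

# Finite sample space with uniform probabilities.
prob = {"w1": 0.25, "w2": 0.25, "w3": 0.25, "w4": 0.25}
X = {"w1": 1, "w2": 2, "w3": 3, "w4": 4}
Y = {"w1": 2, "w2": 3, "w3": 2, "w4": 3}     # candidate for E[X | G]

# G: sigma-algebra generated by the partition {{w1, w3}, {w2, w4}},
# obtained as all unions of the two cells.
cells = [frozenset({"w1", "w3"}), frozenset({"w2", "w4"})]
G = [frozenset().union(*c) for r in range(len(cells) + 1)
     for c in combinations(cells, r)]

def integral(f, A):
    """Integral of f over the event A with respect to the measure prob."""
    return sum(f[w] * prob[w] for w in A)

# Condition 2 of Definition B.10: the integrals agree on every set in G.
print(all(abs(integral(Y, A) - integral(X, A)) < 1e-12 for A in G))  # True
```

Condition 1 holds as well, since Y is constant on each cell of the generating partition.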
To give an intuitive understanding of conditional expectation given a σ-algebra consider the following example.
Example B.5.
Suppose we have a finite sample space Ω = {ω1, ω2, ω3, ω4} with four possible outcomes. Define three stochastic variables X, Y1 and Y2: Ω → ℝ with the following values
      ω1    ω2    ω3    ω4
X      1     2     3     4
Y1     1     2     1     2
Y2    1.5   10    1.5   10
Since the stochastic variable X takes different values for all outcomes ωi, the σ-algebra generated by that variable is given by
σ{X} = {∅, {ω1}, {ω2}, {ω3}, {ω4}, {ω1, ω2}, {ω1, ω3}, {ω1, ω4}, {ω2, ω3}, {ω2, ω4}, {ω3, ω4}, {ω1, ω2, ω3}, {ω1, ω2, ω4}, {ω1, ω3, ω4}, {ω2, ω3, ω4}, Ω}    (B.15)
which corresponds to full information. The σ-algebras generated by Y1 and by Y2 contain less “information,” since these variables take the same value for ω1 and ω3, and the same value for ω2 and ω4. The two generated σ-algebras
σ{Y1} = σ{Y2} = {∅, {ω1, ω3}, {ω2, ω4}, Ω}    (B.16)
contain the same information about X despite the fact that Y1 and Y2 take different values. Assume that each outcome has probability 1/4. By the elementary definition of conditional expectation we have
E[X | Y1 = 1] = (1/2)·1 + (1/2)·3 = 2    (B.17)
E[X | Y1 = 2] = (1/2)·2 + (1/2)·4 = 3    (B.18)
which can be summarized as
E[X | Y1](ω) = { 2  if ω ∈ {ω1, ω3}
               { 3  if ω ∈ {ω2, ω4}.    (B.19)
We shall now check whether the two conditions stated in Definition B.10 are fulfilled. Since E[X|Y1](ω) is constant on the two subsets {ω1, ω3} and {ω2, ω4} the conditional expectation (B.19) is measurable with respect to σ{Y1}.
The other condition says that
∫_{ω1,ω3} E[X | Y1](ω) dℙ(ω) = ∫_{ω1,ω3} X(ω) dℙ(ω)    (B.20)
∫_{ω2,ω4} E[X | Y1](ω) dℙ(ω) = ∫_{ω2,ω4} X(ω) dℙ(ω)    (B.21)
which are also fulfilled. It is easy to show that E[X|σ{Y1}] = E[X|σ{Y2}], since the generated σ-algebras are the same.
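Example B.5 can be reproduced in a few lines of Python. The helper below (its name and the outcome labels are illustrative) averages X over each level set {Z = z}, which is exactly a Z-atom of the generated partition:

```python
def cond_exp_given_variable(X, Z, prob):
    """E[X | sigma(Z)] as a function of omega: average X over each
    level set {Z = z}, i.e., over each Z-atom."""
    out = {}
    for w0 in Z:
        atom = [w for w in Z if Z[w] == Z[w0]]
        p = sum(prob[w] for w in atom)
        out[w0] = sum(X[w] * prob[w] for w in atom) / p
    return out

prob = {"w1": 0.25, "w2": 0.25, "w3": 0.25, "w4": 0.25}
X = {"w1": 1, "w2": 2, "w3": 3, "w4": 4}
Y1 = {"w1": 1, "w2": 2, "w3": 1, "w4": 2}
Y2 = {"w1": 1.5, "w2": 10, "w3": 1.5, "w4": 10}

print(cond_exp_given_variable(X, Y1, prob))
# {'w1': 2.0, 'w2': 3.0, 'w3': 2.0, 'w4': 3.0}, matching (B.19)
print(cond_exp_given_variable(X, Y1, prob)
      == cond_exp_given_variable(X, Y2, prob))  # True
```

The equality in the last line confirms that E[X|σ{Y1}] = E[X|σ{Y2}]: only the generated σ-algebra matters, not the particular values Y1 or Y2 takes.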
Some of the most important properties of conditional expectation are given in the following list, where G and ℋ denote sub-σ-algebras of ℱ:
1. If X is G-measurable, then

E[X | G] = X  a.s.

2. For constants a, b and stochastic variables X1, X2,

E[aX1 + bX2 | G] = aE[X1 | G] + bE[X2 | G]  a.s.

3. If ℋ is a sub-σ-algebra of G, then

E[E[X | G] | ℋ] = E[X | ℋ]  a.s.    (B.22)

4. If Z is G-measurable and bounded, then

E[ZX | G] = Z E[X | G]  a.s.    (B.23)
Remark B.1.
Intuitively, the statement that X is G-measurable simply means that X is known, and thus E[X|G] = X a.s. Item 2 simply states that the expectation operator is linear. Eq. (B.22) is often called the Tower Property; it states that the coarser sub-σ-algebra ℋ overrules the finer sub-σ-algebra G. Eq. (B.23) states that we can take out what is known (namely Z) from the expectation operator.
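The Tower Property (B.22) is easy to verify numerically on a finite sample space. The sketch below (helper name, partitions and probabilities are illustrative assumptions) conditions first on a finer partition G and then on a coarser partition ℋ, and checks that the result equals conditioning on ℋ directly:

```python
def cond_exp(X, P, prob):
    """E[X | sigma(P)] for a finite partition P: constant cell averages."""
    out = {}
    for A in P:
        pA = sum(prob[w] for w in A)
        avg = sum(X[w] * prob[w] for w in A) / pA
        for w in A:
            out[w] = avg
    return out

prob = {"w1": 0.25, "w2": 0.25, "w3": 0.25, "w4": 0.25}
X = {"w1": 1, "w2": 2, "w3": 3, "w4": 4}
G_part = [{"w1"}, {"w2"}, {"w3", "w4"}]     # finer partition (G)
H_part = [{"w1", "w2"}, {"w3", "w4"}]       # coarser partition (H)

lhs = cond_exp(cond_exp(X, G_part, prob), H_part, prob)   # E[E[X|G] | H]
rhs = cond_exp(X, H_part, prob)                           # E[X | H]
print(lhs == rhs)  # True: the coarser sigma-algebra H overrules G
```

Note that every cell of H_part is a union of cells of G_part, so σ(ℋ) ⊆ σ(G) as the Tower Property requires.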
Should you wish to pursue these (purely) mathematical topics, a number of books are available (Grimmett and Stirzaker [1992], Karatzas and Shreve [1996], Williams [1995], Royden [1988]). The first reference provides an excellent and readable introduction to stochastic processes and probability theory in general. The other references are given in an increasing order of difficulty and the topics considered herein are outside the scope and aim of these lecture notes.
1If the sample space is finite, an atom is a set which consists of only one element.