Chapter 9

Introduction to Statistics for Stochastic Processes

9.1. Modeling a family of observations

Let (x_t, t ∈ S) be a family of observations of a phenomenon which may be physical, economic, biological, etc. To model the mechanism that generates the x_t, we may suppose them to be realizations of random variables (X_t, t ∈ S) that are, in general, correlated. The overall phenomenon is described by (X_t, t ∈ T), where S ⊆ T and t is generally interpreted as a time: (X_t, t ∈ T) is said to be a stochastic process or a random function.

If T is denumerable, we have a discrete-time process, and if T is an interval of ℝ, a continuous-time process. If the set S of observation times is itself random, we say that we observe a point process (this notion will be elaborated subsequently).

EXAMPLE 9.1.–

Discrete-time processes:

   1) The daily electricity consumption of Paris.

   2) The monthly number of vehicle registrations in France.

   3) The annual production of gasoline.

   4) The evolution of a population: growth, the extinction of surnames, the propagation of epidemics.

   5) The evolution of sunspots over the past two centuries.

   6) The sequence of results obtained by a sportsman.

Continuous-time processes:

   1) The trajectory of a particle immersed in a fluid, where it is subjected to successive collisions with the molecules of the fluid.

   2) The reading from an electrocardiogram.

   3) The variation in concentration of a chemical solution during a reaction.

   4) The evolution of stock prices during a session.

   5) The number of calls which reach a telephone exchange in an interval of time [0, t], t ≥ 0.

Point processes:

   1) The sequence of instants where telephone calls reach an exchange.

   2) The arrival times of customers at a service window.

   3) A sequence of disasters (earthquakes, car accidents, etc.).

   4) Spatial distributions of plants or animals.

   5) The position of vehicles at a given instant on a portion of road.

9.2. Processes

Let (Ω, 𝒜, P) be a probability space and (E, ℬ) be a measurable space (𝒜 and ℬ are σ-algebras, and P is a probability on 𝒜). Moreover, let (X_t, t ∈ T) be a family of random variables defined on (Ω, 𝒜, P) and with values in (E, ℬ).

We say that (X_t, t ∈ T), or (X_t), is a stochastic process with basis space (Ω, 𝒜, P) and with state space (E, ℬ); T is called the time set.

For fixed ω in Ω, t ↦ X_t(ω) is the trajectory of the point ω. For fixed t in T, ω ↦ X_t(ω) is the state of the process at the moment t.
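
To fix ideas, here is a minimal numerical sketch in Python (the choices are purely illustrative: numpy's random generator plays the role of the basis space, and the process is a simple random walk). A row of the array is a trajectory; a column is the state of the process at a fixed time.

import numpy as np

rng = np.random.default_rng(0)
n_omega, n_times = 5, 100

# each row simulates one point omega of Omega; columns index the times t;
# the random-walk dynamics are an arbitrary illustrative choice
X = rng.standard_normal((n_omega, n_times)).cumsum(axis=1)

trajectory = X[2, :]   # t -> X_t(omega): the trajectory of a fixed omega
state = X[:, 10]       # omega -> X_t(omega): the state at the fixed time t = 10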

9.2.1. The distribution of a process

Let us consider the mapping

X : ω ↦ (X_t(ω), t ∈ T),   (Ω, 𝒜) → (E^T, ζ),

where the σ-algebra ζ is generated by the coordinate mappings Π_t, t ∈ T, with

Π_t(x) = x_t,   x = (x_s, s ∈ T) ∈ E^T.

The relation X_t = Π_t ∘ X implies that:

X^{−1}(Π_t^{−1}(B)) = X_t^{−1}(B) ∈ 𝒜,   B ∈ ℬ, t ∈ T.

Since the Π_t^{−1}(B), B ∈ ℬ, t ∈ T, generate ζ, we conclude that X is measurable (with respect to 𝒜 and ζ).

The distribution P_X of X, defined by:

P_X(C) = P(X^{−1}(C)),   C ∈ ζ,

is called the distribution of the process (X_t).

The process (Π_t, t ∈ T) defined on (E^T, ζ, P_X) is called the canonical process of (X_t), and it has the same distribution as (X_t).

The distributions of the random vectors (X_{t_1}, …, X_{t_k}), k ≥ 1, t_1, …, t_k ∈ T, are called the finite-dimensional distributions of (X_t). If E = ℝ, equipped with its Borel σ-algebra ℬ_ℝ, then it may be shown that the finite-dimensional distributions determine P_X (this is a consequence of the Kolmogorov existence theorem). The random vectors of the form (X_{t_1}, …, X_{t_k}) are called the margins of (X_t).

9.2.2. Gaussian processes

Recall: A real random variable is said to be Gaussian if it may be written in the form aX_0 + b, where X_0 follows the distribution with density (2π)^{−1/2} exp(−x²/2), x ∈ ℝ, and where a and b are constants (a = 0 is not excluded). A random variable with values in ℝ^k is Gaussian if every linear combination of its components is a real Gaussian random variable.

A process (Xt) is said to be Gaussian if its margins are Gaussian.

The functions t ↦ EX_t and (s, t) ↦ Cov(X_s, X_t), called the mean and the covariance of (X_t), respectively, completely determine the distribution of the Gaussian process (X_t), as they determine its finite-dimensional distributions.
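
As an illustration, here is a minimal sketch, assuming (purely for the example) a zero mean and the covariance Cov(X_s, X_t) = exp(−|s − t|): since the margins are Gaussian vectors, the mean and covariance functions suffice to sample any finite-dimensional distribution.

import numpy as np

rng = np.random.default_rng(0)

# hypothetical mean and covariance functions, chosen only for the example
t = np.linspace(0.0, 5.0, 50)                   # observation times t_1, ..., t_k
mean = np.zeros_like(t)                         # t -> E X_t
cov = np.exp(-np.abs(t[:, None] - t[None, :]))  # (s, t) -> Cov(X_s, X_t)

# the margin (X_{t_1}, ..., X_{t_k}) is a Gaussian vector entirely
# determined by (mean, cov), so it can be sampled directly
sample = rng.multivariate_normal(mean, cov)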

9.2.3. Stationary processes

A process (X_t, t ∈ T) is said to be strictly stationary if:

(X_{t_1+h}, …, X_{t_k+h}) =_d (X_{t_1}, …, X_{t_k}),   k ≥ 1; t_1, …, t_k ∈ T; h such that t_i + h ∈ T, i = 1, …, k,

where =_d denotes equality in distribution.

A real, square-integrable process is said to be (weakly) stationary if its mean is constant and its covariance satisfies:

Cov(X_{s+h}, X_{t+h}) = Cov(X_s, X_t),   s, t, s + h, t + h ∈ T.

A real, square-integrable, strictly stationary process is weakly stationary. The converse is not true in general: for example, real, centered, independent random variables X_1, X_2, … with the same variance but different distributions form a weakly stationary but not strictly stationary process. The converse does hold, however, if the process is Gaussian.
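
A minimal simulation of this counter-example (the two alternating distributions chosen here, Gaussian and uniform with unit variance, are our illustrative choice):

import numpy as np

rng = np.random.default_rng(0)
n = 10_000

# independent, centered, variance 1, but with two alternating distributions:
# N(0, 1) at even indices, uniform on [-sqrt(3), sqrt(3)] at odd indices
x = np.where(np.arange(n) % 2 == 0,
             rng.standard_normal(n),
             rng.uniform(-np.sqrt(3.0), np.sqrt(3.0), n))

# same mean and variance at every index: weak stationarity
print(x[0::2].var(), x[1::2].var())    # both close to 1

# but the distributions differ, so the process is not strictly stationary
print((np.abs(x[0::2]) > 2).mean())    # about 0.046 (Gaussian tails)
print((np.abs(x[1::2]) > 2).mean())    # exactly 0 (bounded support)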

EXAMPLE 9.2.–

1) A strong white noise is a sequence (ε_n, n ∈ ℤ) of real, independent, centered, random variables with the same distribution, which are such that:

0 < σ² = Eε_n² < ∞.

If we replace “independent” and “with the same distribution” by “orthogonal” (i.e. Eε_nε_m = 0, n ≠ m), then we obtain a weak white noise.

A strong white noise is strictly stationary, whereas a weak white noise is weakly stationary.

2) A linear process is defined by the relation:

[9.1]   X_n = ∑_{j ∈ ℤ} a_j ε_{n−j},   n ∈ ℤ,

where (ε_n) is a white noise and the a_j are constants such that ∑_{j ∈ ℤ} a_j² < ∞.

In the following, we will adopt the more restrictive condition ∑_j |a_j| < ∞ (see Chapter 10).

The series appearing in [9.1] converges in quadratic mean.

(X_n) is strictly or weakly stationary according to whether (ε_n) is a strong or a weak white noise.
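
The following sketch builds a truncated version of [9.1] from a Gaussian strong white noise, with the illustrative (and absolutely summable) choice a_j = 2^{−j}, j ≥ 0, and checks that the empirical autocovariance depends only on the lag:

import numpy as np

rng = np.random.default_rng(0)
n, sigma = 100_000, 1.0

# strong white noise: i.i.d., centered, 0 < E eps_n^2 = sigma^2 < infinity
eps = rng.normal(0.0, sigma, n)

# one-sided linear process X_n = sum_{j >= 0} a_j eps_{n-j}, truncated at j = 30
a = 0.5 ** np.arange(30)                 # sum_j |a_j| < infinity
X = np.convolve(eps, a, mode="valid")

def acov(x, h):
    """Empirical autocovariance at lag h."""
    x = x - x.mean()
    return (x[: len(x) - h] * x[h:]).mean() if h else (x * x).mean()

# close to sigma^2 * sum_j a_j a_{j+h} = 0.5**h / 0.75, whatever the time origin
print(acov(X, 0), acov(X, 1), acov(X, 2))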

9.2.4. Markov processes

A real process (X_t, t ∈ T) is Markovian if, for every s, t ∈ T such that s < t, the conditional distribution of X_t given (X_u, u ≤ s) is the same as the conditional distribution of X_t given X_s.

For example, if B is a Borel set of ℝ, we have:

P(X_t ∈ B | X_u, u ≤ s) = P(X_t ∈ B | X_s)   a.s.

Many of the processes shown in the following are Markovian: strictly stationary first-order autoregressive processes, Poisson processes, Wiener processes, and diffusion processes.
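
For example, a strictly stationary first-order autoregressive process can be simulated as follows (a minimal sketch; the parameter values are arbitrary). The recursion makes the Markov property visible: given the whole past, the next state depends on the current one only.

import numpy as np

rng = np.random.default_rng(0)
n, rho, sigma = 1_000, 0.8, 1.0   # arbitrary illustrative values, |rho| < 1

x = np.empty(n)
# start from the stationary distribution N(0, sigma^2 / (1 - rho^2))
x[0] = rng.normal(0.0, sigma / np.sqrt(1.0 - rho ** 2))
for k in range(1, n):
    # the conditional distribution of x[k] given the whole past
    # depends on x[k - 1] only: this is the Markov property
    x[k] = rho * x[k - 1] + rng.normal(0.0, sigma)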

9.3. Statistics for stochastic processes

The study of a process that models observed variables may be divided into four steps.

1) Empirical analysis of the observations:

Let us make, for example, the hypothesis that the series is generated by a process of the form:

[9.2]   X_t = g(t) + s(t) + Y_t,   t ∈ S,

where g is a deterministic function that represents the trend, s is a periodic, deterministic function called the seasonality, and (Y_t) is a centered, stationary, stochastic process.

The first step then consists of estimating or eliminating the trend and the seasonality in such a way as to keep only the data of the stationary part (Y_t).

For this, we may suppose that g and s have a particular form. For example, if T = ℕ* and if S = {1, 2, …, n}, we may set:

g(t) = a_0 + a_1 t + ⋯ + a_p t^p,   s(t) = ∑_{j=1}^τ c_j δ_{jt},   t ∈ S,

where δ_{jt} = 1 if t ≡ j (mod τ) and δ_{jt} = 0 otherwise.

Thus, τ is the period of s and

s(t + τ) = s(t),   t, t + τ ∈ S.

The decomposition [9.2] is unique if the functions 1, t, …, t^p, δ_{1t}, …, δ_{τt} are linearly independent. This is not the case, since

δ_{1t} + δ_{2t} + ⋯ + δ_{τt} = 1,   t ∈ S.

We then introduce the additional condition:

c_1 + c_2 + ⋯ + c_τ = 0,

which means that the seasonal effects compensate for each other over one period.

This allows us to construct estimators of a_0, …, a_p; c_1, …, c_τ using the method of least squares, i.e. by minimizing

∑_{t=1}^n ( x_t − a_0 − a_1 t − ⋯ − a_p t^p − ∑_{j=1}^τ c_j δ_{jt} )²

under the constraint

c_1 + c_2 + ⋯ + c_τ = 0.

A numerical sketch of this fit is given after step 4 below.

2) Choice of a stationary model for (Yt):

After eliminating g and s, we may suppose the (modified) observations to be realizations of Y_1, …, Y_n.

Theoretical considerations often allow us to choose a stationary model which is well suited to (Yt). The linear process defined in section 9.2 is one possible choice.

In certain cases, we simply suppose that (Y_t) has stationary increments, i.e. the distribution of Y_{t+h} − Y_{s+h} (s, t, s + h, t + h ∈ T) does not depend on h. The Poisson process, the Wiener process, and the ARIMA process, which we will study later, are very important examples of stationary increment processes.

3) Statistical inference:

To completely identify the chosen process, we estimate the unknown parameters from the observed variables. Some tests allow us to verify that the identified model is well suited to the observations.

4) Use of the identified model:

The identified model may be used to solve problems of control, detection, interpolation or prediction of the future values of (Xt).
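
As announced in step 1, here is a minimal sketch of the least squares fit of model [9.2]; the degree p = 1, the period τ = 4, the coefficient values and the simulated data are all hypothetical, and the constraint c_1 + ⋯ + c_τ = 0 is enforced by substituting c_τ = −(c_1 + ⋯ + c_{τ−1}).

import numpy as np

rng = np.random.default_rng(0)
n, tau = 200, 4
t = np.arange(1, n + 1)

# hypothetical data from model [9.2]: linear trend + period-4 seasonality + noise
c_true = np.array([1.0, -0.5, 0.5, -1.0])     # seasonal effects, sum = 0
x = 2.0 + 0.05 * t + c_true[(t - 1) % tau] + rng.normal(0.0, 0.3, n)

# delta_{jt} dummies; substituting c_tau = -(c_1 + ... + c_{tau-1}) turns the
# constrained problem into an ordinary least squares problem
d = np.eye(tau)[(t - 1) % tau]                # shape (n, tau)
design = np.column_stack([np.ones(n), t, d[:, :-1] - d[:, -1:]])
coef, *_ = np.linalg.lstsq(design, x, rcond=None)

a0_hat, a1_hat = coef[0], coef[1]             # trend estimates
c_hat = np.append(coef[2:], -coef[2:].sum())  # recover c_tau from the constraint
y_hat = x - design @ coef                     # residuals: estimates of (Y_t)

The residuals y_hat then play the role of the observations of the stationary part (Y_t) in step 2.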

9.4. Exercises

EXERCISE 9.1.– Consider the Buys Ballot model:

X_t = a + bt + c_1 δ_{1t} + c_2 δ_{2t} + ε_t,   t = 1, 2, …,

where (ε_t) is a sequence of i.i.d. real random variables with Eε_t = 0 and Eε_t² = σ² < ∞, δ_{1t} = 1 if t is odd and 0 otherwise, and δ_{2t} = 1 − δ_{1t}.

1) Show that the model is not identifiable, i.e. there exist several values of the parameters giving the same function t ↦ EX_t. Show that it is identifiable if one imposes c_1 + c_2 = 0. We will impose this condition in the following.

2) Supposing that we use T = 2N observations, corresponding to two half-yearly observations per year over N years, give the value of the least squares estimators obtained by minimizing:

∑_{t=1}^{2N} ( X_t − a − bt − c_1 δ_{1t} − c_2 δ_{2t} )².

Show that the estimators are unbiased.
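
As a complement (and not a substitute for the requested computation), a minimal Monte Carlo sketch can check the unbiasedness empirically; the true parameter values below are arbitrary, and the constraint c_1 + c_2 = 0 is enforced by the substitution c_2 = −c_1, so that the seasonal term becomes c_1(δ_{1t} − δ_{2t}).

import numpy as np

rng = np.random.default_rng(0)
N, reps = 50, 2_000
a, b, c1, sigma = 1.0, 0.2, 0.7, 1.0   # arbitrary true values; c2 = -c1
t = np.arange(1, 2 * N + 1)
d1 = (t % 2 == 1).astype(float)        # delta_{1t}
# columns: 1, t, delta_{1t} - delta_{2t}
design = np.column_stack([np.ones(2 * N), t, 2.0 * d1 - 1.0])

estimates = np.empty((reps, 3))
for r in range(reps):
    x = design @ np.array([a, b, c1]) + rng.normal(0.0, sigma, 2 * N)
    estimates[r] = np.linalg.lstsq(design, x, rcond=None)[0]

print(estimates.mean(axis=0))          # close to (a, b, c1): unbiasedness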
