We have just looked at how correlation is calculated using an easy to understand example, provided historical data, but what is important to simulation is how correlation is used for future projections. If we believe there is a correlation between items that we are simulating, we will have to account for it in our simulation; otherwise the end results can be very different.
The two most common areas that correlation enters into financial simulations are industry- and regional-based correlations. For instance, if we are invested in a telecommunications company and the telecommunications industry as a whole is in decline, we would expect a correlation in the performance of our individual telecommunication company investment. Likewise, if we are invested in a company that services Asian markets and Asian markets are in decline, we might expect our individual invested company to experience decline because of the regional correlation. Let's take a look at a simple simulation of 10 corporate loans with a single correlation factor and see how the correlation affects the expected default of the basket of loans.
=NORMSINV(D7)
Copy and paste this formula over the range F7:F16. You should also label F6 with the title “Default Threshold”.
The opposite of the noncorrelated component to corporate defaults are concepts that might affect the other companies if it occurred in one of our ten exposures. Earlier we mentioned examples that could include industry or regional problems. Keep in mind there can always be other correlations beyond these two. To account for this, we create one normal random variable that is the same for each of the 10 entities.
The way we induce correlation then is to use a certain percentage of the noncorrelated normal random variable versus the correlated normal random variable, depending on the assumed level of correlation. This can be implemented using the following formula:
=(Correlation Coefficient)*Correlated Normal Random+((1-Correlation
Coefficient^2))^0.5*Non-Correlated Normal Random
Let's piece these concepts together by first entering the following labels in the corresponding cells:
I6: “Non-Corr Component” K6: “Combined Test Value”
J6: “Corr Component” L6: “Default Test”
=NORMSINV(RAND())
Copy and paste this across the range I7:I16. This is the noncorrelated component, which is demonstrated by the fact that every time the sheet is calculated there will be a unique normal random number for each exposure.
=NORMSINV(RAND())
Rather than copying this formula down over a similar range as in step 5, enter the following formula in J8:
=J7
Now copy this formula over the range J8:J16. Notice that we created the same normal random number for each exposure.
=$E$4*J7+((1-$E$4^2)^0.5)*I 7
Copy and paste this formula over the range K7:K16. Notice that this formula takes the correlated component and multiplies it by the correlation coefficient and adds to it the noncorrelated component multiplied by the square root of one minus the correlation coefficient squared (the non-correlated factor).
=IF(K7<F7,1,0)
This simulates whether a default occurred against the distribution formed by the exposure's default probability by entering a 1 if a default occurred and a 0 if it did not. Copy and paste this formula over the range L7:L16.
=IF(L7=1,“Default”,“No Default”)
Copy and paste this formula over the range M7:M16.
=NORMSINV(RAND())
Copy and paste this over the range M33:M5033. This is the correlated component that will be the same for all 10 exposures during a single iteration. Notice that it will change for each iteration though.
=IF((($E$4)*$M34+((1-$E$4^2))^0.5*NORMSINV(RAND()))
<OFFSET($F$6,C$32,0),1,0)
While this formula looks complicated it is actually a condensed version of the calculations we did in steps 1 to 9. For each exposure this formula takes the correlation coefficient and multiplies it by the same normal random number (representing the correlated component) and adds this to the product of the square root of one minus the correlation coefficient squared multiplied by a unique normal random number. The sum of these two calculations is then tested against the default thresholds from the tabled data starting in F7. Note that an OFFSET function is used to look up the default threshold for each exposure. If the calculation is less than the default threshold a 1 is returned; otherwise a zero is returned. Copy and paste this over the range C34:L34.
=SUM(C34:L34)
Copy and paste this formula over the range O34:O5033.
=COUNTIF($O$34:$O$5033,B20)
Copy and paste this formula over the range B21:L21. Notice that this tells us how many times we saw 0 exposures out of 10 default, 1 exposure out of 10 default, and so on. We can then create a column chart to visualize the data as shown in Figure 3.6.
Now that the actual construction of this Model Builder is complete we should take some time to interpret what we have done. Notice that the histogram distribution of results is a normal distribution centered on 4.5. This makes sense since we assumed a low correlation, a normal distribution, and all of our companies have a default probability of 45 percent. We can see the distribution looks like a bell curve with the most common scenarios having 4–5 defaults, but there are scenarios with 0 defaults and with 10 defaults.
Change the correlation coefficient in E4 to 0 percent. Make sure the sheet is calculated by pressing F9 if you have Excel set to manual mode (which is a good idea when creating so many formulas on the sheet!). You should see the histogram distribution remain centered on 4.5, but there are fewer scenarios with 0 or 10 defaults, as shown in Figure 3.7.
The interesting effect, though, is when we change the correlation coefficient the other way. Change the correlation coefficient in E4 to 90 percent and make sure the sheet is calculated. The histogram distribution should now change to an extreme barbell distribution with peaks at 0 and 10, as seen in Figure 3.8.
This is because with a high correlation there is a very high probability of doing very well with no defaults or a very high probability of doing very poorly with nearly the entire basket defaulting. This is an effect similar to what we saw in 2007 when the correlation among mortgage-backed securities was revised to a much higher percentage. Rather than having a diverse array of default probabilities the underlying mortgage securities were extremely correlated and many pools of loans started to go into delinquency and default at the same time.
Another interesting characteristic that should be noted about the simulation is that the individual asset default rates among all iterations are close to the assumed default probability, even though the correlation among the assets could be very high. This can be checked by summing up a row of defaults for a single asset and dividing that sum by the total number of iterations. For example, the sum of C34:C5033 divided by 5000 should be approximately 45 percent (i.e., the original default probability that was assumed).