We begin with the standard definition of the multivariate normal distribution
when the variance-covariance matrix
is positive definite. Define
A key step in the development of the numerical methods described by Genz
(1992) was to transform F using
,
where C is
the Cholesky factor of
(
). After this transformation
We wish to generalise this approach for cases where
is positive
semi-definite, and show that a similar definition of F can also be used in
these cases. Let
be a singular value decomposition for
,
where U is an
orthogonal matrix and D is an
diagonal matrix with nonnegative diagonal entries
.
If k is the rank of
,
and k<m, then
.
Now define
,
for
.
Then
where
,
so
is positive definite, and
is properly defined. Next let
,
so
,
and let
.
Then
Taking the limit as
approaches zero, we have
where V = V(0). V can be written as
,
where
is an
matrix with its first k columns as
the first k columns of U and remaining m-k columns as columns of zeros,
because
.
For a final step, we determine an
orthogonal matrix Qso that C=VQ' is lower triangular. The Q required for this step is an
identity matrix except for its principal
submatrix
which is the
orthogonal matrix required to make the
principal
submatrix of V lower triangular (see the book by Golub
and Van Loan, 1996, for a description of how such a Q can be determined).
Then, if
,
,
where C is lower triangular and
cij = 0 if j>k.
The integration region defined by
places
no constraint on the variables
,
and all
possible values between
and
for these variables are
consistent with the definition of the integration region, so the innermost
m-k integrals are all equal to one. Therefore
where C is now the
matrix obtained by removing the original m-kzero columns from the original C and
.
This matrix C can be computed directly using the generalised
Cholesky decomposition algorithm described by Healy (1968).
Following the procedure used by Genz (1992), the next step is to use the lower
triangular structure of C to rewrite each of the
inequalities
and explicitly define
the integration limits for F to give F in the form
where
and
,
for
.
But this definition of F is not complete if k<m, because we must take into
account the m-k additional constraints
,
for
,
that the k integration variables must satisfy. There are various cases
to consider. In order to introduce the general cases, we first consider
the case where
for all i>k.
In this case we only need to place additional constraints on yk, and
modify the definitions of a'k and b'k. We first rewrite the last
m-k+1 constraints in the form
,
for
.
Then we divide these constraints into two groups, the first group consisting of
the constraints where
ci,k > 0 and the second group consisting of
the constraints where
ci,k < 0. For both groups, we divide by
ci,k to produce explicit constraints on yk, but for the second
group we must change the order of the inequalities. The revised limits
for yk can now be written as
and
In the cases where ci,k=0 for some i's, with i>k, the associated
constraints for these i's do not affect yk, so the constraints for
some of other variables need to be adjusted. For these general cases, we first
reorder all of the constraints into groups of sizes
,
with
,
so that if we denote the reordered constraints
by
,
then
the first l1 rows of
have the form
,
the next l2 rows of
have the form
,
and so on
with the last lk rows of
in the form
,
where *
denotes a nonzero and ? denotes zero or nonzero. Then, for each i,
,
the set of li constraints are rewritten and merged
to produce a single constraint on yi using the procedure that we described
for yk. When this process is complete, the integral for F can
be written as
In order to put F into a form that is easy to use with standard numerical
integration methods, two more transformations are required. First, let
,
for
i = 1,2,..kso that
.
Then
with
and
Finally, let
zi=di+(ei-di)ui, for
,
so
dzi=(ei-di)dui. Then
The innermost integral is one, so the numerical problem involves the
estimation of a (k-1) dimensional integral.
Alan C Genz
2001-02-09