The message length expression can also be extended to multi-variate distributions. However, there is a change in notation and a clarification of the meaning for certain variables. As for univariate distributions, the length of the first part of the message is represented mathematically by -
Where
is a n-dimensional vector fully describing the model. Just
like the one-dimensional case,
is approximated to a set of
values. However,
is confined to a hyper-cube of values
rather than an interval of values. An interval being a hyper-cube of
dimension one.
The hypercube has sides of length
.
For ease of
calculation, a symmetric cube is assumed
(
)
and
that the prior
lies uniformly about the cube. This
means that the volume of the cube is
.
Furthermore, it is assumed that each of the parameters
are
independent of each other. Therefore, the length of the the message (c.f.
(
)) is
Applying the midpoint integration rule, the first part of the message can be approximated to be
| (45) |
As before (subsection
), the techniques
Wallace [#!Wallace:2000!#] has suggested, can be applied to ensure that
does not exceed 1.
As with the one-dimensional case (subsection
), the Taylor series
expansion of the negative-log-likelihood is taken -
The expectation values (ie integrate over the hypercube) of the odd-order terms
evaluate to zero as they are an odd function about
,
along at
least one axis. With the second order case, when
,
the terms also
evaluate to zero as they are an odd function - along two axes - about
.
With the fourth order case there are two scenarios when the
coefficients will be non-zero. The first is when i=j=k=l and the other is
where there are two groups
.
The resulting message length
expression is
where
and assuming that
and
.
The message length can then be minimised with respect to the spacing parameter
.
Since each of the parameters are independent, this can be
done by minimising the message length with respect to
.
Generally speaking, this is a set of n non-linear equations, which is
difficult to solve in practice, even though a unique solution exists
theoretically. However, there was a previous assumption that the hypercube was
symmetrical. Therefore,
and the set
of n non-linear equation collapses into a single non-linear equation. The
assumption that the matrix Gij is symmetrical has also been made. This is
reasonable as
are assumed to be independent of each other.
Therefore, the optimal spacing can be derived by solving a single instance of
equation (
), which
can be reformulated (with the substitution
)
as
This results in the expression for
as
As a consistency check, it has been verified that when n=1, the message length and spacing equations collapse back to the one-dimensional fourth order case. It is trivial to check by noting the following changes