Expanding the negative-log-likelihood to the fourth order results in
The expectation of the zeroth, first and second order terms are as in subsection
. The expectation
value of the third order term
is zero, as it is an odd
function (of
)
about
.
The expectation value of the
fourth order is
Since the second-order (
) and
fourth-order terms (
) are slightly complex,
the expressions are simplified via variable substitutions.
This results in the fourth order Taylor series approximation of the Message Length as
The optimal spacing is derived by finding the value of
that
minimises the message length. For simplicity, a variable substitution of
has then been made. Furthermore, to ensure that the
decoder can understand the message, (subsection
),
and
were replaced by their respective expectation
values.
The positive solution of the quadratic solution has been taken, as it is assumed
that
and
.
This would generally depend on the
``negative-log-likelihood'' function and
might need to be altered
to provide a realistic solution. Therefore, the fourth order approximation for
the message length is
In subsection
,
the assumption that
was made to preserve
invariance. In the fourth order case, even if the assumptions
and its fourth order equivalent
were made, the message length expression still
would not be invariant. However, numerical experiments have shown that these
assumptions are required in order to produce realistic results. Therefore, the
fourth order message length expression is simplified to -
At this stage, it is useful to check that in the case when
,
our
fourth order approximation reverts back to the second order approximation.
Obviously, equation (
)
corresponds to the second order case when
- remembering that
.
However, naively,
when
.
In such a case, l'Hospital's rule is required to find the
answer.
![]() |
(40) |
This provides the same answer as the second order approximation. Albrecht [#!Albrecht:private!#] has mentioned that l'Hospital's rule assumes that the numerator and denominator are independent variables, whereas the above limit was taken in function-space. Therefore, further proof are required to place the work on a sound mathematical basis.