Once an optimal coding region is found, it would be useful to convert it to
a point estimate of the parameter. The parameter estimation is derived from the
optimal coding region using the posterior probability function
and the probability
that future data y is generated given the
model
[#!Dowe:private!#]. The method is convolving
[#!Dowe.Baxter.Oliver.Wallace:1998!#] the posterior
over the
probability of future data
to produce a population of future
expected data, then finding the most likely model using a Maximum Likelihood
fit.
The method for generating point estimate from an optimal coding region is similar to the method described by Dowe et.al. [#!Dowe.Baxter.Oliver.Wallace:1998!#] for deriving the minEKL. The difference between MMLD and minEKL, is that integration occurs over the optimal coding region for MMLD and not the whole parameter-space as for minEKL.
where
is the
normalisation constant.
is then the maximum likelihood fit to
.
In multiple dimensions, this equation converts to
where
is the parameter affecting the data yi and
is the normalisation constant. Again
is then the maximum
likelihood fit to
.