Next: Mixed Strategy
Up: Action Selection
Previous: Action Selection
A deterministic strategy based upon the decision network involved the Bayesian
Poker Player performing the action which had the highest EW at that point in
the game. The results in Figure 7 show that in testing
against the earlier version of BPP which used betting curves to select an
action, the deterministic version performed extremely well, consistently
beating its opponent and accumulating winnings at a rate of
betting units per game (
).
These gains may have been in part due to the lack of effective opponent
modeling evident within the earlier model, which was unable to identify and
exploit the predictable deterministic strategy employed. As a result of this
weakness, a more ``optimal'' player would clearly have the advantage over a
randomised model and the results of the self-play experiments should not be
considered too heavily when assessing any improvement in performance. A better
assessment of the performance merits of a deterministic strategy would be
gained through competition with experienced human opponents.
Figure 7:
Comparitive performances of BPP implementing a decision network.
|
|
Next: Mixed Strategy
Up: Action Selection
Previous: Action Selection
Jason R Carlton
2000-11-13