© 1979 by Biometrika Trust
A dynamic allocation index for the discounted multiarmed bandit problem
Mathematical Institute Oxford
Department of Mathematics, Polytechnic of Wales Llantrisant, Mid-Glamorgan
Earlier work by the present authors has established the existence of and a characterization of a priority index giving the Bayes rule for the discounted multiarmed bandit problem. The calculation of this index is described and illustrated, and the results obtained briefly discussed.
Key Words: Dynamic allocation index Sequential decision procedure Two-armed bandit