© 1983 by Biometrika Trust
MISCELLANEA |
A comparison of sequential treatment allocation rules
Statistical Laboratory, University of Manchester
An experiment is considered which consists of a series of trials, where at each trial one of two treatments must be used, the outcome, success or failure, being known immediately; this is often referred to as a two-armed bandit. It is required to find a rule for choosing a treatment at each trial which meets, as far as possible, two objectives: (a) to maximize the use of the better treatment, and (b) to minimize the probability of wrongly identifying the better treatment at the end of the experiment. A number of such rules are compared using computer simulation and it is found that an easy-to-use rule based on a dynamic allocation index performs well for a wide range of model parameters.
Key Words: Bayes rule Dynamic allocation index Multiarmed bandit Randomized treatment allocation Sequential medical trial