© 1976 by Biometrika Trust
Estimating the number of unseen species: How many words did Shakespeare know?
Department of Statistics, Stanford University California
Shakespeare wrote 31534 different words, of which 14376 appear only once, 4343 twice, etc. The question considered is how many words he knew but did not use. A parametric empirical Bayes model due to Fisher and a nonparametric model due to Good & Toulmin are examined. The latter theory is augmented using linear programming methods. We conclude that the models are equivalent to supposing that Shakespeare knew at least 35000 more words.
Key Words: Empirical Bayes Euler transformation Linear programming Negative binomial Vocabulary
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. Lijoi, R. H. Mena, and I. Prunster Bayesian Nonparametric Estimation of the Probability of Discovering New Species Biometrika, December 1, 2007; 94(4): 769 - 786. [Abstract] [PDF] |
||||
![]() |
B. J. Paster, S. K. Boches, J. L. Galvin, R. E. Ericson, C. N. Lau, V. A. Levanos, A. Sahasrabudhe, and F. E. Dewhirst Bacterial Diversity in Human Subgingival Plaque J. Bacteriol., June 15, 2001; 183(12): 3770 - 3783. [Abstract] [Full Text] |
||||
![]() |
I. Kroes, P. W. Lepp, and D. A. Relman Bacterial diversity within the human subgingival crevice PNAS, December 7, 1999; 96(25): 14547 - 14552. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. KOLATA Shakespeare's New Poem: An Ode to Statistics: Two statisticians are using a powerful method to determine whether Shakespeare could have written the newly discovered poem that has been attributed to him Science, January 24, 1986; 231(4736): 335 - 336. [PDF] |
||||



