© 1975 by Biometrika Trust
A note on data-splitting for the evaluation of significance levels
Department of Mathematics, Imperial College London
It has sometimes been suggested that to overcome difficulties arising in significance tests when the effects tested are selected in the light of the data, the data should be split randomly into two portions. The first portion is used to choose the hypothesis for test and the second portion for the evaluation of significance. After some general criticism of the idea, it is investigated theoretically on a simple problem about normal means. Recommendations are reached about the proportions into which the data should be divided and the theoretical efficiency of the procedure is assessed and found to be quite high.
Key Words: Data splitting Jackknife Multiple comparisons Selection effects in significance tests
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
D. B. Price, D. G. Tinkelman, R. J. Nordyke, S. Isonaka, R. J. Halbert, and for the COPD Questionnaire Study Group {dagger} Scoring System and Clinical Application of COPD Diagnostic Questionnaires Chest, June 1, 2006; 129(6): 1531 - 1539. [Abstract] [Full Text] [PDF] |
||||
