
Biometrika 2001 88(4):921-932; doi:10.1093/biomet/88.4.921
© 2001 by Biometrika Trust

An information matrix test for logistic regression models based on case-control data

Biao Zhang1

1 Department of Mathematics, The University of Toledo, Toledo, Ohio 43606, U.S.A. bzhang{at}math.utoledo.edu

We propose an information-matrix-based goodness-of-fit statistic to test the validity of the logistic regression model based on case-control data. The statistic extends the information matrix test of White (1982) for detecting one-sample parametric model misspecification to the semiparametric profile likelihood setting under a two-sample semiparametric model, which is equivalent to the assumed logistic regression model. The proposed test statistic requires a high-dimensional matrix inversion, but is otherwise easily computed and has an asymptotic chi-squared distribution. It is an alternative to the Kolmogorov–Smirnov-type statistic of Qin & Zhang (1997) and the chi-squared-type statistic of Zhang (1999). It requires neither a bootstrap method to evaluate its critical values nor the grouping of the combined sample data into a finite number of mutually exclusive categories, even when the underlying population distribution is continuous. We demonstrate that the proposed test statistic and its asymptotic distribution may be obtained by fitting the prospective logistic regression model to case-control data. We present simulation results and analyses of three real datasets.
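
The idea behind an information matrix test can be illustrated with a minimal sketch. It is not the statistic proposed in the article: it only fits the prospective logistic regression model to simulated case-control data and computes the averaged sum of the squared score matrix and the score derivative matrix, which should be close to zero when the model is correctly specified. The simulated normal populations, the sample sizes, and the function names `fit_logistic` and `im_discrepancy` are assumptions made for illustration. Turning the discrepancy into a chi-squared statistic additionally requires the covariance matrix derived in the article, which is not reproduced here.

```python
import numpy as np

def fit_logistic(X, y, iters=50, tol=1e-10):
    """Newton-Raphson maximum likelihood fit; X must include an intercept column."""
    beta = np.zeros(X.shape[1])
    for _ in range(iters):
        p = 1.0 / (1.0 + np.exp(-X @ beta))
        grad = X.T @ (y - p)                       # score vector
        hess = (X * (p * (1 - p))[:, None]).T @ X  # negative score derivative
        step = np.linalg.solve(hess, grad)
        beta += step
        if np.max(np.abs(step)) < tol:
            break
    return beta

def im_discrepancy(X, y, beta):
    """Average of (squared score matrix + score derivative matrix), lower triangle."""
    p = 1.0 / (1.0 + np.exp(-X @ beta))
    # Per observation: (y - p)^2 x x^T  -  p (1 - p) x x^T
    w = (y - p) ** 2 - p * (1 - p)
    D = (X * w[:, None]).T @ X / len(y)
    return D[np.tril_indices(X.shape[1])]

# Hypothetical case-control sample: controls ~ N(0, 1), cases ~ N(1, 1).
# Two normal populations with equal variance satisfy the logistic model,
# here with slope 1.
rng = np.random.default_rng(0)
n0, n1 = 500, 500
x = np.concatenate([rng.normal(0.0, 1.0, n0), rng.normal(1.0, 1.0, n1)])
y = np.concatenate([np.zeros(n0), np.ones(n1)])
X = np.column_stack([np.ones_like(x), x])

beta_hat = fit_logistic(X, y)
d_hat = im_discrepancy(X, y, beta_hat)
print("estimated coefficients:", beta_hat)
print("information matrix discrepancy:", d_hat)
```

Entries of the discrepancy vector that are large relative to their sampling variability would point to misspecification of the logistic model.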

Key Words: Biased sampling problem; Case-control data; Chi-squared; Consistency; Fisher information matrix; Moore–Penrose generalised inverse; Local alternative; Mixture sampling; Profile likelihood; Score derivative matrix; Squared score matrix


Received October 1999. Revised August 2000






