Biometrika Advance Access originally published online on April 1, 2009
Biometrika 2009 96(2):371-382; doi:10.1093/biomet/asp002
Article |
Adjusting for covariate effects on classification accuracy using the covariate-adjusted receiver operating characteristic curve
Division of Public Health Sciences, Fred Hutchinson Cancer, Research Center, 1100 Fairview Avenue North, Seattle, Washington 98109, U.S.A. hjanes{at}scharp.org mspepe{at}u.washington.edu
Received for publication 1 May 2007. Revision received 1 June 2008.
Recent scientific and technological innovations have produced an abundance of potential markers that are being investigated for their use in disease screening and diagnosis. In evaluating these markers, it is often necessary to account for covariates associated with the marker of interest. Covariates may include subject characteristics, expertise of the test operator, test procedures or aspects of specimen handling. In this paper, we propose the covariate-adjusted receiver operating characteristic curve, a measure of covariate-adjusted classification accuracy. Nonparametric and semiparametric estimators are proposed, asymptotic distribution theory is provided and finite sample performance is investigated. For illustration we characterize the age-adjusted discriminatory accuracy of prostate-specific antigen as a biomarker for prostate cancer.
Key Words: Classification accuracy Covariate effect Receiver operating characteristic curve Sensitivity Specificity
References
-
Alonzo T. A., Pepe M. S. Distribution-free ROC analysis using binary regression techniques. Biostatistics (2002) 3:421–32.[Abstract]
Baker S. The central role of receiver operating characteristic (ROC) curves in evaluating tests for the early detection of cancer. J. Nat. Cancer Inst. (2003) 95:511–5.
Cai T., Pepe M. S. Semi-parametric ROC analysis to evaluate biomarkers for disease. J. Am. Statist. Assoc. (2002) 97:1099–107.[CrossRef][Web of Science]
Cole T. J. The LMS method for constructing normalized growth standards. Eur. J. Clin. Nutr. (1990) 44:45–60.[Web of Science][Medline]
Cole T. J., Green P. J. Smoothing reference centile curves: the LMS method and penalized likelihood. Statist. Med. (1992) 11:1305–19.[CrossRef]
Etzioni R., Falcon S., Gann P. H., Kooperberg C. L., Penson D. F., Stampfer M. J. Prostate-specific antigen and free prostate-specific antigen in the early detection of prostate cancer: do combination tests improve detection. Cancer Epidemiol. Biomark. Prev. (2004) 13:1640–5.
Ferguson T. S. A Course in Large Sample Theory (1996) London: Chapman and Hall.
Gann P. H., Hennekens C. H., Stampfer M. J. A prospective evaluation of plasma prostate-specific antigen for detection of prostatic cancer. J. Am. Med. Assoc. (1995) 273:289–94.
Hanley J. A. The robustness of the binormal assumptions used in fitting ROC curves. Med. Decis. Making (1988) 8:197–203.
Hanley J. A. The use of the binormal model for parametric ROC analysis of quantitative diagnostic tests. Statist. Med. (1996) 15:1575–85.[CrossRef]
Heagerty P., Pepe M. S. Semi-parametric estimation of regression quantiles with application to standardizing weight for height and age in U.S. children. Appl. Statist. (1999) 48:533–51.
Janes H., Pepe M. S. Matching in studies of classification accuracy: implications for bias, efficiency, and assessment of incremental value. Biometrics (2008) 64:1–9.[CrossRef][Web of Science][Medline]
Lin Y., Jeon Y. Discriminant analysis through a semi-parametric model. Biometrika (2003) 90:379–92.
Pepe M. S. An interpretation for the ROC curve and inference using GLM procedures. Biometrics (2000) 56:352–9.[CrossRef][Web of Science][Medline]
Pepe M. S. The Statistical Evaluation of Medical Tests for Classification and Prediction (2003) Oxford: Oxford University Press.
Pepe M. S., Cai T. The analysis of placement values for evaluating discriminatory measures. Biometrics (2004) 60:528–35.[CrossRef][Web of Science][Medline]
Pepe M. S., Etzioni R., Feng Z., Potter J. D., Thompson M. L., Thornquist M., Winget M., Yatsui Y. Phases of biomarker development for early detection of cancer. J. Nat. Cancer Inst. (2001) 93:1054–61.
Silverman B. W. Density Estimation for Statistics and Data Analysis (1986) London: Chapman and Hall.
Swets J. A. Indices of discrimination or diagnostic accuracy: their ROCs and implied methods. Psychol. Bull. (1986) 99:100–17.[CrossRef][Web of Science][Medline]
Zheng Y., Heagerty P. J. Semiparametric estimation of time-dependent ROC curves for longitudinal marker data. Biostatistics (2004) 5:615–32.[Abstract]
| ||||||||||||||||||||||||||||||||||||||||||||||||