Biometrika Advance Access originally published online on February 8, 2008
Biometrika 2008 95(1):241-247; doi:10.1093/biomet/asm083
Miscellanea |
A note on path-based variable selection in the penalized proportional hazards model
School of Statistics, University of Minnesota, 224 Church Street S.E., Minneapolis, Minnesota 55455, U.S.A. hzou{at}stat.umn.edu
Received for publication 1 May 2006. Revision received 1 June 2007.
We propose an efficient and adaptive shrinkage method for variable selection in the Cox model. The method constructs a piecewise-linear regularization path connecting the maximum partial likelihood estimator and the origin. Then a model is selected along the path. We show that the constructed path is adaptive in the sense that, with a proper choice of regularization parameter, the fitted model works as well as if the true underlying submodel were given in advance. A modified algorithm of the least-angle-regression type efficiently computes the entire regularization path of the new estimator. Furthermore, we show that, with a proper choice of shrinkage parameter, the method is consistent in variable selection and efficient in estimation. Simulation shows that the new method tends to outperform the lasso and the smoothly-clipped-absolute-deviation estimators with moderate samples. We apply the methodology to data concerning nursing homes.
Key Words: Adaptive path Lasso Oracle property Penalized partial likelihood Smoothly-clipped-absolute deviation penalty Variable selection
References
-
Anderson P. K., Gill R. D. Cox's regression model for counting processes: a large sample study. Ann. Statist. (1982) 10:1100–20.[CrossRef]
Cai J., Fan J., Li R., Zhou H. Variable selection for multivariate failure time data. Biometrika (2005) 92:303–16.
Cox D. R. Regression models and life tables (with Discussion). J. R. Statist. Soc. (1972) B 74:187–220.
Cox D. R. Partial likelihood. Biometrika (1975) 62:269–76.
Efron B., Hastie T., Johnstone I., Tibshirani R. Least angle regression (with Discussion). Ann. Statist. (2004) 32:407–99.[CrossRef]
Fan J., Li R. Variable selection via nonconcave penalized likelihood and its oracle properties. J. Am. Statist. Assoc. (2001) 96:1348–60.[CrossRef][Web of Science]
Fan J., Li R. Variable selection for Cox's proportional hazards model and frailty model. Ann. Statist. (2002) 30:74–99.[CrossRef]
Geyer C. On the asymptotics of constrained M-estimation. Ann. Statist. (1994) 22:1993–2010.[CrossRef]
Hunter D., Li R. Variable selection using MM algorithms. Ann. Statist. (2005) 33:1617–42.[CrossRef]
Knight K., Fu W. Asymptotics for lasso-type estimators. Ann. Statist. (2000) 28:1356–78.[CrossRef]
Morris C., Norton E., Zhou X.H. Parametric duration analysis of nursing home usage. Case Studies in Biometry—Lange N., Ryan L., Billard D., Brillinger D., Conquest L., Greenhouse J., eds. (1994) New York: Wiley. 231–48.
Rosset S., Zhu J. Piecewise linear regularized solution paths. Ann. Statist. (2007) 35:1012–30.[CrossRef]
Tibshirani R. Regression shrinkage and selection via the lasso. J. R. Statist. Soc. (1996) B 58:267–88.
Tibshirani R. The lasso method for variable selection in the Cox model. Statist. Med. (1997) 16:385–95.[CrossRef]
Zhu J., Rosset S., Hastie T., Tibshirani R. 1-norm support vector machines. Advances in Neural Information Processing Systems—Thrun S., Saul L., Schölkopf B., eds. (2004) 16. Cambridge, MA: MIT Press.
Zou H. The adaptive lasso and its oracle properties. J. Am. Statist. Assoc. (2006) 101:1418–29.[CrossRef][Web of Science]
Zou H., Hastie T. Regression and variable selection via the elastic net. J. R. Statist. Soc. (2005) B 67:301–20.[CrossRef]
Zou H., Hastie T., Tibshirani R. On the "degrees of freedom" of the lasso. Ann. Statist. (2007) 35:2173–92.[CrossRef]
This article has been cited by other articles:
![]() |
I. Sohn, J. Kim, S.-H. Jung, and C. Park Gradient lasso for Cox proportional hazards model Bioinformatics, July 15, 2009; 25(14): 1775 - 1781. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Wang, B. Nan, N. Zhu, and J. Zhu Hierarchically penalized Cox regression with grouped variables Biometrika, June 1, 2009; 96(2): 307 - 322. [Abstract] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||

