Biometrika Advance Access originally published online on August 5, 2007
Biometrika 2007 94(3):615-625; doi:10.1093/biomet/asm043
Copyright © 2007 Biometrika Trust
Articles |
Partial inverse regression
Department of Statistics, North Carolina State University, Raleigh, North Carolina 27695, U.S.A.
School of Statistics, University of Minnesota, St Paul, Minnesota 55108, U.S.A.
Graduate School of Management, University of California, Davis, California 95616, U.S.A.
li{at}stat.ncsu.edu
dennis{at}stat.umn.edu
cltsai{at}ucdavis.edu
Received for publication 1 March 2005. Revision received 1 December 2006.
In regression with a vector of quantitative predictors, sufficient dimension reduction methods can effectively reduce the predictor dimension, while preserving full regression information and assuming no parametric model. However, all current reduction methods require the sample size n to be greater than the number of predictors p. It is well known that partial least squares can deal with problems with n < p. We first establish a link between partial least squares and sufficient dimension reduction. Motivated by this link, we then propose a new dimension reduction method, entitled partial inverse regression. We show that its sample estimator is consistent, and that its performance is similar to or superior to partial least squares when n < p, especially when the regression model is nonlinear or heteroscedastic. An example involving the spectroscopy analysis of biscuit dough is also given.
Key Words: Partial least squares Single-index model Sliced inverse regression
References
-
Brown P. J., Fearn T., Vannucci M. Bayesian wavelet regression on curves with application to a spectroscopic calibration problem. J. Am. Statist. Assoc. (2001) 96:398–408.[CrossRef][ISI]
Carroll R., Li K. C. Measurement error regression with unknown link: Dimension reduction and visualization. J. Am. Statist. Assoc. (1992) 87:1040–50.[CrossRef][ISI]
Cook R. D. Graphics for regressions with a binary response. J. Am. Statist. Assoc. (1996) 91:983–92.[CrossRef][ISI]
Cook R. D. Regression Graphics: Ideas for Studying Regressions Through Graphics (1998) New York: Wiley.
Cook R. D., Weisberg S. Discussion of a paper by K.-C. Li. J. Am. Statist. Assoc. (1991) 86:328–32.[CrossRef][ISI]
Craven P., Wahba G. Smoothing noisy data with spline functions. Numer. Math. (1979) 31:377–403.[CrossRef]
Eaton M. A characterization of spherical distributions. J. Mult. Anal. (1986) 20:272–76.[CrossRef]
Fan J., Gijbels I. Local Polynomial Modelling and its Application (1996) London: Chapman and Hall.
Fan J., Yao Q. Efficient estimation of conditional variance functions in stochastic regression. Biometrika (1998) 85:645–60.
Garthwaite P. H. An interpretation of partial least squares. J. Am. Statist. Assoc. (1994) 89:122–7.[CrossRef][ISI]
Green P. J., Silverman B. W. Nonparametric Regression and Generalized Linear Models: A Roughness Penalty Approach (1994) London: Chapman and Hall.
Hall P., Li K. C. On almost linearity of low dimensional projections from high dimensional data. Ann. Statist. (1993) 21:867–89.
Härdle W., Hall P., Ichimura H. Optimal smoothing in single-index models. Ann. Statist. (1993) 21:157–78.
Helland I. S. On the structure of partial least squares regression. Commun. Statist. (1988) B 17:581–607.
Helland I. S. Partial least squares regression and statistical models. Scand. J. Statist. (1990) 17:97–114.
Helland I. S. Maximum likelihood regression on relevant components. J. R. Statist. Soc. (1992) B 54:637–47.
Helland I. S., Alm\Oy T. Comparison of prediction methods when only a few components are relevant. J. Am. Statist. Assoc. (1994) 89:583–91.[CrossRef][ISI]
Li K. C. Sliced inverse regression for dimension reduction (with Discussion). J. Am. Statist. Assoc. (1991) 86:316–42.[CrossRef][ISI]
Li K. C., Duan N. Regression analysis under link violation. Ann. Statist. (1989) 17:1009–52.
Naik P., Tsai C.-L. Partial least squares estimator for single-index models. J. R. Statist. Soc. (2000) B 62:763–71.[CrossRef]
Naes T., Helland I. S. Relevant components in regression. Scand. J. Statist. (1993) 20:239–50.
Osborne B. G., Fearn T., Miller A. R., Douglas S. Application of near infrared reflectance spectroscopy to compositional analysis of biscuits and biscuit doughs. J. Sci. Food Agric. (1984) 35:99–105.[CrossRef][ISI]
Stone M., Brooks R. J. Continuum regression: cross-validated sequentially constructed prediction embracing ordinary least squares, partial least squares and principal components regression (with Discussion). J. R. Statist. Soc. (1990) B 52:237–69.
Wold H. Soft modelling by latent variables: The Nonlinear Partial Least Squares (NIPALS) approach. In: Perspectives in Probability and Statistics, Papers in Honour of M.S. Barlett—Gani J., ed. (1975) London: Academic Press. 117–42.
| ||||||||||||||||||||||||||||||||||||||||||||||||||