Articles |
Joint modelling of paired sparse functional data using principal components
Department of Statistics, Texas A & M University, College Station, Texas 77843, U.S.A. lzhou{at}stat.tamu.edu jianhua{at}stat.tamu.edu carroll{at}stat.tamu.edu
Received for publication 1 December 2006. Revision received 1 March 2008.
We propose a modelling framework to study the relationship between two paired longitudinally observed variables. The data for each variable are viewed as smooth curves measured at discrete time-points plus random errors. While the curves for each variable are summarized using a few important principal components, the association of the two longitudinal variables is modelled through the association of the principal component scores. We use penalized splines to model the mean curves and the principal component curves, and cast the proposed model into a mixed-effects model framework for model fitting, prediction and inference. The proposed method can be applied in the difficult case in which the measurement times are irregular and sparse and may differ widely across individuals. Use of functional principal components enhances model interpretation and improves statistical and numerical stability of the parameter estimates.
Key Words: Functional data Longitudinal data Mixed-effects model Penalized spline Principal component Reduced-rank model
References
-
Dempster A. P., Laird N. M., Rubin D. B. Maximum likelihood from incomplete data via the em algorithm (with Discussion). J. R. Statist. Soc. (1977) B 39:1–38.
Eilers P., Marx B. Flexible smoothing with B-splines and penalties (with Discussion). Statist. Sci. (1996) 89:89–121.
Fahrmeir L., Tutz G. Multivariate Statistical Modelling Based on Generalized Linear Models (1994) New York: Springer.
He G., Müller H.-G., Wang J.-L. Functional canonical analysis for square integrable stochastic processes. J. Mult. Anal. (2003) 85:54–77.[CrossRef]
Hoover D. R., Rice J. A., Wu C. O., Yang L.-P. Nonparametric smoothing estimates of time-varying coefficient models with longitudinal data. Biometrika (1998) 85:809–22.
Huang J. Z., Wu C. O., Zhou L. Varying coefficient models and basis function approximation for the analysis of repeated measurements. Biometrika (2002) 89:111–28.
James G. M., Hastie T. J., Sugar C. A. Principal component models for sparse functional data. Biometrika (2000) 87:587–602.
Laird N., Ware J. Random-effects models for longitudinal data. Biometrics (1982) 38:963–74.[CrossRef][Web of Science][Medline]
Lederman M. M., Connick E., Landay A., Kuritzkes D. R., Spritzler J., Clair M. S., Kotzin B. L., Fox L., Chiozzi M. H., Leonard J. M., Rousseau F., Wade M., D'arc Roe J., Martinez A., Kessler H. Immunological responses associated with 12 weeks of combination antiretroviral therapy consisting of zidovudine, lamivudine & ritonavir: results of AIDS Clinical Trials Group Protocol 315. J. Inf. Dis. (1998) 178:70–9.[Web of Science][Medline]
Leurgans S. E., Moyeed R. A., Silverman B. W. Canonical correlation analysis when the data are curves. J. R. Statist. Soc. (1993) B 55:725–40.
Liang H., Wu H., Carroll R. J. The relationship between virologic and immunologic responses in AIDS clinical research using mixed-effects varying-coefficient models with measurement error. Biostatistics (2003) 4:297–312.[Abstract]
Liang K.-Y., Zeger S. L. Longitudinal data analysis using generalized linear models. Biometrika (1986) 73:13–22.
Moyeed R. A., Diggle P. J. Rates of convergence in semi-parametric modelling of longitudinal data. Aust. J. Statist. (1994) 36:75–93.[CrossRef]
Nelder J. A., Mead R. A simplex method for function minimization. Comp. J. (1965) 7:308–13.
Ramsay J. O., Silverman B. W. Functional Data Analysis (2005) 2nd ed. New York: Springer.
Rice J. A. Functional and longitudinal data analysis: perspectives on smoothing. Statist. Sinica (2004) 14:613–29.
Rice J. A., Wu C. Nonparametric mixed effects models for unequally sampled noisy curves. Biometrics (2001) 57:253–59.[CrossRef][Web of Science][Medline]
Ruppert D., Wand M. P., Carroll R. J. Semiparametric Regression (2003) Cambridge, UK: Cambridge University Press.
Shi M., Weiss R. E., Taylor J. M. G. An analysis of paediatric CD4 counts for acquired immune deficiency syndrome using flexible random curves. Appl. Statist. (1996) 45:151–63.[CrossRef]
Wu C. O., Chiang C.-T., Hoover D. R. Asymptotic confidence regions for kernel smoothing of a varying-coefficient model with longitudinal data. J. Am. Statist. Assoc. (1998) 93:1388–402.[CrossRef][Web of Science]
Wu H., Ding A. Population HIV-1 dynamics in vivo: application models and inference tools for virological data from AIDS clinical trials. Biometrics (1999) 55:410–8.[CrossRef][Web of Science][Medline]
Yao F., Müller H.-G., Wang J.-L. Functional data analysis for sparse longitudinal data. J. Am. Statist. Assoc. (2005) 100:577–90.[CrossRef][Web of Science]
Yao F., Müller H.-G., Wang J.-L. Functional linear regression analysis for longitudinal data. Ann. Statist. (2005) 33:2873–903.[CrossRef]
Zeger S. L., Diggle P. J. Semiparametric models for longitudinal data with application to CD4 cell numbers in HIV seroconverters. Biometrics (1994) 50:689–99.[CrossRef][Web of Science][Medline]
Zellner A. An efficient method of estimating seemingly unrelated regressions, and tests for aggregation bias. J. Am. Statist. Assoc. (1962) 57:348–68.[CrossRef][Web of Science]
| ||||||||||||||||||||||||||||||||||||||||||||||||||