Skip Navigation

Biometrika 2008 95(3):601-619; doi:10.1093/biomet/asn035
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Zhou, L.
Right arrow Articles by Carroll, R. J.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2008 Biometrika Trust

Articles

Joint modelling of paired sparse functional data using principal components

Lan Zhou, Jianhua Z. Huang and Raymond J. Carroll

Department of Statistics, Texas A & M University, College Station, Texas 77843, U.S.A. lzhou{at}stat.tamu.edu jianhua{at}stat.tamu.edu carroll{at}stat.tamu.edu

Received for publication 1 December 2006. Revision received 1 March 2008.

We propose a modelling framework to study the relationship between two paired longitudinally observed variables. The data for each variable are viewed as smooth curves measured at discrete time-points plus random errors. While the curves for each variable are summarized using a few important principal components, the association of the two longitudinal variables is modelled through the association of the principal component scores. We use penalized splines to model the mean curves and the principal component curves, and cast the proposed model into a mixed-effects model framework for model fitting, prediction and inference. The proposed method can be applied in the difficult case in which the measurement times are irregular and sparse and may differ widely across individuals. Use of functional principal components enhances model interpretation and improves statistical and numerical stability of the parameter estimates.

Key Words: Functional data • Longitudinal data • Mixed-effects model • Penalized spline • Principal component • Reduced-rank model



References

    Dempster A. P., Laird N. M., Rubin D. B. Maximum likelihood from incomplete data via the em algorithm (with Discussion). J. R. Statist. Soc. (1977) B 39:1–38.

    Eilers P., Marx B. Flexible smoothing with B-splines and penalties (with Discussion). Statist. Sci. (1996) 89:89–121.

    Fahrmeir L., Tutz G. Multivariate Statistical Modelling Based on Generalized Linear Models (1994) New York: Springer.

    He G., Müller H.-G., Wang J.-L. Functional canonical analysis for square integrable stochastic processes. J. Mult. Anal. (2003) 85:54–77.[CrossRef]

    Hoover D. R., Rice J. A., Wu C. O., Yang L.-P. Nonparametric smoothing estimates of time-varying coefficient models with longitudinal data. Biometrika (1998) 85:809–22.[Abstract/Free Full Text]

    Huang J. Z., Wu C. O., Zhou L. Varying coefficient models and basis function approximation for the analysis of repeated measurements. Biometrika (2002) 89:111–28.[Abstract/Free Full Text]

    James G. M., Hastie T. J., Sugar C. A. Principal component models for sparse functional data. Biometrika (2000) 87:587–602.[Abstract/Free Full Text]

    Laird N., Ware J. Random-effects models for longitudinal data. Biometrics (1982) 38:963–74.[CrossRef][Web of Science][Medline]

    Lederman M. M., Connick E., Landay A., Kuritzkes D. R., Spritzler J., Clair M. S., Kotzin B. L., Fox L., Chiozzi M. H., Leonard J. M., Rousseau F., Wade M., D'arc Roe J., Martinez A., Kessler H. Immunological responses associated with 12 weeks of combination antiretroviral therapy consisting of zidovudine, lamivudine & ritonavir: results of AIDS Clinical Trials Group Protocol 315. J. Inf. Dis. (1998) 178:70–9.[Web of Science][Medline]

    Leurgans S. E., Moyeed R. A., Silverman B. W. Canonical correlation analysis when the data are curves. J. R. Statist. Soc. (1993) B 55:725–40.

    Liang H., Wu H., Carroll R. J. The relationship between virologic and immunologic responses in AIDS clinical research using mixed-effects varying-coefficient models with measurement error. Biostatistics (2003) 4:297–312.[Abstract]

    Liang K.-Y., Zeger S. L. Longitudinal data analysis using generalized linear models. Biometrika (1986) 73:13–22.[Abstract/Free Full Text]

    Moyeed R. A., Diggle P. J. Rates of convergence in semi-parametric modelling of longitudinal data. Aust. J. Statist. (1994) 36:75–93.[CrossRef]

    Nelder J. A., Mead R. A simplex method for function minimization. Comp. J. (1965) 7:308–13.

    Ramsay J. O., Silverman B. W. Functional Data Analysis (2005) 2nd ed. New York: Springer.

    Rice J. A. Functional and longitudinal data analysis: perspectives on smoothing. Statist. Sinica (2004) 14:613–29.

    Rice J. A., Wu C. Nonparametric mixed effects models for unequally sampled noisy curves. Biometrics (2001) 57:253–59.[CrossRef][Web of Science][Medline]

    Ruppert D., Wand M. P., Carroll R. J. Semiparametric Regression (2003) Cambridge, UK: Cambridge University Press.

    Shi M., Weiss R. E., Taylor J. M. G. An analysis of paediatric CD4 counts for acquired immune deficiency syndrome using flexible random curves. Appl. Statist. (1996) 45:151–63.[CrossRef]

    Wu C. O., Chiang C.-T., Hoover D. R. Asymptotic confidence regions for kernel smoothing of a varying-coefficient model with longitudinal data. J. Am. Statist. Assoc. (1998) 93:1388–402.[CrossRef][Web of Science]

    Wu H., Ding A. Population HIV-1 dynamics in vivo: application models and inference tools for virological data from AIDS clinical trials. Biometrics (1999) 55:410–8.[CrossRef][Web of Science][Medline]

    Yao F., Müller H.-G., Wang J.-L. Functional data analysis for sparse longitudinal data. J. Am. Statist. Assoc. (2005) 100:577–90.[CrossRef][Web of Science]

    Yao F., Müller H.-G., Wang J.-L. Functional linear regression analysis for longitudinal data. Ann. Statist. (2005) 33:2873–903.[CrossRef]

    Zeger S. L., Diggle P. J. Semiparametric models for longitudinal data with application to CD4 cell numbers in HIV seroconverters. Biometrics (1994) 50:689–99.[CrossRef][Web of Science][Medline]

    Zellner A. An efficient method of estimating seemingly unrelated regressions, and tests for aggregation bias. J. Am. Statist. Assoc. (1962) 57:348–68.[CrossRef][Web of Science]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?



This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Zhou, L.
Right arrow Articles by Carroll, R. J.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?