Skip Navigation


Biometrika Advance Access originally published online on November 19, 2007
Biometrika 2007 94(4):841-860; doi:10.1093/biomet/asm070
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Vansteelandt, S.
Right arrow Articles by Robins, J.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2007 Biometrika Trust

Articles

Estimation of Regression Models for the Mean of Repeated Outcomes Under Nonignorable Nonmonotone Nonresponse

Stijn Vansteelandt

Department of Applied Mathematics and Computer Sciences Ghent University, 9000 Ghent, Belgium stijn.vansteelandt{at}ugent.be

Andrea Rotnitzky

Department of Economics, Di Tella University, Buenos Aires, Argentina arotnitzky{at}utdt.edu

James Robins

Department of Epidemiology, Harvard School of Public Health, Boston, Massachusetts 02115, U.S.A. robins{at}hsph.harvard.edu

Received for publication 1 April 2005. Revision received 1 April 2007.

We propose a new class of models for making inference about the mean of a vector of repeated outcomes when the outcome vector is incompletely observed in some study units and missingness is nonmonotone. Each model in our class is indexed by a set of unidentified selection-bias functions which quantify the residual association of the outcome at each occasion t and the probability that this outcome is missing after adjusting for variables observed prior to time t and for the past nonresponse pattern. In particular, selection-bias functions equal to zero encode the investigator's a priori belief that nonresponse of the next outcome does not depend on that outcome after adjusting for the observed past. We call this assumption sequential explainability. Since each model in our class is nonparametric, it fits the data perfectly well. As such, our models are ideal for conducting sensitivity analyses aimed at evaluating the impact that different degrees of departure from sequential explainability have on inference about the marginal means of interest. Although the marginal means are identified under each of our models, their estimation is not feasible in practice because it requires the auxiliary estimation of conditional expectations and probabilities given high-dimensional variables. We henceforth discuss the estimation of the marginal means under each model in our class assuming, additionally, that at each occasion either one of the following two models holds: a parametric model for the conditional probability of nonresponse given current outcomes and past recorded data or a parametric model for the conditional mean of the outcome on the nonrespondents given the past recorded data. We call the resulting procedure 2T-multiply robust as it protects at each of the T time points against misspecification of one of these two working models, although not against simultaneous misspecification of both. We extend our proposed class of models and estimators to incorporate data configurations which include baseline covariates and a parametric model for the conditional mean of the vector of repeated outcomes given the baseline covariates.

Key Words: Double robustness • Generalized estimating equation • Intermittent missingness • Longitudinal study • Missing at random • Semiparametric inference



References

    Albert P. S. A transitional model for longitudinal binary data subject to nonignorable missing data. Biometrics (2000) 56:602–8.[CrossRef][Web of Science][Medline]

    Andersson S. A., Perlman M. D. Lattice-ordered conditional-independence models for missing data. Statist. Prob. Lett. (1991) 12:465–86.[CrossRef]

    Deltour I., Richardson S., Le Hesran J.-Y. Stochastic algorithms for Markov models estimation with intermittent missing data. Biometrics (1999) 55:565–73.[CrossRef][Web of Science][Medline]

    Fairclough D. L., Peterson H. F., Cella D., Bonomi P. Comparison of several model-based methods for analysing incomplete quality of life data in cancer clinical trials. Statist. Med. (1998) 17:781–96.[CrossRef]

    Gill R. D., Robins J. M. Sequential models for coarsening and missingness. In: Proc. First Seattle Symp. Biostatist: Survival Anal.—Lin D.Y., Fleming T.R., eds. (1997) New York: Springer. 295–305.

    Gill R. D., van der Laan M. J., Robins J. M. Coarsening at random: characterizations, conjectures and counterexamples. In: Proc. First Seattle Symp. Biostatist: Survival Anal.—Lin D.Y., Fleming T.R., eds. (1997) New York: Springer. 255–94.

    Ibrahim J. G., Chen M.-H., Lipsitz S. R. Missing responses in generalized linear mixed models when the missing data mechanism is nonignorable. Biometrika (2001) 88:551–64.[Abstract/Free Full Text]

    Laird N., Ware J. Random effects models for longitudinal data. Biometrics (1982) 38:963–74.[CrossRef][Web of Science][Medline]

    Lin H., Scharfstein D. O., Rosenheck R. A. Analysis of longitudinal data with irregular, informative follow-up. J. R. Statist. Soc. B (2003) 66:791–813.[CrossRef]

    Little R. J. A., Rubin D. B. Statistical Analysis with Missing Data (1987) New York: Wiley.

    Robins J. M. Non-response models for the analysis of non-monotone non-ignorable missing data. Statist. Med. (1997) 16:21–37.[CrossRef]

    Robins J. M. Robust estimation in sequentially ignorable missing data and causal inference models. Proc. Am. Statist. Assoc. Sec. Bayesian Sci. (2000) 1999. Alexandria, VA: American Statistcal Association. 6–10.

    Robins J. M., Gill R.D. Non-response models for the analysis of non-monotone ignorable missing data. Statist. Med. (1997) 16:39–56.[CrossRef]

    Robins J. M., Rotnitzky A. Recovery of information and adjustment for dependent censoring using surrogate markers. In: AIDS Epidemiology–Methodological Issues—Jewell N., Dietz K., Farewell V., eds. (1992) Boston, MA: Birkhäuser. 297–331.

    Robins J. M., Rotnitzky A. Comment on a paper by P. Bickel and J. Kwon. Statist. Sinica (2001) 11:920–36.

    Robins J. M., Rotnitzky A., Scharfstein D. Sensitivity analysis for selection bias and unmeasured confounding in missing data and causal inference models. In: Statistical Models in Epidemiology: The Environment and Clinical Trials—Halloran M.E., Berry D., eds. (1999) Volume 116. New York: Springer-Verlag. 1–92.

    Robins J. M., Rotnitzky A., Zhao L. P. Estimation of regression coefficients when some regressors are not always observed. J. Am. Statist. Assoc. (1994) 89:846–66.[CrossRef][Web of Science]

    Robins J. M., Rotnitzky A., Zhao L-P. Analysis of semiparametric regression models for repeated outcomes in the presence of missing data. J. Am. Statist. Assoc. (1995) 90:106–21.[CrossRef][Web of Science]

    Rotnitzky A., Robins J. M., Scharfstein D. O. Semiparametric regression for repeated outcomes with nonignorable nonresponse. J. Am. Statist. Assoc. (1998) 93:1321–39.[CrossRef][Web of Science]

    Scharfstein D. O., Rotnitzky A., Robins J. M. Adjusting for nonignorable drop-out using semiparametric nonresponse models. J. Am. Statist. Assoc. (1999) 94:1096–146.[CrossRef][Web of Science]

    Shah A., Laird N., Schoenfeld D. A random-effects model for multiple characteristics with possibly missing data. J. Am. Statist. Assoc. (1997) 92:775–9.[CrossRef][Web of Science]

    Troxel A. B., Fairclough D. L., Curran D., Hahn E. A. Statistical analysis of quality of life with missing data in cancer clinical trials. Statist. Med. (1998) 17:653–66.[CrossRef]

    Troxel A. B., Lipsitz S. R., Harrington D. P. Marginal models for the analysis of longitudinal measurements with nonignorable non-monotone missing data. Biometrika (1998) 85:661–72.[Abstract/Free Full Text]

    van der Laan M. J., Robins J. M. Unified Methods for Censored Longitudinal Data and Causality (2003) New York: Springer-Verlag.

    Zeuzem S., Feinman S. V., Rasenack J., Heathcote E. J., Lai M. Y., Gane E., O'Grady J., Reichen J., Diago M., Lin A., Hoffman J., Brunda M. J. Peginterferon alfa-2a in patients with chronic hepatitis C. New Engl. J. Med. (2000) 343:1666–72.[Abstract/Free Full Text]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?



This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Vansteelandt, S.
Right arrow Articles by Robins, J.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?