Skip Navigation


Biometrika Advance Access originally published online on June 5, 2009
Biometrika 2009 96(3):645-661; doi:10.1093/biomet/asp023
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Beerenwinkel, N.
Right arrow Articles by Sullivant, S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2009 Biometrika Trust

Article

Markov models for accumulating mutations

N. Beerenwinkel

Department of Biosystems Science and Engineering, ETH Zurich, Mattenstrasse 26, 4058 Basel, Switzerland niko.beerenwinkel{at}bsse.ethz.ch

S. Sullivant

Department of Mathematics, North Carolina State University, Raleigh, North Carolina 27607, U.S.A. smsulli2{at}ncsu.edu

Received for publication 1 January 2008. Revision received 1 November 2008.

We introduce and analyze a waiting time model for the accumulation of genetic changes. The continuous-time conjunctive Bayesian network is defined by a partially ordered set of mutations and by the rate of fixation of each mutation. The partial order encodes constraints on the order in which mutations can fixate in the population, shedding light on the mutational pathways underlying the evolutionary process. We study a censored version of the model and derive equations for an EM algorithm to perform maximum likelihood estimation of the model parameters. We also show how to select the maximum likelihood partially ordered set. The model is applied to genetic data from cancer cells and from drug resistant human immunodeficiency viruses, indicating implications for diagnosis and treatment.

Key Words: Bayesian network • Cancer • Genetic progression • HIV • Partially ordered set • Poset



References

    Beerenwinkel N., Drton M. A mutagenetic tree hidden Markov model for longitudinal clonal HIV sequence data. Biostatistics (2007) 8:53–71.[Abstract/Free Full Text]

    Beerenwinkel N., Däumer M., Sing T., Rahnenführer J., Lengauer T., Selbig J., Hoffmann D., Kaiser R. Estimating HIV evolutionary pathways and the genetic barrier to drug resistance. J. Inf. Dis. (2005a) 191:1953–60.[CrossRef][Web of Science][Medline]

    Beerenwinkel N., Eriksson N., Sturmfels B. Evolution on distributive lattices. J. Theor. Biol. (2006) 242:409–20.[CrossRef][Web of Science][Medline]

    Beerenwinkel N., Eriksson N., Sturmfels B. Conjunctive Bayesian networks. Bernoulli (2007) 13:893–909.[CrossRef][Web of Science]

    Beerenwinkel N., Rahnenführer J., Däumer M., Hoffmann D., Kaiser R., Selbig J., Lengauer T. Learning multiple evolutionary pathways from cross-sectional data. J. Comp. Biol. (2005b) 12:584–98. RECOMB 2004.[CrossRef]

    Beerenwinkel N., Rahnenführer J., Kaiser R., Hoffmann D., Selbig J., Lengauer T. Mtreemix: a software package for learning and using mixture models of mutagenetic trees. Bioinformatics (2005c) 21:2106–07.[Abstract/Free Full Text]

    Boucher C. A., O'Sullivan E., Mulder J. W., Ramautarsing C., Kellam P., Darby G., Lange J. M., Goudsmit J., Larder B. A. Ordered appearance of zidovudine resistance mutations during treatment of 18 human immunodeficiency virus-positive subjects. J. Inf. Dis. (1992) 165:105–10.[Web of Science][Medline]

    Brightwell G. R., Winkler P. Counting linear extensions. Order (1991) 8:225–42.[CrossRef]

    Deforche K., Silander T., Camacho R., Grossman Z., Soares M. A., Laethem K. V., Kantor R., Moreau Y., Vandamme A.-M., non B. Workgroup. Analysis of HIV-1 pol sequences using Bayesian networks: implications for drug resistance. Bioinformatics (2006) 22:2975–79.[Abstract/Free Full Text]

    Desper R., Jiang F., Kallioniemi O. P., Moch H., Papadimitriou C. H., Schäffer A. A. Inferring tree models for oncogenesis from comparative genome hybridization data. J. Comp. Biol. (1999) 6:37–51.[CrossRef]

    Fearon E. R., Vogelstein B. A genetic model for colorectal tumorigenesis. Cell (1990) 61:759–767.[CrossRef][Web of Science][Medline]

    Foulkes A., DeGruttola V. Characterizing the progression of viral mutations over time. J. Am. Statist. Assoc. (2003) 98:859–67.[CrossRef][Web of Science]

    Gatenby R. A., Maini P. K. Mathematical oncology: cancer summed up. Nature (2003) 421:321.[CrossRef][Medline]

    Hjelm M., Höglund M., Lagergren J. New probabilistic network models and algorithms for oncogenesis. J. Comp. Biol. (2006) 13:853–65.[CrossRef]

    Iwasa Y., Michor F., Nowak M. A. Evolutionary dynamics of escape from biomedical intervention. Proc. Biol. Sci. (2003) 270:2573–78.[Abstract/Free Full Text]

    Johnson V. A., Brun-Vezinet F., Clotet B., Gunthard H. F., Kuritzkes D. R., Pillay D., Schapiro J. M., Richman D. D. Update of the drug resistance mutations in hiv-1: Spring 2008. Top. HIV Med. (2008) 16:62–68.[Medline]

    Jones S., Chen W.-D., Parmigiani G., Diehl F., Beerenwinkel N., Antal T., Traulsen A., Nowak M. A., Siegel C., Velculescu V. E., Kinzler K. W., Vogelstein B., Willis J., Markowitz S. D. Comparative lesion sequencing provides insights into tumor evolution. Proc. Nat. Acad. Sci. (2008) 105:4283–88.[Abstract/Free Full Text]

    Norris J. Markov Chains (1997) Cambridge, UK: Cambridge University Press.

    Radmacher M. D., Simon R., Desper R., Taetle R., Schäffer A. A., Nelson M. A. Graph models of oncogenesis with an application to melanoma. J. Theor. Biol. (2001) 212:535–48.[CrossRef][Web of Science][Medline]

    Rahnenführer J., Beerenwinkel N., Schulz W. A., Hartmann C., von Deimling A., Wullich B., Lengauer T. Estimating cancer survival and clinical outcome based on genetic tumor progression scores. Bioinformatics (2005) 21:2438–46.[Abstract/Free Full Text]

    Simon R., Desper R., Papadimitriou C. H., Peng A., Alberts D. S., Taetle R., Trent J. M., Schäffer A. A. Chromosome abnormalities in ovarian adenocarcinoma: Iii. using breakpoint data to infer and test mathematical models for oncogenesis. Genes Chromosomes Cancer (2000) 28:106–20.[CrossRef][Web of Science][Medline]

    Stanley R. Enumerative Combinatorics (1999) Cambridge, UK: Cambridge University Press.

    von Heydebreck A., Gunawan B., Füzesi L. Maximum likelihood estimation of oncogenetic tree models. Biostatistics (2004) 5:545–56.[Abstract]

    Weinreich D. M., Delaney N. F., Depristo M. A., Hartl D. L. Darwinian evolution can follow only very few mutational paths to fitter proteins. Science (2006) 312:111–14.[Abstract/Free Full Text]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
M. Gerstung, M. Baudis, H. Moch, and N. Beerenwinkel
Quantifying cancer progression with conjunctive Bayesian networks
Bioinformatics, November 1, 2009; 25(21): 2809 - 2815.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Beerenwinkel, N.
Right arrow Articles by Sullivant, S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?