Biometrika Advance Access originally published online on June 5, 2009
Biometrika 2009 96(3):645-661; doi:10.1093/biomet/asp023
Article |
Markov models for accumulating mutations
Department of Biosystems Science and Engineering, ETH Zurich, Mattenstrasse 26, 4058 Basel, Switzerland niko.beerenwinkel{at}bsse.ethz.ch
Department of Mathematics, North Carolina State University, Raleigh, North Carolina 27607, U.S.A. smsulli2{at}ncsu.edu
Received for publication 1 January 2008. Revision received 1 November 2008.
We introduce and analyze a waiting time model for the accumulation of genetic changes. The continuous-time conjunctive Bayesian network is defined by a partially ordered set of mutations and by the rate of fixation of each mutation. The partial order encodes constraints on the order in which mutations can fixate in the population, shedding light on the mutational pathways underlying the evolutionary process. We study a censored version of the model and derive equations for an EM algorithm to perform maximum likelihood estimation of the model parameters. We also show how to select the maximum likelihood partially ordered set. The model is applied to genetic data from cancer cells and from drug resistant human immunodeficiency viruses, indicating implications for diagnosis and treatment.
Key Words: Bayesian network Cancer Genetic progression HIV Partially ordered set Poset
References
-
Beerenwinkel N., Drton M. A mutagenetic tree hidden Markov model for longitudinal clonal HIV sequence data. Biostatistics (2007) 8:53–71.
Beerenwinkel N., Däumer M., Sing T., Rahnenführer J., Lengauer T., Selbig J., Hoffmann D., Kaiser R. Estimating HIV evolutionary pathways and the genetic barrier to drug resistance. J. Inf. Dis. (2005a) 191:1953–60.[CrossRef][Web of Science][Medline]
Beerenwinkel N., Eriksson N., Sturmfels B. Evolution on distributive lattices. J. Theor. Biol. (2006) 242:409–20.[CrossRef][Web of Science][Medline]
Beerenwinkel N., Eriksson N., Sturmfels B. Conjunctive Bayesian networks. Bernoulli (2007) 13:893–909.[CrossRef][Web of Science]
Beerenwinkel N., Rahnenführer J., Däumer M., Hoffmann D., Kaiser R., Selbig J., Lengauer T. Learning multiple evolutionary pathways from cross-sectional data. J. Comp. Biol. (2005b) 12:584–98. RECOMB 2004.[CrossRef]
Beerenwinkel N., Rahnenführer J., Kaiser R., Hoffmann D., Selbig J., Lengauer T. Mtreemix: a software package for learning and using mixture models of mutagenetic trees. Bioinformatics (2005c) 21:2106–07.
Boucher C. A., O'Sullivan E., Mulder J. W., Ramautarsing C., Kellam P., Darby G., Lange J. M., Goudsmit J., Larder B. A. Ordered appearance of zidovudine resistance mutations during treatment of 18 human immunodeficiency virus-positive subjects. J. Inf. Dis. (1992) 165:105–10.[Web of Science][Medline]
Brightwell G. R., Winkler P. Counting linear extensions. Order (1991) 8:225–42.[CrossRef]
Deforche K., Silander T., Camacho R., Grossman Z., Soares M. A., Laethem K. V., Kantor R., Moreau Y., Vandamme A.-M., non B. Workgroup. Analysis of HIV-1 pol sequences using Bayesian networks: implications for drug resistance. Bioinformatics (2006) 22:2975–79.
Desper R., Jiang F., Kallioniemi O. P., Moch H., Papadimitriou C. H., Schäffer A. A. Inferring tree models for oncogenesis from comparative genome hybridization data. J. Comp. Biol. (1999) 6:37–51.[CrossRef]
Fearon E. R., Vogelstein B. A genetic model for colorectal tumorigenesis. Cell (1990) 61:759–767.[CrossRef][Web of Science][Medline]
Foulkes A., DeGruttola V. Characterizing the progression of viral mutations over time. J. Am. Statist. Assoc. (2003) 98:859–67.[CrossRef][Web of Science]
Gatenby R. A., Maini P. K. Mathematical oncology: cancer summed up. Nature (2003) 421:321.[CrossRef][Medline]
Hjelm M., Höglund M., Lagergren J. New probabilistic network models and algorithms for oncogenesis. J. Comp. Biol. (2006) 13:853–65.[CrossRef]
Iwasa Y., Michor F., Nowak M. A. Evolutionary dynamics of escape from biomedical intervention. Proc. Biol. Sci. (2003) 270:2573–78.
Johnson V. A., Brun-Vezinet F., Clotet B., Gunthard H. F., Kuritzkes D. R., Pillay D., Schapiro J. M., Richman D. D. Update of the drug resistance mutations in hiv-1: Spring 2008. Top. HIV Med. (2008) 16:62–68.[Medline]
Jones S., Chen W.-D., Parmigiani G., Diehl F., Beerenwinkel N., Antal T., Traulsen A., Nowak M. A., Siegel C., Velculescu V. E., Kinzler K. W., Vogelstein B., Willis J., Markowitz S. D. Comparative lesion sequencing provides insights into tumor evolution. Proc. Nat. Acad. Sci. (2008) 105:4283–88.
Norris J. Markov Chains (1997) Cambridge, UK: Cambridge University Press.
Radmacher M. D., Simon R., Desper R., Taetle R., Schäffer A. A., Nelson M. A. Graph models of oncogenesis with an application to melanoma. J. Theor. Biol. (2001) 212:535–48.[CrossRef][Web of Science][Medline]
Rahnenführer J., Beerenwinkel N., Schulz W. A., Hartmann C., von Deimling A., Wullich B., Lengauer T. Estimating cancer survival and clinical outcome based on genetic tumor progression scores. Bioinformatics (2005) 21:2438–46.
Simon R., Desper R., Papadimitriou C. H., Peng A., Alberts D. S., Taetle R., Trent J. M., Schäffer A. A. Chromosome abnormalities in ovarian adenocarcinoma: Iii. using breakpoint data to infer and test mathematical models for oncogenesis. Genes Chromosomes Cancer (2000) 28:106–20.[CrossRef][Web of Science][Medline]
Stanley R. Enumerative Combinatorics (1999) Cambridge, UK: Cambridge University Press.
von Heydebreck A., Gunawan B., Füzesi L. Maximum likelihood estimation of oncogenetic tree models. Biostatistics (2004) 5:545–56.[Abstract]
Weinreich D. M., Delaney N. F., Depristo M. A., Hartl D. L. Darwinian evolution can follow only very few mutational paths to fitter proteins. Science (2006) 312:111–14.
This article has been cited by other articles:
![]() |
M. Gerstung, M. Baudis, H. Moch, and N. Beerenwinkel Quantifying cancer progression with conjunctive Bayesian networks Bioinformatics, November 1, 2009; 25(21): 2809 - 2815. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||
