The Library
Bayesian nonparametric hidden Markov models with application to the analysis of copy-number-variation in mammalian genomes
Tools
Yau, C., Papaspiliopoulos, Omiros, Roberts, Gareth O. and Holmes, Christopher (2009) Bayesian nonparametric hidden Markov models with application to the analysis of copy-number-variation in mammalian genomes. Working Paper. Coventry: University of Warwick. Centre for Research in Statistical Methodology. (Working papers, Vol.2009).
|
PDF
WRAP_Yau_09-12w.pdf - Published Version - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader Download (3055Kb) |
Official URL: http://www2.warwick.ac.uk/fac/sci/statistics/crism...
Abstract
We consider the development of Bayesian Nonparametric methods for product partition models such as Hidden Markov Models and change point models. Our approach uses a Mixture of Dirichlet Process (MDP) model for the unknown sampling distribution (likelihood) for the observations arising in each state and a computationally e±cient data augmentation scheme to aid inference. The method uses novel MCMC methodology which combines recent retrospective sampling methods with the use of slice sampler variables. The methodology is computationally efficient, both in terms of MCMC mixing properties, and robustness to the length of the time series being investigated. Moreover, the method is easy to implement requiring little or no user-interaction. We apply our methodology to the analysis of genomic copy number variation.
| Item Type: | Working or Discussion Paper (Working Paper) |
|---|---|
| Subjects: | Q Science > QA Mathematics |
| Divisions: | Faculty of Science > Statistics |
| Library of Congress Subject Headings (LCSH): | Sampling (Statistics), Regression analysis, Nonparametric statistics |
| Series Name: | Working papers |
| Publisher: | University of Warwick. Centre for Research in Statistical Methodology |
| Place of Publication: | Coventry |
| Date: | 2009 |
| Volume: | Vol.2009 |
| Number: | No.12 |
| Number of Pages: | 26 |
| Status: | Not Peer Reviewed |
| Access rights to Published version: | Open Access |
| References: | Andersson, R., Bruder, C. E. G., Piotrowski, A., Menzel, U., Nord, H., Sandgren, J., Hvidsten, T. R., de Sthl, T. D., Dumanski, J. P. & Komorowski, J. (2008). A segmental maximum a posteriori approach to genome-wide copy number pro¯ling. Bioinformatics 24 751{ 758. URL http://dx.doi.org/10.1093/bioinformatics/btn003. Antoniak, C. E. (1974). Mixtures of Dirichlet processes with applications to bayesian nonpara- metric problems. Ann. Statist. 2, 1152{74. Barry, D. & Hartigan, J. A. (1992). Product partition models for change point models. Annals of Statistics 20 260{279. Baum, L. E. (1966). Statistical inference for probabilistic functions of ¯nite state space markov chains. Annals of Mathematical Statistics 37 1554{1563. B.Dunson, D. (2005). Bayesian semiparametric isotonic regression for count data. Journal of the American Statistical Association 100 618{627. Burr, D. & Doss, H. (2005). A Bayesian semiparametric model for random-e®ects meta-analysis. J. Am. Statist. Assoc. 100, 242{51. Cappe, ., Moulines, E. & Ryden, T. (2005). Inference in Hidden Markov Models. Springer. Colella, S., Yau, C., Taylor, J. M., Mirza, G., Butler, H., Clouston, P., Bassett, A. S., Seller, A., Holmes, C. C. & Ragoussis, J. (2007). Quantisnp: an objective bayes hidden-markov model to detect and accurately map copy number variation using snp genotyping data. Nucleic Acids Res 35 2013{2025. URL http://dx.doi.org/10.1093/nar/gkm076. Devroye, L. (1986). Non-Uniform Random Variate Generation. Springer-Verlag. Dunson, D. B., Pillai, N. & Park, J. H. (2007). Bayesian density regression. Journal of the Royal Statistical Society, Series B 69 163{183. Escobar, M. (1988). Estimating the means of several normal populations by nonparametric estimation of the distribution of the means. PhD Dissertation, Department of Statistics, Yale University. Escobar, M. D. & West, M. (1995). Bayesian density estimation and inference using mixtures. J. Am. Statist. Assoc 90 577{88. Gelfand, A. & Kottas, A. (2003). Bayesian semiparametric regression for median residual life. Scand. J. Statist. 30, 651{65. Green, P. & Richardson, S. (2001). Modelling heterogeneity with and without the Dirichlet process. Scand. J. Statist. 28 355{75. Griffin, J. & Steel, M. J. F. (2004). Semiparametric bayesian inference for stochastic frontier models. J. Econometrics 123 121{152. Griffin, J. & Steel, M. J. F. (2007). Bayesian non-parametric modelling with the Dirichlet process regression smoother. CRiSM technical report 07-05. Hu, J., Gao, J.-B., Cao, Y., Bottinger, E. & Zhang, W. (2007). Exploiting noise in array cgh data to improve detection of dna copy number change. Nucleic Acids Res 35 e35. URL http://dx.doi.org/10.1093/nar/gkl730. Ishwaran, H. & James, L. (2001). Gibbs sampling methods for stick-breaking priors. J. Am. Statist. Assoc. 96, 161{73. Ishwaran, H. & James, L. F. (2003). Some further developments for stick-breaking priors: ¯nite and in¯nite clustering and classi¯cation. Sankhy¹a, A, 65, 577{92. Ishwaran, H. & Zarepour, M. (2000). Markov chain Monte Carlo in approximate Dirichlet and beta two-parameter process hierarchical models. Biometrika 87, 371{90. Jain, S. & Neal, R. M. (2004). A split-merge Markov chain Monte Carlo procedure for the Dirichlet process mixture model. J. Comp. Graph. Statist. 13, 158{82. Lakshmi, B., Hall, I. M., Egan, C., Alexander, J., Leotta, A., Healy, J., Zender, L., Spector, M. S., Xue, W., Lowe, S. W., Wigler, M. & Lucito, R. (2006). Mouse genomic representational oligonucleotide microarray analysis: detection of copy number vari- ations in normal and tumor specimens. Proc Natl Acad Sci U S A 103 11234{11239. URL http://dx.doi.org/10.1073/pnas.0602984103. Liu, J. S. (1996). Nonparametric hierarchical Bayes via sequential imputations. Ann. Statist. 24, 911{30. MacEachern, S. & MÄuller, P. (1998). Estimating mixture of Dirichlet process models. J. Comp. Graph. Statist. 7, 223{38. Marioni, J. C., Thorne, N. P. & Tavar, S. (2006). Biohmm: a heterogeneous hid- den markov model for segmenting array cgh data. Bioinformatics 22 1144{1146. URL http://dx.doi.org/10.1093/bioinformatics/btl089. Muliere, P. & Tardella, L. (1998). Approximating distributions of random functionals of Ferguson-Dirichlet priors. Can. J. Statist. 26, 283{97. MÄuller, P., Erkanli, A. & West, M. (1996). Bayesian curve ¯tting using multivariate normal mixtures. Biometrika 83, 67{79. MÄuller, P., Rosner, G. L., De Iorio, M. & MacEachern, S. (2005). A nonparametric Bayesian model for inference in related longitudinal studies. Appl. Statist. 54, 611{26. Neal, R. (2000). Markov chain sampling: Methods for Dirichlet process mixture models. J. Comp. Graph. Statist. 9, 283{97. Papaspiliopoulos, O. & Roberts, G. O. (2008). Retrospective markov chain monte carlo for dirichlet process hierarchical models. Biometrika 95 169{186. Quintana, F. & Iglesias, P. (2003). Bayesian clustering and product partition models. J. Roy. Statist. Soc. B 65, 557{574. Rodriguez, A., B.Dunson, D. & Gelfand, A. E. (2008). The nested dirichlet process. Journal of the American Statistical Association 103 1131{1144. Scott, S. (2002). Bayesian Methods for Hidden Markov Models: Recursive Computing in the 21st Century. Journal of the American Statistical Association 97 337{351. Shah, S. P., Xuan, X., DeLeeuw, R. J., Khojasteh, M., Lam, W. L., Ng, R. & Murphy, K. P. (2006). Integrating copy number polymorphisms into ar- ray cgh analysis using a robust hmm. Bioinformatics 22 e431{e439. URL http://dx.doi.org/10.1093/bioinformatics/btl238. Stjernqvist, S., Rydn, T., Skld, M. & Staaf, J. (2007). Continuous-index hidden markov modelling of array cgh copy number data. Bioinformatics 23 1006{1014. URL http://dx.doi.org/10.1093/bioinformatics/btm059. Teh, Y., Jordan, M., Beal, M. & Blei, D. (2006). Hierarchical dirichlet processes. to appear in J. Amer. Statist. Assoc., available from http://www.cs.princeton.edu/ blei/papers/TehJordanBealBlei2006.pdf. Walker, S. (2007). Sampling the dirichlet mixture model with slices. Comm. Statist. Sim. Comput. 36 45{54. |
| URI: | http://wrap.warwick.ac.uk/id/eprint/35201 |
Actions (login required)
![]() |
View Item |
Tools
Tools

