Skip to content Skip to navigation
University of Warwick
  • Study
  • |
  • Research
  • |
  • Business
  • |
  • Alumni
  • |
  • News
  • |
  • About

University of Warwick
Publications service & WRAP

Highlight your research

  • WRAP
    • Home
    • Search WRAP
    • Browse by Warwick Author
    • Browse WRAP by Year
    • Browse WRAP by Subject
    • Browse WRAP by Department
    • Browse WRAP by Funder
    • Browse Theses by Department
  • Publications Service
    • Home
    • Search Publications Service
    • Browse by Warwick Author
    • Browse Publications service by Year
    • Browse Publications service by Subject
    • Browse Publications service by Department
    • Browse Publications service by Funder
  • Statistics
  • Help & Advice
University of Warwick

The Library

  • Login

Transcriptional programs : modelling higher order structure in transcriptional control

Tools
- Tools
+ Tools

Reid, J. E. (John E.), Ott, Sascha and Wernisch, Lorenz. (2009) Transcriptional programs : modelling higher order structure in transcriptional control. BMC Bioinformatics, Vol.10 . Article no. 218. ISSN 1471-2105

[img]
Preview
PDF
WRAP_Ott_1471-2105-10-218.pdf - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader

Download (526Kb)
Official URL: http://dx.doi.org/10.1186/1471-2105-10-218

Abstract

Background: Transcriptional regulation is an important part of regulatory control in eukaryotes. Even if binding motifs for transcription factors are known, the task of finding binding sites by scanning sequences is plagued by false positives. One way to improve the detection of binding sites from motifs is by taking cooperativity of transcription factor binding into account. We propose a non-parametric probabilistic model, similar to a document topic model, for detecting transcriptional programs, groups of cooperative transcription factors and co-regulated genes. The analysis results in transcriptional programs which generalise both transcriptional modules and TF-target gene incidence matrices and provide a higher-level summary of these structures. The method is independent of prior specification of training sets of genes, for example, via gene expression data. The analysis is based on known binding motifs. Results: We applied our method to putative regulatory regions of 18,445 Mus musculus genes. We discovered just 68 transcriptional programs that effectively summarised the action of 149 transcription factors on these genes. Several of these programs were significantly enriched for known biological processes and signalling pathways. One transcriptional program has a significant overlap with a reference set of cell cycle specific transcription factors. Conclusion: Our method is able to pick out higher order structure from noisy sequence analyses. The transcriptional programs it identifies potentially represent common mechanisms of regulatory control across the genome. It simultaneously predicts which genes are co-regulated and which sets of transcription factors cooperate to achieve this co-regulation. The programs we discovered enable biologists to choose new genes and transcription factors to study in specific transcriptional regulatory systems.

Item Type: Journal Article
Subjects: Q Science > QH Natural history > QH426 Genetics
Divisions: Faculty of Science > Centre for Systems Biology
Library of Congress Subject Headings (LCSH): Genetic transcription -- Research, Eukaryotic cells, Binding sites (Biochemistry), Genetics -- Mathematical models
Journal or Publication Title: BMC Bioinformatics
Publisher: BioMed Central Ltd.
ISSN: 1471-2105
Date: 16 July 2009
Volume: Vol.10
Page Range: Article no. 218
Identification Number: 10.1186/1471-2105-10-218
Status: Peer Reviewed
Access rights to Published version: Open Access
References: 1. Ingham PW: The molecular genetics of embryonic pattern formation in Drosophila. Nature 1988, 335(6185):25-34. 2. Arnosti DN, Kulkarni MM: Transcriptional enhancers: Intelligent enhanceosomes or exible billboards? J Cell Biochem 2005, 94(5):890-898. 3. Simmons DM, Voss JW, Ingraham HA, Holloway JM, Broide RS, Rosenfeld MG, Swanson LW: Pituitary cell phenotypes involve cell-specific Pit-1 mRNA translation and synergistic interactions with other classes of transcription factors. Genes Dev 1990, 4(5):695-711. 4. Matys V, Fricke E, Geffers R, Gössling E, Haubrock M, Hehl R, Hornischer K, Karas D, Kel AE, Kel-Margoulis OV, Kloos DU, Land S, Lewicki-Potapov B, Michael H, Munch R, Reuter I, Rotert S, Saxel H, Scheer M, Thiele S, Wingender E: TRANSFAC: transcriptional regulation, from patterns to profiles. Nucleic Acids Res 2003, 31:374-378. 5. Sandelin A, Alkema W, Engström P, Wasserman WW, Lenhard B: JASPAR: an open-access database for eukaryotic transcription factor binding profiles. Nucleic Acids Res 2004:D91-D94. 6. Teh YW, Jordan MI, Beal MJ, Blei DM: Hierarchical Dirichlet Processes. Journal of the American Statistical Association 2006, 101(476):1566-1581. 7. Sharan R, Ben-Hur A, Loots GG, Ovcharenko I: CREME: Cis-Regulatory Module Explorer for the human genome. Nucleic Acids Res 2004:W253-W256. 8. Ho Sui SJ, Fulton DL, Arenillas DJ, Kwon AT, Wasserman WW: oPOSSUM: integrated tools for analysis of regulatory motif over-representation. Nucleic Acids Res 2007:W245-W252. 9. Kreiman G: Identification of sparsely distributed clusters of cis-regulatory elements in sets of co-expressed genes. Nucleic Acids Res 2004, 32:2889-2900. 10. Singh LN, Wang LS, Hannenhalli S: TREMOR – a tool for retrieving transcriptional modules by incorporating motif covariance. Nucleic Acids Res 2007, 35(21):7360-7371. 11. Lemmens K, Dhollander T, Bie TD, Monsieurs P, Engelen K, Smets B, Winderickx J, Moor BD, Marchal K: Inferring transcriptional modules from ChIP-chip, motif and microarray data. Genome Biol 2006, 7(5):R37. 12. Chen G, Jensen ST, Stoeckert CJ: Clustering of genes into regulons using integrated modeling-COGRIM. Genome Biol 2007, 8:R4. 13. Jensen ST, Chen G, Stoeckert CJ: Bayesian variable selection and data integration for biological regulatory networks. ANNALS OF APPLIED STATISTICS 2007, 1:612. 14. Tanay A, Sharan R, Kupiec M, Shamir R: Revealing modularity and organization in the yeast molecular network by integrated analysis of highly heterogeneous genomewide data. Proc Natl Acad Sci USA 2004, 101(9):2981-2986. 15. Blüthgen N, Kie lbasa SM, Herzel H: Inferring combinatorial regulation of transcription in silico. Nucleic Acids Res 2005, 33:272-279. 16. Frith MC, Li MC, Weng Z: Cluster-Buster: Finding dense clusters of motifs in DNA sequences. Nucleic Acids Res 2003, 31(13):3666-3668. 17. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel- Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 2000, 25:25-29. 18. Wu WS, Li WH, Chen BS: Computational reconstruction of transcriptional regulatory modules of the yeast cell cycle. BMC Bioinformatics 2006, 7:421. 19. Segal E, Yelensky R, Koller D: Genome-wide discovery of transcriptional modules from DNA sequence and gene expression. Bioinformatics 2003, 19(Suppl 1):i273-i282. 20. Xu X, Wang L, Ding D: Learning module networks from genome-wide location and expression data. FEBS Lett 2004, 578(3):297-304. 21. Segal E, Shapira M, Regev A, Pe'er D, Botstein D, Koller D, Friedman N: Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data. Nat Genet 2003, 34(2):166-176. 22. Gerber GK, Dowell RD, Jaakkola TS, Gifford DK: Automated discovery of functional generality of human gene expression programs. PLoS Comput Biol 2007, 3(8):e148. 23. Liu X, Jessen WJ, Sivaganesan S, Aronow BJ, Medvedovic M: Bayesian hierarchical model for transcriptional module discovery by jointly modeling gene expression and ChIP-chip data. BMC Bioinformatics 2007, 8:283. 24. Kanehisa M, Goto S, Hattori M, Aoki-Kinoshita KF, Itoh M, Kawashima S, Katayama T, Araki M, Hirakawa M: From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Res 2006:D354-D357. 25. Su AI, Cooke MP, Ching KA, Hakak Y, Walker JR, Wiltshire T, Orth AP, Vega RG, Sapinoso LM, Moqrich A, Patapoutian A, Hampton GM, Schultz PG, Hogenesch JB: Large-scale analysis of the human and mouse transcriptomes. Proc Natl Acad Sci USA 2002, 99(7):4465-4470. 26. Alexa A, Rahnenführer J, Lengauer T: Improved scoring of functional groups from gene expression data by decorrelating GO graph structure. Bioinformatics 2006, 22(13):1600-1607. 27. Elkon R, Linhart C, Sharan R, Shamir R, Shiloh Y: Genome-wide in silico identification of transcriptional regulators controlling the cell cycle in human cells. Genome Res 2003, 13(5):773-780. 28. Lichtlen P, Wang Y, Belser T, Georgiev O, Certa U, Sack R, Schaffner W: Target gene search for the metal-responsive transcription factor MTF-1. Nucleic Acids Res 2001, 29(7):1514-1523. 29. Joshi B, Ordonez-Ercan D, Dasgupta P, Chellappan S: Induction of human metallothionein 1G promoter by VEGF and heavy metals: differential involvement of E2F and metal transcription factors. Oncogene 2005, 24(13):2204-2217. 30. Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, Haussler D: The human genome browser at UCSC. Genome Res 2002, 12(6):996-1006. 31. Hubbard TJ, Aken BL, Beal K, Ballester B, Caccamo M, Chen Y, Clarke L, Coates G, Cunningham F, Cutts T, Down T, Dyer SC, Fitzgerald S, Fernandez-Banet J, Graf S, Haider S, Hammond M, Herrero J, Holland R, Howe K, Howe K, Johnson N, Kahari A, Keefe D, Kokocinski F, Kulesha E, Lawson D, Longden I, Melsopp C, Megy K, Meidl P, Ouverdin B, Parker A, Prlic A, Rice S, Rios D, Schuster M, Sealy I, Severin J, Slater G, Smedley D, Spudich G, Trevanion S, Vilella A, Vogel J, White S, Wood M, Cox T, Curwen V, Durbin R, Fernandez-Suarez XM, Flicek P, Kasprzyk A, Proctor G, Searle S, Smith J, Ureta-Vidal A, Birney E: Ensembl 2007. Nucleic Acids Res 2007:D610-D617. 32. Teh YW, Kurihara K, Welling M: Collapsed Variational Inference for HDP. Advances in Neural Information Processing Systems 2008, 20:. 33. Sethuraman J: A constructive definition of Dirichlet priors. Statistica Sinica 1994, 4:639-650.
URI: http://wrap.warwick.ac.uk/id/eprint/2173

Request changes to a record

Actions (login required)

View Item View Item

Document Downloads

More statistics for this item...
twitter

Email us: publications@warwick.ac.uk
Contact Details
About Us