The Library
Inferring causal relations from multivariate time series : a fast method for large-scale gene expression data
Tools
Yuan, Yinyin and Li, Chang-Tsun (2009) Inferring causal relations from multivariate time series : a fast method for large-scale gene expression data. In: BioInformatics and BioEngineering (BIBE), 2009 Ninth IEEE International Conference on, Tʻai-chung shih (Taiwan), 2009 pp. 92-99.
|
PDF
WRAP_Yuan_Inferring_Causal.pdf - Draft Version - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader Download (340Kb) |
Abstract
Various multivariate time series analysis techniques have been developed with the aim of inferring causal relations between time series. Previously, these techniques have proved their effectiveness on economic and neurophysiological data, which normally consist of hundreds of samples. However, in their applications to gene regulatory inference, the small sample size of gene expression time series poses an obstacle. In this paper, we describe some of the most commonly used multivariate inference techniques and show the potential challenge related to gene expression analysis. In response, we propose a directed partial correlation (DPC) algorithm as an efficient and effective solution to causal/regulatory relations inference on small sample gene expression data. Comparative evaluations on the existing techniques and the proposed method are presented. To draw reliable conclusions, a comprehensive benchmarking on data sets of various setups is essential. Three experiments are designed to assess these methods in a coherent manner. Detailed analysis of experimental results not only reveals good accuracy of the proposed DPC method in large-scale prediction, but also gives much insight into all methods under evaluation.
| Item Type: | Conference Item (Paper) |
|---|---|
| Subjects: | Q Science > QA Mathematics Q Science > QH Natural history > QH426 Genetics |
| Divisions: | Faculty of Science > Computer Science |
| Library of Congress Subject Headings (LCSH): | Time-series analysis, Gene expression -- Mathematical models |
| Date: | 2009 |
| Page Range: | pp. 92-99 |
| Identification Number: | 10.1109/BIBE.2009.8 |
| Status: | Peer Reviewed |
| Access rights to Published version: | Open Access |
| Conference Paper Type: | Paper |
| Title of Event: | BioInformatics and BioEngineering (BIBE), 2009 Ninth IEEE International Conference on |
| Type of Event: | Conference |
| Location of Event: | Tʻai-chung shih (Taiwan) |
| Date(s) of Event: | 2009 |
| References: | [1] R. Clarke, H.W.Ressom, A.Wang, J.Xuan,M. C. Liu, E. A. Gehan, and Y. Wang, “The properties of high-dimensional data spaces: implications for exploring gene and protein expression data,” Nature Revolution Cancer, vol. 8, no. 1, pp. 37–49, 2008. [2] A. Bernard and A. J. Hartemink, “Informative structure priors: joint learning of dynamic regulatory networks from multiple types of data.” Proceedings of the Pacific Symposium on Biocomputing, pp. 459–70, 2005. [3] S. Lebre, “Inferring dynamic genetic networks with low order independencies,” 2007. [4] R. Opgen-Rhein and K. Strimmer, “Learning causal networks from systems biology time course data: an effective model selection procedure for the vector autoregressive process,” BMC Bioinformatics, vol. 8, no. Suppl 2, p. S3, 2007. [5] Y. Benjamini and Y. Hochberg, “Controlling the false discovery rate: A practical and powerful approach to multiple testing,” Journal of the Royal Statistical Society, vol. B, no. 57, pp. 289–300, 1995. [6] C. W. J. Granger, “Investigating causal relations by econometric models and cross-spectral methods,” Econometrica, vol. 37, pp. 424–438, 1969. [7] N. D. D. Mukhopadhyay and S. Chatterjee, “Causality and pathway search in microarray time series experiment.” Bioinformatics, 2006. [8] J. Sch¨afer and K. Strimmer, “An empirical Bayes approach to inferring large-scale gene association networks.” Bioinformatics, vol. 21, no. 6, pp. 754–64, 2005. [9] D. Veiga, F. Vicente, M. Grivet, A. de la Fuente, and A. Vasconcelos, “Genome-wide partial correlation analysis of escherichia coli microarray data,” Genet. Mol. Res., vol. 6, no. 4, pp. 730–742, 2007. [10] M. Winterhalder, B. Schelter, W. Hesse, K. Schwab, L. Leistritz, D. Klan, R. Bauer, J. Timmer, and H.Witte, “Comparison of linear signal processing techniques to infer directed interactions in multivariate neural systems,” Signal Processing, vol. In Press, Uncorrected Proof, 2005. [11] E. Pereda, R. Q. Quiroga, and J. Bhattacharya, “Nonlinear multivariate analysis of neurophysiological signals,” 2005. [12] R. Bro, N. D. Sidiropoulos, and A. K. Smilde, “Maximum likelihood fitting using ordinary least squares algorithms,” Journal of Chemometrics, vol. 16, pp. 387–400, 2002. [13] T. Michoel, S. Maere, E. Bonnet, A. Joshi, Y. Saeys, T. Van den Bulcke, K. Van Leemput, P. van Remortel, M. Kuiper, K. Marchal, and Y. Van de Peer, “Validating module network learning algorithms using simulated data,” BMC Bioinformatics, vol. 8, no. Suppl 2, 2007. [14] J. Egan, Signal detection theory and ROC analysis, Series in Cognition and Perception. New York, NY, USA: Academic Press, 1975. [15] T. Fawcett, “An introduction to ROC analysis,” Pattern Recogn. Lett., vol. 27, no. 8, pp. 861–874, 2006. [16] C. J. V. Rijsbergen, Information Retrieval. Newton,MA,USA: Butterworth-Heinemann, 1979. [17] T. V. den Bulcke, K. V. Leemput, B. Naudts, P. van Remortel, H. Ma, A. Verschoren, B. D. Moor, and K. Marchal, “SynTReN: a generator of synthetic gene expression data for design and analysis of structure learning algorithms.” BMC Bioinformatics, vol. 7, no. 1, p. 43, 2006. |
| URI: | http://wrap.warwick.ac.uk/id/eprint/3363 |
Data sourced from Thomson Reuters' Web of Knowledge
Actions (login required)
![]() |
View Item |
Tools
Tools

