A novel approach to simulate gene-environment interactions in complex diseases
Amato, R. (Roberto), Pinelli, Michele, D’Andrea, Daniel, Miele, Gennaro, Nicodemi, Mario, Raiconi, Giancarlo and Cocozza, Sergio. (2010) A novel approach to simulate gene-environment interactions in complex diseases. BMC Bioinformatics, Vol.11 (Article 8). ISSN 1471-2105
Official URL: http://dx.doi.org/10.1186/1471-2105-11-8
Background: Complex diseases are multifactorial traits caused by both genetic and environmental factors. They make up the major part of human diseases and include those with the largest prevalence and mortality (cancer, heart disease, obesity, etc.). Although a large amount of information has been collected about both genetic and environmental risk factors, there are few examples of studies on their interactions in the epidemiological literature. One reason may be the incomplete knowledge of the power of statistical methods designed to search for risk factors and their interactions in these data sets. An improvement in this direction would lead to a better understanding and description of gene-environment interactions. To this end, a possible strategy is to test the different statistical methods against data sets where the underlying phenomenon is completely known and fully controllable, such as simulated ones.
Results: We present a mathematical approach that models gene-environment interactions. With this method it is possible to generate simulated populations having gene-environment interactions of any form, involving any number of genetic and environmental factors and also allowing non-linear interactions such as epistasis. In particular, we implemented a simple version of this model in a Gene-Environment iNteraction Simulator (GENS), a tool designed to simulate case-control data sets where a one-gene, one-environment interaction influences the disease risk. The main aim has been to allow the input of population characteristics by using standard epidemiological measures and to implement constraints that make the simulator's behaviour biologically meaningful.
Conclusions: With the multi-logistic model implemented in GENS it is possible to simulate case-control samples of complex diseases where gene-environment interactions influence the disease risk. The user has full control of the main characteristics of the simulated population, and a Monte Carlo process introduces random variability. A knowledge-based approach reduces the complexity of the mathematical model by using reasonable biological constraints and makes the simulation more understandable in biological terms. Simulated data sets can be used for the assessment of novel statistical methods or for the evaluation of statistical power when designing a study.
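As an illustration of the general idea only, the following minimal Python sketch simulates a case-control sample in which one gene and one environmental exposure interact. It is not the GENS implementation: it uses a plain binary logistic model rather than the multi-logistic model described in the paper, and it takes raw regression coefficients as input rather than standard epidemiological measures. The function names, the parameter names (`beta0`, `beta_g`, `beta_e`, `beta_ge`), the Hardy-Weinberg genotype draw and the standard normal exposure are all assumptions made for this example.

```python
import math
import random


def simulate_population(n, allele_freq, beta0, beta_g, beta_e, beta_ge, seed=0):
    """Return a list of (genotype, exposure, diseased) tuples.

    Hypothetical sketch: a binary logistic risk model with one
    gene-environment interaction term, not the model used in GENS.
    """
    rng = random.Random(seed)
    population = []
    for _ in range(n):
        # Genotype = number of risk alleles (0, 1 or 2); each allele is
        # drawn independently, which assumes Hardy-Weinberg equilibrium.
        genotype = sum(rng.random() < allele_freq for _ in range(2))
        # Environmental exposure: standard normal variable (assumption).
        exposure = rng.gauss(0.0, 1.0)
        # Log-odds of disease, including the interaction term.
        logit = (beta0 + beta_g * genotype + beta_e * exposure
                 + beta_ge * genotype * exposure)
        risk = 1.0 / (1.0 + math.exp(-logit))
        # Monte Carlo step: disease status is a Bernoulli draw from the risk.
        diseased = rng.random() < risk
        population.append((genotype, exposure, diseased))
    return population


def sample_case_control(population, n_cases, n_controls, seed=0):
    """Draw cases and controls without replacement from a population."""
    rng = random.Random(seed)
    cases = [person for person in population if person[2]]
    controls = [person for person in population if not person[2]]
    return rng.sample(cases, n_cases), rng.sample(controls, n_controls)


if __name__ == "__main__":
    population = simulate_population(20000, allele_freq=0.3, beta0=-3.0,
                                     beta_g=0.3, beta_e=0.4, beta_ge=0.5)
    cases, controls = sample_case_control(population, 200, 200)
    print(len(cases), "cases and", len(controls), "controls sampled")
```

Because the generating model is fully known, a data set produced this way can serve as a benchmark: a statistical method under evaluation should recover the interaction term when `beta_ge` is non-zero and should not report one when it is set to zero.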
|Item Type:||Journal Article|
|Subjects:||R Medicine > RA Public aspects of medicine|
|Divisions:||Faculty of Science > Physics|
|Library of Congress Subject Headings (LCSH):||Medical genetics -- Data processing, Environmentally induced diseases -- Data processing, Case-control method, Simulation methods, Monte Carlo method|
|Journal or Publication Title:||BMC Bioinformatics|
|Publisher:||BioMed Central Ltd.|
|Official Date:||5 January 2010|
|Access rights to Published version:||Open Access|
|Funder:||Università di Napoli (UdN)|