Rule-based analysis for detecting epistasis using associative classification mining
dc.contributor.author | Uppu, Suneetha | |
dc.contributor.author | Krishna, Aneesh | |
dc.contributor.author | Gopalan, Raj | |
dc.date.accessioned | 2017-01-30T13:07:35Z | |
dc.date.available | 2017-01-30T13:07:35Z | |
dc.date.created | 2016-02-28T19:30:32Z | |
dc.date.issued | 2015 | |
dc.identifier.citation | Uppu, S. and Krishna, A. and Gopalan, R. 2015. Rule-based analysis for detecting epistasis using associative classification mining. Network Modeling Analysis in Health Informatics and Bioinformatics. 4 (1): Article No. 12. | |
dc.identifier.uri | http://hdl.handle.net/20.500.11937/28822 | |
dc.identifier.doi | 10.1007/s13721-015-0084-3 | |
dc.description.abstract |
The advancements in sequencing high-throughput human genome and computational abilities have tremendously improved the understanding of the genetic architecture behind the complex diseases. The development of high-throughput genotyping and next-generation sequencing technologies enables large-scale data for genetic epidemiological analysis. These advances led to the identification of a number of single nucleotide polymorphisms (SNPs) associated with complex diseases. The interactions between SNPs responsible for disease susceptibility have been increasingly explored in the current literature. These interaction studies are mathematically challenging and computationally complex. These challenges have been addressed by a number of data mining and machine learning approaches. The goal of this research is to implement associative classification and study its effectiveness for detecting the epistasis in balanced and imbalanced datasets. The proposed approach was evaluated for single-locus models to six-locus models using simulated data. The datasets were generated for five different penetrance functions by varying heritability, minor allele frequency and sample size. In total, 57,300 datasets were generated and several experiments conducted to identify the disease causal SNP interactions. The accuracy of classification by the proposed approach was compared with the existing approaches. The experimental results demonstrated significant improvements in accuracy for detecting interactions associated with the phenotype. Further, the approach was successfully applied over sporadic breast cancer data. The results show interaction among six polymorphisms, which included five different estrogen-metabolism genes. | |
dc.title | Rule-based analysis for detecting epistasis using associative classification mining | |
dc.type | Journal Article | |
dcterms.source.volume | 4 | |
dcterms.source.number | 1 | |
dcterms.source.issn | 2192-6662 | |
dcterms.source.title | Network Modeling Analysis in Health Informatics and Bioinformatics | |
curtin.department | Department of Computing | |
curtin.accessStatus | Fulltext not available |
Files in this item
Files | Size | Format | View |
---|---|---|---|
There are no files associated with this item. |