Analysis of Schizophrenia Data Using A Nonlinear Threshold Index Logistic Model
MetadataShow full item record
Genetic information, such as single nucleotide polymorphism (SNP) data, has been widely recognized as useful in prediction of disease risk. However, how to model the genetic data that is often categorical in disease class prediction is complex and challenging. In this paper, we propose a novel class of nonlinear threshold index logistic models to deal with the complex, nonlinear effects of categorical/discrete SNP covariates for Schizophrenia class prediction. A maximum likelihood methodology is suggested to estimate the unknown parameters in the models. Simulation studies demonstrate that the proposed methodology works viably well for moderate-size samples. The suggested approach is therefore applied to the analysis of the Schizophrenia classification by using a real set of SNP data from Western Australian Family Study of Schizophrenia (WAFSS). Our empirical findings provide evidence that the proposed nonlinear models well outperform the widely used linear and tree based logistic regression models in class prediction of schizophrenia risk with SNP data in terms of both Types I/II error rates and ROC curves.
This article is published under the Open Access publishing model and distributed under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/. Please refer to the licence to obtain terms for any further reuse or distribution of this work.
Showing items related by title, author, creator and subject.
Jiang, Zhenyu (2011)Genomics is a major scientific revolution in this century. High-throughput genomic data provides an opportunity for identifying genes and SNPs (singlenucleotide polymorphism) that are related to various clinical phenotypes. ...
Amiri, Amirpiran (2013)The alumina industry provides the feedstock for aluminium metal production and contributes to around A$6 billion of Australian exports annually. One of the most energy-intensive parts of alumina production, with a strong ...
Ghanem, Amal Saleh (2009)Most data mining and pattern recognition techniques are designed for learning from at data files with the assumption of equal populations per class. However, most real-world data are stored as rich relational databases ...