A model to predict ordinal suitability using sparse and uncertain data
MetadataShow full item record
We describe the development of the algorithms that comprise the Spatial Decision Support System (SDSS) CaNaSTA (Crop Niche Selection in Tropical Agriculture). The system was designed to assist farmers and agricultural advisors in the tropics to make crop suitability decisions. These decisions are frequently made in highly diverse biophysical and socioeconomic environments and must often rely on sparse datasets. The field trial datasets that provide a knowledge base for SDSS such as this are characterised by ordinal response variables. Our approach has been to apply Bayes’ formula as a prediction model. This paper does not describe the entire CaNaSTA system, but rather concentrates on the algorithm of the central prediction model. The algorithm is tested using a simulated dataset to compare results with ordinal regression, and to test the stability of the model with increasingly sparse calibration data. For all but the richest input datasets it outperforms ordinal regression, as determined using Cohen’s weighted kappa. The model also performs well with sparse datasets. Whilst this is not as conclusive as testing with real world data, the results are encouraging.
Showing items related by title, author, creator and subject.
Buzzacott, Peter; Lambrechts, K.; Mazur, A.; Wang, Q.; Papadopoulou, V.; Theron, M.; Balestra, C.; Guerrero, F. (2014)© 2014 Elsevier Ltd. Background: Decompression sickness (DCS) in rats is commonly modelled as a binary outcome. The present study aimed to develop a ternary model of predicting probability of DCS in rats, (as no-DCS, ...
Tran, The Truyen; Phung, D.; Luo, W.; Venkatesh, S. (2014)The recent wide adoption of electronic medical records (EMRs) presents great opportunities and challenges for data mining. The EMR data are largely temporal, often noisy, irregular and high dimensional. This paper constructs ...
Jiang, Zhenyu (2011)Genomics is a major scientific revolution in this century. High-throughput genomic data provides an opportunity for identifying genes and SNPs (singlenucleotide polymorphism) that are related to various clinical phenotypes. ...