Evaluation of Three Feature Dimension Reduction Techniques for Machine Learning-Based Crop Yield Prediction Models
Citation
Source Title
ISSN
Faculty
School
Collection
Abstract
Machine learning (ML) has been widely used worldwide to develop crop yield forecasting models. However, it is still challenging to identify the most critical features from a dataset. Although either feature selection (FS) or feature extraction (FX) techniques have been employed, no research compares their performances and, more importantly, the benefits of combining both methods. Therefore, this paper proposes a framework that uses non-feature reduction (All-F) as a baseline to investigate the performance of FS, FX, and a combination of both (FSX). The case study employs the vegetation condition index (VCI)/temperature condition index (TCI) to develop 21 rice yield forecasting models for eight sub-regions in Vietnam based on ML methods, namely linear, support vector machine (SVM), decision tree (Tree), artificial neural network (ANN), and Ensemble. The results reveal that FSX takes full advantage of the FS and FX, leading FSX-based models to perform the best in 18 out of 21 models, while 2 (1) for FS-based (FX-based) models. These FXS-, FS-, and FX-based models improve All-F-based models at an average level of 21% and up to 60% in terms of RMSE. Furthermore, 21 of the best models are developed based on Ensemble (13 models), Tree (6 models), linear (1 model), and ANN (1 model). These findings highlight the significant role of FS, FX, and specially FSX coupled with a wide range of ML algorithms (especially Ensemble) for enhancing the accuracy of predicting crop yield.
Related items
Showing items related by title, author, creator and subject.
-
Mostafa, Fahed. (2011)Market risk refers to the potential loss that can be incurred as a result of movements inmarket factors. Capturing and measuring these factors are crucial in understanding andevaluating the risk exposure associated with ...
-
Ting, Huey Tze (2013)Not until recently did we see an enormous surge of interest in the study of machining of advanced ceramics. This has resulted in significant advances lately in their development and usage. Machinable glass ceramics, ...
-
Chan, Kit; Khadem, Saghar; Dillon, Tharam; Palade, Vasile; Singh, Jaipal; Chang, Elizabeth (2012)Over the past two decades, neural networks have been applied to develop short-term traffic flow predictors. The past traffic flow data, captured by on-road sensors, is used as input patterns of neural networks to forecast ...