Characterization and prediction of issue-related risks in software projects
dc.contributor.author | Choetkiertikul, M. | |
dc.contributor.author | Dam, H. | |
dc.contributor.author | Tran, The Truyen | |
dc.contributor.author | Ghose, A. | |
dc.date.accessioned | 2017-01-30T11:57:48Z | |
dc.date.available | 2017-01-30T11:57:48Z | |
dc.date.created | 2016-02-28T19:30:32Z | |
dc.date.issued | 2015 | |
dc.identifier.citation | Choetkiertikul, M. and Dam, H. and Tran, T.T. and Ghose, A. 2015. Characterization and prediction of issue-related risks in software projects, pp. 280-291. | |
dc.identifier.uri | http://hdl.handle.net/20.500.11937/16791 | |
dc.identifier.doi | 10.1109/MSR.2015.33 | |
dc.description.abstract |
© 2015 IEEE. Identifying risks relevant to a software project and planning measures to deal with them are critical to the success of the project. Current practices in risk assessment mostly rely on high-level, generic guidance or the subjective judgements of experts. In this paper, we propose a novel approach to risk assessment using historical data associated with a software project. Specifically, our approach identifies patterns of past events that caused project delays, and uses this knowledge to identify risks in the current state of the project. A set of risk factors characterizing 'risky' software tasks (in the form of issues) were extracted from five open source projects: Apache, Duraspace, JBoss, Moodle, and Spring. In addition, we performed feature selection using a sparse logistic regression model to select risk factors with good discriminative power. Based on these risk factors, we built predictive models to predict if an issue will cause a project delay. Our predictive models are able to predict both the risk impact (i.e. The extend of the delay) and the likelihood of a risk occurring. The evaluation results demonstrate the effectiveness of our predictive models, achieving on average 48% - 81% precision, 23% - 90% recall, 29% - 71% F-measure, and 70% - 92% Area Under the ROC Curve. Our predictive models also have low error rates: 0.39 - 0.75 for Macro-averaged Mean Cost-Error and 0.7 - 1.2 for Macro-averaged Mean Absolute Error. | |
dc.title | Characterization and prediction of issue-related risks in software projects | |
dc.type | Conference Paper | |
dcterms.source.volume | 2015-August | |
dcterms.source.startPage | 280 | |
dcterms.source.endPage | 291 | |
dcterms.source.title | IEEE International Working Conference on Mining Software Repositories | |
dcterms.source.series | IEEE International Working Conference on Mining Software Repositories | |
dcterms.source.isbn | 9780769555942 | |
curtin.department | Multi-Sensor Proc & Content Analysis Institute | |
curtin.accessStatus | Fulltext not available |
Files in this item
Files | Size | Format | View |
---|---|---|---|
There are no files associated with this item. |