Duplicate bug report detection using clustering
dc.contributor.author | Gopalan, Raj | |
dc.contributor.author | Krishna, Aneesh | |
dc.contributor.editor | Jim Steel | |
dc.contributor.editor | Liming Zhu | |
dc.date.accessioned | 2017-01-30T11:23:11Z | |
dc.date.available | 2017-01-30T11:23:11Z | |
dc.date.created | 2014-02-03T20:02:09Z | |
dc.date.issued | 2014 | |
dc.identifier.citation | Gopalan, Raj P. and Krishna, Aneesh. 2014. Duplicate bug report detection using clustering, in Steel, J. and Zhu, L. (ed), 23rd Australasian Software Engineering Conference, Apr 7-10 2014, pp. 104-109. Sydney, NSW: IEEE. | |
dc.identifier.uri | http://hdl.handle.net/20.500.11937/11144 | |
dc.identifier.doi | 10.1109/ASWEC.2014.31 | |
dc.description.abstract |
Bug reporting and fixing the reported bugs play a critical part in the development and maintenance of software systems. The software developers and end users can collaborate in this process to improve the reliability of software systems. Various end users report the defects they have found in the software and how these bugs affect them. However, the same defect may be reported independently by several users leading to a significant number of duplicate bug reports. There are a number of existing methods for detecting duplicate bug reports, but the best results so far account for only 24% of actual duplicates. In this paper, we propose a new method based on clustering to identify a larger proportion of duplicate bug reports while keeping the false positives of misidentified non-duplicates low. The proposed approach is experimentally evaluated on a large sample of bug reports from three public domain data sets. The results show that this approach achieves better performance in terms of a harmonic measure that combines true positive and true negative rates when compared to the existing methods. | |
dc.publisher | IEEE | |
dc.subject | bug report | |
dc.subject | duplicate detection | |
dc.subject | Bugzilla | |
dc.subject | clustering | |
dc.title | Duplicate bug report detection using clustering | |
dc.type | Conference Paper | |
dcterms.source.title | Proceedings of 23rd Australasian Software Engineering Conference | |
dcterms.source.series | Proceedings of 23rd Australasian Software Engineering Conference | |
dcterms.source.conference | 23rd Australasian Software Engineering Conference | |
dcterms.source.conference-start-date | Apr 7 2014 | |
dcterms.source.conferencelocation | Sydney | |
dcterms.source.place | New Jersy | |
curtin.note |
Copyright © 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. | |
curtin.department | ||
curtin.accessStatus | Open access |