Abstract
The manual routing of bug reports to specialized expertteams is a time-consuming and expensive process. In this paper,we investigated how this process can be automated by trainingdeep networks and state-of-the-art classifiers from thousands ofreal bug reports from a software company. Different combinationsof the natural language processing methodslemmatization,postagger,bigramandstopword removalwere evaluated in theclassification algorithmsLinear Support Vector Machines(SVMs),multinomial naive Bayes, andLong Short Term Memory(LSTM)networks. For feature processing we used theTerm Frequency-Inverse Document Frequency(TF-IDF) method. Best results wereobtained with a combination of thebigrammethod and linearSVMs. Similar prediction performance values were observedwith LSTM networks that however promise to improve furtherwith larger datasets. The bug triage tool was implemented in amicroservice architecture using docker containers which allowsfor extending individual components and simplifies applicationsto other text classification problems.
Original language | English |
---|---|
Number of pages | 6 |
Publication status | Published - 01.10.2018 |
Event | The Tenth International Conference on Advances in System Testing and Validation Lifecycle - Nice, France Duration: 14.10.2018 → 18.10.2018 https://www.iaria.org/conferences2018/VALID18.html |
Conference
Conference | The Tenth International Conference on Advances in System Testing and Validation Lifecycle |
---|---|
Abbreviated title | VALID 2018 |
Country/Territory | France |
City | Nice |
Period | 14.10.18 → 18.10.18 |
Internet address |