Learning to Categorize Bug Reports with LSTM Networks

Elmar Rueckert, Kaushikkumar D. Gondaliya, Jan Peters


The manual routing of bug reports to specialized expertteams is a time-consuming and expensive process. In this paper,we investigated how this process can be automated by trainingdeep networks and state-of-the-art classifiers from thousands ofreal bug reports from a software company. Different combinationsof the natural language processing methodslemmatization,postagger,bigramandstopword removalwere evaluated in theclassification algorithmsLinear Support Vector Machines(SVMs),multinomial naive Bayes, andLong Short Term Memory(LSTM)networks. For feature processing we used theTerm Frequency-Inverse Document Frequency(TF-IDF) method. Best results wereobtained with a combination of thebigrammethod and linearSVMs. Similar prediction performance values were observedwith LSTM networks that however promise to improve furtherwith larger datasets. The bug triage tool was implemented in amicroservice architecture using docker containers which allowsfor extending individual components and simplifies applicationsto other text classification problems.
Original languageEnglish
Number of pages6
Publication statusPublished - 01.10.2018
EventThe Tenth International Conference on Advances in System Testing and Validation Lifecycle - Nice, France
Duration: 14.10.201818.10.2018


ConferenceThe Tenth International Conference on Advances in System Testing and Validation Lifecycle
Abbreviated titleVALID 2018
Internet address


Dive into the research topics of 'Learning to Categorize Bug Reports with LSTM Networks'. Together they form a unique fingerprint.

Cite this