Treffer: Using Cross-Entropy Value of Code for Better Defect Prediction.

Title:
Using Cross-Entropy Value of Code for Better Defect Prediction.
Authors:
Xian Zhang1 tomtomzx@foxmail.com, Kerong Ben1, Jie Zeng1
Source:
International Journal of Performability Engineering. Sep2018, Vol. 14 Issue 9, p2105-2115. 11p.
Database:
Supplemental Index

Weitere Informationen

Defect prediction is meaningful because it can assist software inspection by predicting defective code locations and improving software reliability. Many software features are designed for defect prediction models to identify potential bugs, but no one feature set can perform well in most cases yet. To improve defect prediction, this paper proposes a new code feature, the cross-entropy value of the sequence of code's abstract syntax tree nodes (CE-AST), and develops a neural language model for feature measurement. To evaluate the effectiveness of CE-AST, we first investigate its discrimination for defect-proneness. Experiments on 12 Java projects show that CE-AST is more discriminative than 45% of twenty widely used traditional features. Furthermore, we investigate CE-AST's contribution to defect prediction. Combined with different traditional feature suites to feed prediction models, CE-AST can bring performance improvements of 4.7% in Precision, 2.5% in Recall, and 3.5% in F1 on average. [ABSTRACT FROM AUTHOR]