STUCKMAN
Updated 69 days ago
To complement the datasets discussed above, we are currently developing tools supporting easier replication of prediction studies for defects and vulnerabilities... Defect and vulnerability prediction often involves the construction of models which estimate the likelihood that a defect will exist in a particular source code artifact. Sometimes, these models can also be utilized in a generative capacity to produce synthetic data (such as counts of simulated defects). We are examining if characteristics of this synthetic data can be compared to those of real defect data in order to examine the consistency of the model against the data that was actually observed. In addition, we are studying ways to improve cross-project prediction performance by increasing the generality of predictive models, in the same way that avoiding overfitting can improve performance in within-project prediction... Much research on defect prediction with software metrics has studied languages such as Java and C;..