Funding: Funded by the Youth Fund of the National Natural Science Foundation of China (Grant No. 42261070).
Abstract: Spectrum-based fault localization (SBFL) uses the program execution spectrum to generate a ranked list of suspicious elements, but too many elements share the same rank, which lowers localization accuracy. Most existing work exploits intra-class dependencies to improve accuracy. However, some studies show that inter-class method-call faults account for more than 20%, so these approaches remain limited. To address both problems, this paper proposes a two-phase software fault localization method based on relational graph convolutional neural networks (Two-RGCNFL). In Phase 1, the method call dependence graph (MCDG) of the program is constructed, a relational graph convolutional neural network extracts the intra-class and inter-class dependencies in the MCDG, and a classifier identifies the faulty methods. The GraphSMOTE algorithm is also improved to reduce the impact of class imbalance on classification accuracy. In Phase 2, which targets the tied suspiciousness values produced by traditional SBFL, Doc2Vec learns static features while spectrum information serves as dynamic features, and a RankNet model based on a siamese multi-layer perceptron scores and ranks the statements in the faulty method. Experiments are conducted on 5 real projects from the Defects4J benchmark. Compared with the traditional SBFL technique and two baseline methods, the approach improves Top-1 accuracy by 262.86%, 29.59%, and 53.01%, respectively, which verifies the effectiveness of Two-RGCNFL. Ablation experiments further confirm the importance of inter-class dependencies.
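The tied-rank problem mentioned in the abstract can be illustrated with a small sketch. It assumes the Ochiai formula as the suspiciousness metric; the abstract does not say which SBFL formula the paper uses, and the statement names, test ids, and the `rank_with_ties` helper are hypothetical. This is not the Two-RGCNFL method itself, only a demonstration of why statements with identical coverage cannot be separated by spectrum information alone.

```python
import math
from collections import defaultdict


def ochiai(ef, ep, total_failed):
    """Ochiai suspiciousness: ef / sqrt(total_failed * (ef + ep)).

    ef / ep: number of failing / passing tests that execute the statement.
    Ochiai is an assumption here; the abstract names no specific formula.
    """
    denom = math.sqrt(total_failed * (ef + ep))
    return ef / denom if denom else 0.0


def rank_with_ties(coverage, outcomes):
    """Group statements by score, highest score first.

    coverage: {statement: set of test ids that execute it}
    outcomes: {test id: True if the test passed, False if it failed}
    Returns a list of (score, [statements sharing that score]).
    """
    failed = {t for t, passed in outcomes.items() if not passed}
    groups = defaultdict(list)
    for stmt, tests in coverage.items():
        ef = len(tests & failed)   # failing tests covering stmt
        ep = len(tests - failed)   # passing tests covering stmt
        groups[ochiai(ef, ep, len(failed))].append(stmt)
    return sorted(((s, sorted(g)) for s, g in groups.items()), reverse=True)


# Hypothetical spectrum: s1 and s2 are covered by exactly the same tests.
coverage = {
    "s1": {"t1", "t2"},
    "s2": {"t1", "t2"},
    "s3": {"t1"},
    "s4": {"t2", "t3"},
}
outcomes = {"t1": False, "t2": True, "t3": True}

for score, stmts in rank_with_ties(coverage, outcomes):
    print(f"{score:.4f}  {stmts}")
```

In this toy spectrum `s1` and `s2` receive the same score, so a developer inspecting the list has no basis for choosing between them. That is the gap the abstract's Phase 2 aims to close by combining static features with the spectrum.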
Funding: Supported by the National Natural Science Foundation of China (Grant Nos. 52275104, 51905160) and the Natural Science Fund for Excellent Young Scholars of Hunan Province (Grant No. 2021JJ20017).
Abstract: Existing unsupervised domain adaptation approaches primarily focus on reducing the data distribution gap between the source and target domains and often neglect the influence of class information, which leads to inaccurate alignment. Guided by this observation, this paper proposes an adaptive inter-intra-domain discrepancy method to quantify the intra-class and inter-class discrepancies between the source and target domains. An adaptive factor is introduced to dynamically assess their relative importance. Building on this discrepancy measure, we develop an inter-intra-domain alignment network with a class-aware sampling strategy (IDAN-CSS) to distill the feature representations. The class-aware sampling strategy, integrated within IDAN-CSS, makes training more efficient. Multiple transfer diagnosis cases demonstrate the feasibility and effectiveness of the proposed IDAN-CSS model.