Developing Conceptual Framework of Software Defect Prediction in Software Testing: The Case of Ethiopian Software Industries

Authors

  • Musa Dima Genemo Department of Computer Engineering

DOI:

https://doi.org/10.47604/ijts.1834

Keywords:

Software Defect, Defect Prediction, Software Testing, Software Security

Abstract

Purpose: Software defect prediction is one of the most active research areas in software engineering. As our dependency on software is increasing, software quality is becoming gradually more and more important in present era. Software used almost everywhere and in every tread of life. Software consequences such as fault and failures may diminish the quality of software which leads to customer dissatisfaction. Different points that are related to the current work had been discussed.

Methodology: This research contains a detailed explanation of the scientific methods and methodologies used for the design of proposed framework. Primary and secondary source of data has been used.

Findings: Due to the tremendous amount of data generated daily from fields such as business that software has resulted in the generation of tremendous amount of defected data. As the organizations struggle to handle and utilize effectively all the information available in order to provide better products and services and gain competitive advantage, defect separation and the so called "big data" analytics are two fields that currently constitute matter of concern and discussion.

Unique Contribution to Theory, Practice and Policy: There are different research areas in defect prediction using big data analytics defect segmentations one of the issues. Defect Segmentation is an important component of much organization, and the algorithm designed in this work is expected to be a significant contribution to the field, and mainly to researchers working on various aspects of defect prediction defect segmentation using big data analytics. Therefore, the researchers in the area can use the algorithm or the implemented system for processing segmentation as component in their research, mainly on machine learning applications.

 

Downloads

Download data is not yet available.

Author Biography

Musa Dima Genemo , Department of Computer Engineering

 

 

References

D. W. Aha, D. Kibler, and M. K. Albert. Instance-based learning algorithms. Machine Learning, 6(1):1573-0565, January 1991.

M. L. Berenson, M. Goldstein, and D. Levine. Intermediate Statistical Methods and Applications: A Computer Package Approach. Prentice-Hall, Englewood Cliffs, NJ, 1983.

G. Boetticher, T. Menzies, and T. Ostrand. Promise repository of empirical software engineering data. http://promisedata.org/, 2007.

A. C. Cameron and P. K. Trivedi. Regression Analysis of Count Data. Cambridge University Press, 1998.

Z. Chen, T. Menzies, D. Port, and B. Boehm. Finding the right data for software cost modeling. IEEE Software, 22(6):38-46, November 2005.

J. T. de Souza, N. Japkowicz, and S. Matwin. Stochfs: A framework for combining feature selection outcomes through a stochastic process. In Proceedings of the 9th European Conference on Principles and Practice of Knowledge Discovery in Databases, pages 667-674, Porto, Portugal, October 3-7 2005.

T. Fawcett. An introduction to ROC analysis. Pattern Recognition Letters, 27(8):861-874, June 2006.

U. M. Fayyad and K. B. Irani. On the handling of continuous-valued attributes in decision tree generation. Machine Learning, 8:87-102, 1992.

K. Gao, T. M. Khoshgoftaar, and H. Wang. An empirical investigation of filter attribute selection techniques for software quality classification. In Proceedings of the 10th IEEE International Conference on Information Reuse and Integration, pages 272-277, Las Vegas, Nevada, August 10-12 2009.

I. Guyon and A. Elisseeff. An introduction to variable and feature selection. Journal of Machine Learning Research, 3:1157-1182, March 2003.

M. A. Hall and G. Holmes. Benchmarking attribute selection techniques for discrete class data mining. IEEE Transactions on Knowledge and Data Engineering, 15(6):1437 - 1447, Nov/Dec 2003.

S. Haykin. Neural Networks: A Comprehensive Foundation. Prentice-Hall, 2nd edition, 1998.

G. Ilczuk, R. Mlynarski, W. Kargul, and A. Wakulicz-Deja. New feature selection methods for qualification of the patients for cardiac pacemaker implantation. In Computers in Cardiology, 2007, pages 423-426, Durham, NC, USA, 2007.

Y. Jiang, J. Lin, B. Cukic, and T. Menzies. Variance analysis in software fault prediction models. In Proceedings of the 20th IEEE International Symposium on Software Reliability Engineering, pages 99-108, Bangalore-Mysore, India, Nov. 16-19 2009.

G. H. John and P. Langley. Estimating continuous distributions in Bayesian classifiers. In Proceedings of Eleventh Conference on Uncertainty in Artificial Intelligence, volume 2, pages 338-345, San Mateo, 1995.

K. Jong, E. Marchiori, M. Sebag, and A. van der Vaart. Feature selection in proteomic pattern data with support vector machines. In Proceedings of the 2004 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, Oct 7-8 2004.

S. A. Julious. Using confidence intervals around individual means to assess statistical significance between two means. Pharmaceutical Statistics, 3:217-222, 2004.

T. M. Khoshgoftaar, L. A. Bullard, and K. Gao. Attribute selection using rough sets in software quality classification. International Journal of Reliability, Quality and Safety Engineering, 16(1):73-89, 2009.

T. M. Khoshgoftaar, M. Golawala, and J. V. Hulse. An empirical study of learning from imbalanced data using random forest. In Proceedings of the 19th IEEE International Conference on Tools with Artificial Intelligence, volume 2, pages 310-317, Patras, Greece, Oct. 29-31 2007.

T. M. Khoshgoftaar, Y. Xiao, and K. Gao. Assessment of a multi-strategy classifier for an embedded software system. In Proceedings of 18th IEEE International Conference on Tools with Artificial Intelligence, pages 651-658, Washington, DC, USA, November 13-15 2006.

K. Kira and L. A. Rendell. A practical approach to feature selection. In Proceedings of 9th International Workshop on Machine Learning, pages 249-256, 1992.

A. G. Koru, D. Zhang, K. E. Emam, and H. Liu. An investigation into the functional form of the size-defect relationship for software modules. IEEE Transactions on Software Engineering, 35(2):293-304, 2009.

S. Le Cessie and J. C. Van Houwelingen. Ridge estimators in logistic regression. Applied Statistics, 41(1):191-201, 1992.

K. Lee. Combining multiple feature selection methods. In Mid-Atlantic Student Workshop on Programming Languages and Systems (MASPLAS'02), pages 12.1-12.9, 2002.

S. Lessmann, B. Baesens, C. Mues, and S. Pietsch. Benchmarking classification models for software defect prediction: A proposed framework and novel findings. IEEE Transactions on Software Engineering, 34(4):485-496, 2008.

H. Liu and L. Yu. Toward integrating feature selection algorithms for classification and clustering. IEEE Transactions on Knowledge and Data Engineering, 17(4):491-502, 2005.

Downloads

Published

2023-03-09

How to Cite

Gemeno, M. (2023). Developing Conceptual Framework of Software Defect Prediction in Software Testing: The Case of Ethiopian Software Industries. International Journal of Technology and Systems, 8(1), 1–13. https://doi.org/10.47604/ijts.1834

Issue

Section

Articles