CROSS-PROJECT DEFECT PREDICTION PADA DATASET AEEEM MENGGUNAKAN HYBRID SMOTE–TOMEK DAN ENSEMBLE LEARNING
- FINA SIFAUL NUFUS
- 14230025
ABSTRAK
ABSTRAK
Nama : Fina Sifaul Nufus
NIM : 14230025
Program Studi : Ilmu Komputer
Fakultas : Teknologi Informasi
Jenjang : Strata Dua (S2)
Peminatan : Software Engineering & Data Science
Judul Tesis : Cross-project Defect Prediction pada Dataset AEEEM Menggunakan Hybrid SMOTE–Tomek dan Ensemble Learning
Penelitian ini bertujuan untuk meningkatkan kinerja Cross-project Defect Prediction (CPDP) pada dataset AEEEM melalui penerapan pendekatan hybrid preprocessing yang mengombinasikan Normalisasi, reduksi dimensi menggunakan PCA, penyeimbangan kelas dengan SMOTE–Tomek, serta penyesuaian threshold keputusan. Eksperimen dilakukan pada lima proyek AEEEM, yaitu EQ, JDT, LC, ML, dan PDE, dengan dua skenario utama, yaitu single-source CPDP dan multi-source CPDP. Model yang digunakan adalah Random Forest dan Support Vector Machine (SVM), sedangkan kinerja dievaluasi menggunakan metrik F1-score dan AUC. Hasil eksperimen menunjukkan bahwa pendekatan multi-source secara umum menghasilkan kinerja yang lebih stabil dibandingkan single-source. Selain itu, penerapan hybrid preprocessing terbukti mampu meningkatkan F1-score secara signifikan dibandingkan baseline, terutama pada dataset dengan rasio defect yang rendah. Ablation study mengonfirmasi bahwa peningkatan kinerja diperoleh dari kombinasi penyeimbangan kelas dan threshold tuning, bukan dari satu komponen tunggal.
Kata kunci: AEEEM, Cross-project Defect Prediction, Hybrid Preprocessing, Machine Learning, SMOTE–Tomek
KATA KUNCI
Cross project,Software Defect Prediction,Hybrid Smote Tomek,Ensemble Learning
DAFTAR PUSTAKA
DAFTAR PUSTAKA
[1] M. Ali et al., “Software Defect Prediction Using an Intelligent Ensemble-Based Model,” IEEE Access, vol. 12, no. January, pp. 20376–20395, 2024, doi: 10.1109/ACCESS.2024.3358201.
[2] M. M. Febrian, S. W. Saputro, T. H. Saragih, F. Abadi, and R. Herteno, “Hybrid Feature Selection and Balancing Data Approach for Improved Software Defect Prediction,” Indones. J. Electron. Electromed. Eng. Med. Informatics, vol. 7, no. 2, pp. 232–244, Apr. 2025, doi: 10.35882/ijeeemi.v7i2.67.
[3] Y. Zhao, Y. Zhu, Q. Yu, and X. Chen, “SS symmetry Cross-project Defect Prediction Considering Multiple Data,” pp. 1–18, 2022.
[4] N. Kumar, O. P. Sangwan, and S. Beniwal, “Software defect prediction using machine learning techniques,” 2025, p. 040004, doi: 10.1063/5.0299059.
[5] A. S. Nugraha, M. R. Faisal, F. Abadi, and ..., “Deep Neural Network on Software Defect Prediction,” … Softw. …, vol. 02, no. 02, pp. 82–89, 2021, [Online]. Available: https://jurnalmahasiswamipa.ulm.ac.id/index.php/integer/article/view/44%0A https://jurnalmahasiswamipa.ulm.ac.id/index.php/integer/article/download/44/ 20.
[6] E. H. A. Prastyo, M. A. Yaqin, Suhartono, M. Faisal, and R. A. J. Firdaus, “Naive Bayes Classification for Software Defect Prediction,” Trans. Informatics Data Sci., vol. 1, no. 1, pp. 11–20, 2024, doi: 10.24090/tids.v1i1.12192.
[7] Z. Li, J. Niu, and X.-Y. Jing, “Software defect prediction: future directions and challenges,” Autom. Softw. Eng., vol. 31, no. 1, p. 19, May 2024, doi: 10.1007/s10515-024-00424-1. [8] M. R. Kusuma, Windu Gata, Sigit Kurniawan, Dedi Dwi Saputra, and Supriadi Panggabean, “Software Defect Prediction For Quality Evaluation Using Learning Techniques Ensemble Stacking,” Inspir. J. Teknol. Inf. dan Komun., vol. 13, no. 2, pp. 1–13, Nov. 2023, doi: 10.35585/inspir.v13i2.58.
[9] D. A. Rebro, S. Chren, and B. Rossi, “Source Code Metrics for Software Defects Prediction,” Proc. ACM Symp. Appl. Comput., pp. 1469–1472, 2023, doi: 10.1145/3555776.3577809.
[10] A. K. Dewi, H. Y. Azmina, and Y. Sugiarti, “Penggunaan Metrik Software Defect dalam Pengembangan Perangkat Lunak: Literature Review,” Acad. J. Comput. Sci. Res., vol. 6, no. 2, p. 89, 2024, doi: 10.38101/ajcsr.v6i2.15649.
[11] M. SALMAN Saeed, M. Salman Saeed, M. Saleem, and S. Saeed, “Cross Project Software Defect Prediction Using Machine Learning: A Review,” Pp, vol. 3, no. June, p. 52, 2024, [Online]. Available: https://www.researchgate.net/publication/374557048.
[12] A. B. Nassif et al., “Software defect prediction using learning to rank approach,” Sci. Rep., vol. 13, no. 1, p. 18885, Nov. 2023, doi: 10.1038/s41598- 023-45915-5.
[13] M. Nevendra and P. Singh, “Cross-project Defect Prediction with Metrics Selection and Balancing Approach,” Appl. Comput. Syst., vol. 27, no. 2, pp. 137–148, Dec. 2022, doi: 10.2478/acss-2022-0015.
[14] K. Javed, R. Shengbing, M. Asim, and M. A. Wani, “Cross-project Defect Prediction Based on Domain adaptation and LSTM Optimization,” Algorithms, vol. 17, no. 5, p. 175, Apr. 2024, doi: 10.3390/a17050175.
[15] Y. Z. Bala, P. Abdul Samat, K. Y. Sharif, and N. Manshor, “The influence of machine learning on the predictive performance of cross-project defect prediction: empirical analysis,” TELKOMNIKA (Telecommunication Comput. Electron. Control., vol. 22, no. 4, p. 830, Aug. 2024, doi: 10.12928/telkomnika.v22i4.25916.
[16] A. O. Balogun et al., “Impact of Feature Selection Methods on the Predictive Performance of Software Defect Prediction Models: An Extensive Empirical Study,” Symmetry (Basel)., vol. 12, no. 7, p. 1147, Jul. 2020, doi: 10.3390/sym12071147.
[17] E. U. Oyo-ita, E. A. Edim, A. Otiko, and D. E. Izuki, “Improving model performance for software defect detection and prediction using ensemble method and cross validation techniques Improving model performance for software defect detection and prediction using ensemble method and cross validation techniques,” no. August, 2024.
[18] M. Dewi, T. H. Saragih, and R. Herteno, “Penerapan SMOTE-NCL untuk Mengatasi Ketidakseimbangan Kelas pada Klasifikasi Penyakit Jantung Koroner,” J. Inform. Polinema, vol. 10, no. 1, pp. 27–34, 2023, doi: 10.33795/jip.v10i1.1394.
[19] C. B. Handoko and C. S. K. Aditya, “Penerapan Teknik SMOTE Dalam Mengatasi Imbalance Data Penyakit Diabetes Menggunakan Algoritma ANN,” Smart Comp Jurnalnya Orang Pint. Komput., vol. 14, no. 1, pp. 13–20, 2025, doi: 10.30591/smartcomp.v14i1.7045.
[20] Y. Liu, W. Zhang, G. Qin, and J. Zhao, “A comparative study on the effect of data imbalance on software defect prediction,” Procedia Comput. Sci., vol. 214, pp. 1603–1616, 2022, doi: 10.1016/j.procs.2022.11.349.
[21] S. R. Goyal, “A systematic review on AI based class imbalance handling in software defect prediction,” Results Eng., vol. 27, p. 106578, Sep. 2025, doi: 10.1016/j.rineng.2025.106578.
[22] R. A. Astari, I. M. Sumertajaya, and A. M. Soleh, “A Hybrid Sampling Approach for Handling Data Imbalance in Ensemble Learning Algorithms,” Sci. J. Informatics, vol. 12, no. 2, pp. 247–258, 2025, doi: 10.15294/sji.v12i2.19163.
[23] M. Ilham, A. Winarno, M. Lutfi, and A. Indrasetianingsih, “Handling Imbalanced Fraudulent Transaction Data Using SMOTE-Tomek and Random Forest: A Classification Approach,” BEST J. Appl. Electr. Sci. Technol., vol. 7, no. 1, pp. 35–38, Mar. 2025, doi: 10.36456/best.vol7.no1.10335.
[24] Emma Andini, M. R. Faisal, Rudy Herteno, R. A. Nugroho, Friska Abadi, and Muliadi, “Peningkatan Kinerja Prediksi Cacat Software Dengan Hyperparameter Tuning Pada Algoritma Klasifikasi Deep Forest,” J. Mnemon., vol. 5, no. 2, pp. 119–127, 2022, doi: 10.36040/mnemonic.v5i2.4793.
[25] A. J. Anju and J. E. Judith, “Hybrid feature selection method for predicting 48 Program Studi Ilmu Komputer (S2) FTI Universitas Nusa Mandiri software defect,” J. Eng. Appl. Sci., vol. 71, no. 1, p. 124, Dec. 2024, doi: 10.1186/s44147-024-00453-3.
[26] M. Abdelhamid and A. Desai, “Balancing the Scales: A Comprehensive Study on Tackling Class Imbalance in Binary Classification,” 2024, [Online]. Available: http://arxiv.org/abs/2409.19751.
[27] Z. Fadia Laila Hidayati, F. Abadi, M. Reza Faisal, and D. Kartini, “Improving Performance of Random Forest Algorithm Using Abc Feature Selection for Software Defect Prediction,” J. CoreIT, vol. 10, no. 1, pp. 32–39, 2024, doi: 10.24014/coreit.v10i1.29283.
Detail Informasi
Tesis ini ditulis oleh :
- Nama : FINA SIFAUL NUFUS
- NIM : 14230025
- Prodi : Ilmu Komputer
- Kampus : Margonda
- Tahun : 2025
- Periode : II
- Pembimbing : Dr. Yan Riyanto, M. Eng
- Asisten :
- Kode : 0024.S2.IK.TESIS.II.2025
- Diinput oleh : RKY
- Terakhir update : 28 April 2026
- Dilihat : 5 kali
TENTANG PERPUSTAKAAN
E-Library Perpustakaan Universitas Nusa Mandiri merupakan
platform digital yang menyedikan akses informasi di lingkungan kampus Universitas Nusa Mandiri seperti akses koleksi buku, jurnal, e-book dan sebagainya.
INFORMASI
Alamat : Jln. Jatiwaringin Raya No.02 RT08 RW 013 Kelurahan Cipinang Melayu Kecamatan Makassar Jakarta Timur
Email : perpustakaan@nusamandiri.ac.id
Jam Operasional
Senin - Jumat : 08.00 s/d 20.00 WIB
Isitirahat Siang : 12.00 s/d 13.00 WIB
Istirahat Sore : 18.00 s/d 19.00 WIB
Perpustakaan Universitas Nusa Mandiri @ 2020