Effectiveness of SMOTE in Enhancing Adult Autism Spectrum Disorder Diagnosis Predictive Performance With Missforest Imputation And Random Forest

Autism Spectrum Disorder Random Forest Imbalanced Data Missing Values MissForest SMOTE

Authors

  • Muhammad Hafizh Musyaffa Department of Computer Science, Lambung Mangkurat University, Kalimantan Selatan,Indonesia, Indonesia
  • Triando Hamonangan Saragih
    Triando.saragih@ulm.ac.id
    Department of Computer Science, Lambung Mangkurat University, Kalimantan Selatan,Indonesia, Indonesia
  • Dodon Turianto Nugrahadi Department of Computer Science, Lambung Mangkurat University, Kalimantan Selatan, Indonesia, Indonesia
  • Dwi Kartini Department of Computer Science, Lambung Mangkurat University, Kalimantan Selatan,Indonesia, Indonesia
  • Andi Farmadi Department of Computer Science, Lambung Mangkurat University, Kalimantan Selatan,Indonesia, Indonesia
March 4, 2025
March 16, 2025
April 24, 2025

Downloads

Autism Spectrum Disorder (ASD), originally described by Leo Kanner in 1943, is a complex developmental condition that manifests through social, emotional, and behavioral challenges, often including speech delays and difficulties in interpersonal interactions. Despite significant advancements in diagnostic criteria over the years, accurate diagnosis of ASD in adults remains challenging due to limited access to comprehensive datasets and inherent methodological constraints. The Autism Screening Adult dataset used in this study exemplifies these issues, as it contains missing values and exhibits a marked class imbalance, both of which can adversely affect model performance. To address these challenges, we proposed a framework that integrates Random Forest classification with MissForest imputation and the Synthetic Minority Over-sampling Technique (SMOTE). MissForest effectively imputes missing data by employing an iterative random forest approach that preserves the underlying structure of the data without relying on strict parametric assumptions. Meanwhile, SMOTE generates synthetic samples for the minority class, thereby balancing the dataset and reducing prediction bias. Experimental evaluation through 10-Fold Cross Validation demonstrated that the application of SMOTE significantly enhanced model performance. Notably, the overall accuracy improved from 70.17% to 79.32%, and the AUC-ROC increased from 47.13% to 85.84%, indicating a robust improvement in the model’s ability to distinguish between positive and negative cases. These results underscore the critical importance of addressing data imbalance and missing values in predictive modeling for ASD. The promising outcomes of this study provide a solid foundation for developing more reliable diagnostic tools for adult ASD, and future research may further refine feature selection and incorporate additional data sources to optimize performance even further.

How to Cite

Musyaffa, M. H., Saragih, T. H. ., Nugrahadi, D. T. ., Kartini, D., & Farmadi, A. (2025). Effectiveness of SMOTE in Enhancing Adult Autism Spectrum Disorder Diagnosis Predictive Performance With Missforest Imputation And Random Forest. Indonesian Journal of Electronics, Electromedical Engineering, and Medical Informatics, 7(2), 270-280. https://doi.org/10.35882/ijeeemi.v7i2.66

Most read articles by the same author(s)

1 2 > >> 

Similar Articles

1-10 of 142

You may also start an advanced similarity search for this article.