ENHANCED DERMATOSCOPIC SKIN LESION CLASSIFICATION USING MACHINE LEARNING TECHNIQUES
Abstract
Malignant melanoma poses significant impact on public health. Extensive research has focused on
distinguishing between benign and malignant skin lesions through dermatoscopic image analysis. Our study
emphasizes the classification aspect, using the MNIST HAM 10000 dataset. Initially, we addressed the challenge of
imbalanced data by applying Synthetic Minority Oversampling Technique (SMOTE) to balance the dataset,
significantly improving accuracy across various machine learning algorithms. Among these algorithms—Decision
Tree (using Gini index and Entropy), Naïve Bayes, XGBoost, Random Forest, Logistic Regression, and Support
Vector Machine (specifically with Polynomial kernel)—we found that Support Vector Machine with Polynomial
kernel achieved the highest accuracy of 96.825%. While XGBoost, being a Gradient Boosting algorithm, showed
varying results, we verified its accuracy using k-fold cross validation (k=10), achieving 95.984%. Ultimately, our
findings highlight Support Vector Machine with Polynomial kernel as the most effective for this classification
task[1]
