Analysing the performance of machine learning algorithms for effective classification of breast cancer

dc.contributor.authorAchini Nisansala, M. M.
dc.date.accessioned2023-09-05T10:04:49Z
dc.date.available2023-09-05T10:04:49Z
dc.date.issued2023-05-03
dc.description.abstractBreast cancer is the most common type of cancer diagnosed in women throughout the world. It can occur at any age in women’s lives, but the risk increased with the age. In 2020 around 2.3millions of women are diagnosed with breast cancer and among them, around 0.68 million died globally. There are two types of breast cancer tumors: benign and malignant. Diagnosing breast cancer is kind of tough due to the compound nature of the breast cancer cells. However, the treatments for breast cancer are very effective when the disease is diagnosed at an early stage. In this study seven machine learning algorithms are used: Logistic Regression (LR), Linear Discriminant Analysis (LDA), K-Nearest Neighbor (KNN), Gaussian Naïve Bayes (GN), Decision Tree Classifier (C4.5), Support Vector Classifier (SVC) and Random Forest (RF) on Wisconsin Breast Cancer Dataset (WBCD) collected from UCI repository for classifying the tumors into benign and malignant. This analysis is carried out in two parts without removing the outliers from the dataset and after removing the outliers from the dataset. Based on the analysis without removing the outliers SVC outperforms other classifiers with 97.82% accuracy. After removing the outliers RF gives the highest accuracy of 96.18%.en_US
dc.identifier.citation11th International Symposium (IntSym 2023) "Managing Contemporary Issues for Sustainable Future through Multidisciplinary Research" Proceedings 03rd May 2023: South Eastern University of Sri Lanka. p. 691-700.en_US
dc.identifier.isbn978-955-627-013-6
dc.identifier.urihttp://ir.lib.seu.ac.lk/handle/123456789/6819
dc.language.isoen_USen_US
dc.publisherSouth Eastern University of Sri Lanka, University Park, Oluvil, Sri Lanka.en_US
dc.subjectBreast Canceren_US
dc.subjectClassification Algorithmsen_US
dc.subjectAccuracyen_US
dc.titleAnalysing the performance of machine learning algorithms for effective classification of breast canceren_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
IntSym 2023 Proceedings-691-700.pdf
Size:
863.09 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: