Overhead Cross Section Sampling Machine Learning based Cervical Cancer Risk Factors Prediction

Peter Soosai Anandaraj

Overhead Cross Section Sampling Machine Learning based Cervical Cancer Risk Factors Prediction

Turkish Online Journal of Qualitative Inquiry (TOJQI) 12 (6): 7697-7715 (2021) Copy BIBT_EX

Abstract

Most forms of human papillomavirus can create alterations on a woman's cervix that can lead to cervical cancer in the long run, while others can produce genital or epidermal tumors. Cervical cancer is a leading cause of morbidity and mortality among women in low- and middle-income countries. The prediction of cervical cancer still remains an open challenge as there are several risk factors affecting the cervix of the women. By considering the above, the cervical cancer risk factor dataset from KAGGLE data warehouse is executed for predicting the cervical cancer risk classes. The cervical cancer data set is normalised with incomplete data and Pattern Calibration. Secondly, the interpretive data analysis is carried out, and the target feature's dispersion of the cervical cancer risk is visualised. Thirdly, several classifiers are fitted to the unprocessed data set, and the performance is measured with pre and post feature scaling. Fourth, oversampling methodologies are applied to the pre - processed data set. Fifth, the oversampled dataset by differment methods are applied to all the classifiers and the performance is compared with pre and post feature scaling. Sixth, Precision, recall, Fscore, accuracy, and running time are some of the metrics used in performance analysis. The code is written in Python and executed with Anaconda Navigator on the Spyder framework. The findings of the experiments reveal that the Random forest classifier tends to sustain 96% accuracy pre and post scaling for unporocessed dataset. Similarly the same classifier tends to sustain 98% accuracy for all the oversampling techniques.

View on PhilPapers

Archival history

Archival date: 2021-07-27
View all versions

Keywords

Machine learning classification scaling oversampling

Reprint years

Analytics

Added to PP
2021-07-27

Downloads
396 (#71,511)

6 months
87 (#79,804)

Historical graph of downloads since first upload

This graph includes both downloads from PhilArchive and clicks on external links on PhilPapers.

How can I increase my downloads?

Applied ethics	Epistemology	History of Western Philosophy	Meta-ethics	Metaphysics	Normative ethics
Philosophy of biology	Philosophy of language	Philosophy of mind	Philosophy of religion	Science Logic and Mathematics	More ...

Overhead Cross Section Sampling Machine Learning based Cervical Cancer Risk Factors Prediction

Abstract

Archival history

Categories

Keywords

Reprint years

Analytics