Comparison of Sampling Methods Using Machine Learning and Deep Learning Algorithms with an Imbalanced Data Set for the Prevention of Violence Against Physicians
No Thumbnail Available
Date
2021
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Institute of Electrical and Electronics Engineers Inc.
Open Access Color
OpenAIRE Downloads
OpenAIRE Views
Abstract
The aim of this study is to compare sampling methods using machine and deep learning algorithms with a small and imbalanced data set for the prevention of violence against physicians. In this data set, it is determined whether there is violence against physicians by using various demographic information of physicians. In addition, in this study, it is tried find effective solutions to improve the working conditions of physicians in order to reduce violence against physicians. As a solution to the imbalanced data problem, Synthetic Minority Oversampling (SMOTE), Random Oversampling (ROS) and Random Undersampling (RUS) methods were used to balance the data in this study. Then, Random Forest Classifier (RFC), Extra Tree Classifier (ETC) and Multi-Layer Perceptron (MLP) algorithms were applied. Among all sampling techniques and classification algorithms, the ETC algorithm applied with the ROS method shows the best performance with 82% accuracy and 0.81 F1-Score. © 2021 IEEE.
Description
Keywords
criminology, CRISP-DM, deep learning, ETC, machine learning, MLP, RFC, ROS, RUS, SMOTE
Turkish CoHE Thesis Center URL
Fields of Science
Citation
1
WoS Q
Scopus Q
Source
2021 Turkish National Software Engineering Symposium, UYMS 2021 - Proceedings -- 15th Turkish National Software Engineering Symposium, UYMS 2021 -- 17 November 2021 through 19 November 2021 -- Virtual, Izmir -- 176220