Pattern classification with imbalanced and multiclass data for the prediction of albendazole adverse event outcomes
No Thumbnail Available
Date
2016
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Elsevier Science Bv
Open Access Color
OpenAIRE Downloads
OpenAIRE Views
Abstract
Class imbalance problem is one of the important problems for classification studies in data mining. In this study, a comparative analysis of some sampling methods was performed based on the evaluation of four classification algorithms for the prediction of albendazole adverse events outcomes. Albendazole is one of the main medications used for the treatment of a variety of parasitic worm infestations. The dataset was created from the public release of the FDA's FAERS database. Four sampling algorithms were used to analyze the dataset and their performance was evaluated by using four classifiers. Among the algorithms, ID3 with resample algorithm has higher accuracy results than the others after the application of sampling methods. This study supported that sampling methods are capable to improve the performance of learning algorithms. (C) 2016 The Authors. Published by Elsevier B.V.
Description
Keywords
Imbalanced class, under sampling, over sampling, RBF Network, IBK, ID3, Randomtree
Turkish CoHE Thesis Center URL
Fields of Science
Citation
12
WoS Q
Scopus Q
Q2
Source
7th International Conference on Ambient Systems, Networks and Technologies (ANT) / 6th International Conference on Sustainable Energy Information Technology (SEIT) -- MAY 23-26, 2016 -- Madrid, SPAIN
Volume
83
Issue
Start Page
1013
End Page
1018