Pattern classification with imbalanced and multiclass data for the prediction of albendazole adverse event outcomes

No Thumbnail Available

Date

2016

Journal Title

Journal ISSN

Volume Title

Publisher

Elsevier Science Bv

Open Access Color

OpenAIRE Downloads

OpenAIRE Views

Research Projects

Organizational Units

Journal Issue

Abstract

Class imbalance problem is one of the important problems for classification studies in data mining. In this study, a comparative analysis of some sampling methods was performed based on the evaluation of four classification algorithms for the prediction of albendazole adverse events outcomes. Albendazole is one of the main medications used for the treatment of a variety of parasitic worm infestations. The dataset was created from the public release of the FDA's FAERS database. Four sampling algorithms were used to analyze the dataset and their performance was evaluated by using four classifiers. Among the algorithms, ID3 with resample algorithm has higher accuracy results than the others after the application of sampling methods. This study supported that sampling methods are capable to improve the performance of learning algorithms. (C) 2016 The Authors. Published by Elsevier B.V.

Description

Keywords

Imbalanced class, under sampling, over sampling, RBF Network, IBK, ID3, Randomtree

Turkish CoHE Thesis Center URL

Fields of Science

Citation

12

WoS Q

Scopus Q

Q2

Source

7th International Conference on Ambient Systems, Networks and Technologies (ANT) / 6th International Conference on Sustainable Energy Information Technology (SEIT) -- MAY 23-26, 2016 -- Madrid, SPAIN

Volume

83

Issue

Start Page

1013

End Page

1018