Full Length Research Paper
Abstract
Data mining is the process of extracting informative and useful rules or relations from existing data that can be used to make predictions about the values of new instances. A wide range of commercial and open source software programs is used for data mining. In this study, several classification algorithms included in open source software tools such as WEKA, Tanagra and Scikit-learn are compared using the SEER (Surveillance, Epidemiology, and End Results) data set, which consists of 60948 instances.
Key words: Data mining, classification analysis, open source data mining tools.
Copyright © 2024 Author(s) retain the copyright of this article.
This article is published under the terms of the Creative Commons Attribution License 4.0