Project details for KEEL Knowledge Extraction based on Evolutionary Learning

Screenshot KEEL Knowledge Extraction based on Evolutionary Learning 3.0

by keel - September 18, 2015, 12:38:54 CET [ Project Homepage BibTeX Download ]

view (1 today), download ( 0 today ), 0 subscriptions

Description:

KEEL is a software tool to assess EAs for DM problems including regression, classification, clustering, pattern mining and so on. The version of KEEL presently available consists of the following function blocks:

  • Data Management: This part is composed by a set of tools that can be used to build new data, export and import data in other formats to KEEL format, data edition and visualization, apply transformations and partitioning to data, etc...

  • Design of Experiments: The aim of this part is the design of the desired experimentation over the selected data sets. It provides options for many choices: type of validation, type of learning (classification, regression, unsupervised learning, subgroup discovery), etc...

  • Design of Imbalanced Experiments: The aim of this part is the design of the desired experimentation over the selected imbalanced data sets. These experiments are created for 5cfv datasets and include specific algorithms for imbalanced data and general classification algorithms.

  • Experimentation with Multiple Instance Learning Algorithms: In this section any researcher is able to address classification with multiple instance datasets. In this case, instead of receiving a set of instances which are labeled positive or negative, the learner receives a set of bags, with multiple instances, that are labeled positive or negative. The most common assumption is that a bag is labeled negative if all the instances in it are negative. On the other hand, a bag is labeled positive if there is at least one instance in it which is positive.

  • Experimentation with Semi-supervised Learning Algorithms: In this section any researcher is able to address classification with semi-supervised learning datasets. In this case, the learner works with both unlabeled and labeled examples and it can be used to perform both a transductive and inductive classification. The former concerns the problem of predicting the labels of the unlabeled examples, given in advance (in the training set), by taking both labeled and unlabeled data together into account to train a classifier. The latter considers the given labeled and unlabeled data as the training examples, and its objective is to predict unseen data.

  • Statistical Tests: KEEL is one of the fewest Data Mining software tools that provides to the researcher a complete set of statistical procedures for pairwise and multiple comparisons. Inside the KEEL environment, several parametric and nonparametric procedures have been coded, which should help to contrast the results obtained in any experiment performed with the software tool.

  • Educational Experiments: With a similar structure to the Design of Experimets part, allows us to design an experiment which can be step-by-step debugged in order to use this as a guideline to show the learning process of a certain model by using the platform with educational objectives.

Taking into account each one of the function blocks, KEEL can be useful by different types of user, which expect to find determined features in a Data Mining (DM) software. It is mainly intended for two categories of users: researchers and students. Either group has a different set of needs:

  • KEEL as a research tool: The most common use of this tool for a researcher will be the automated execution of experiments, and the statistical analysis of their results. Routinely, an experimental design includes a mix of evolutionary algorithms, statistical and AI-related techniques. Special care was taken to make possible that a researcher can use KEEL to assess the relevance of his own procedures. Since the actual standards in machine learning require heavy computational work, the research tool is not designed to offer a real-time view of the progress of the algorithms, it is designed to rather generate a script and be batch-executed in a cluster of computers. The tool allows the researcher to apply the same sequence of pre-processing, experiments and analysis to large batteries of problems and focus his attention in the summary of the results.

  • KEEL as an educational tool: The needs of a student are quite different to those of a researcher. Generally speaking, the objective is no longer that of making statistically sound comparisons between algorithms. There is no need of repeating each experiment a large number of times. If the tool is to be used in class, the execution time must be short and a real-time view of the evolution of the algorithms is needed, since the student will use this information to learn how to adjust the parameters of the algorithms. In this sense, the educational tool is a simplified version of the research tool, where only the most relevant algorithms are available. The execution is made in real time. The user has a visual feedback of the progress of the algorithms, and can access the final results from the same interface used to design the experimentation.

Both types of user require an availability of a set of features in order to be interested in using KEEL. Then, this is when we describe the main features of the KEEL software tool.

Changes to previous version:

Initial Announcement on mloss.org.

BibTeX Entry: Download
Supported Operating Systems: Platform Independent
Data Formats: Arff
Archive: download here

Comments

No one has posted any comments yet. Perhaps you'd like to be the first?

Leave a comment

You must be logged in to post comments.