|
About: MultiBoost is a multi-purpose boosting package implemented in C++. It is based on the multi-class/multi-task AdaBoost.MH algorithm [Schapire-Singer, 1999]. Basic base learners (stumps, trees, products, Haar filters for image processing) can be easily complemented by new data representations and the corresponding base learners, without interfering with the main boosting engine. Changes:
|
|
About: A broad collection of script-friendly command-line tools for machine learning and data mining tasks. (The command-line tools wrap functionality from a C++ class library.) Changes: See the change log at http://waffles.sourceforge.net/changelog.html
|
|
About: The Advanced Data mining And Machine learning System (ADAMS) is a novel, flexible workflow engine aimed at quickly building and maintaining real-world, complex knowledge workflows. Changes:
|
|
About: ELKI is a framework for implementing data-mining algorithms with support for index structures. It includes a wide variety of clustering and outlier detection methods. Changes: This is mostly a bug fix release. Many small issues have been fixed, which improves performance, makes error reporting much better, and eases the use of sparse vectors and external precomputed distances. This will be the last ELKI release to support Java 6. The next ELKI release will require Java 7. Change log sections: Algorithms, Index layer, Database layer, Visualizations, Various, Tutorials.
|
|
About: Open-source Java software providing collaborative filtering algorithms. Changes: Initial Announcement on mloss.org.
|
|
About: Mulan is an open-source Java library for learning from multi-label datasets. Multi-label datasets consist of training examples of a target function that has multiple binary target variables. This means that each item of a multi-label dataset can be a member of multiple categories or annotated by many labels (classes). This is the nature of many real-world problems, such as semantic annotation of images and video, web page categorization, direct marketing, functional genomics, and music categorization into genres and emotions. Changes: Change log sections: Learners, Measures/Evaluation, Bug fixes, API changes, Miscellaneous.
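The multi-label setting described above can be illustrated with a minimal sketch in plain Python. This is not Mulan's Java API: the label names, the example data, and the helper functions `to_indicator` and `hamming_loss` are all hypothetical and exist only for illustration. Each example is stored as a binary indicator vector over the label set, and Hamming loss (a common multi-label measure) counts the fraction of label slots predicted wrongly.

```python
# Hypothetical label set; not taken from Mulan or any real dataset.
LABELS = ["rock", "jazz", "happy", "sad"]


def to_indicator(assigned, labels=LABELS):
    """Return a 0/1 vector with one entry per label."""
    unknown = set(assigned) - set(labels)
    if unknown:
        raise ValueError(f"unknown labels: {sorted(unknown)}")
    return [1 if label in assigned else 0 for label in labels]


def hamming_loss(true_rows, pred_rows):
    """Fraction of label slots where prediction and truth disagree."""
    total = sum(len(row) for row in true_rows)
    wrong = sum(
        t != p
        for true_row, pred_row in zip(true_rows, pred_rows)
        for t, p in zip(true_row, pred_row)
    )
    return wrong / total


# Two made-up songs, each annotated with several labels at once.
y_true = [to_indicator({"rock", "happy"}), to_indicator({"jazz"})]
y_pred = [to_indicator({"rock"}), to_indicator({"jazz", "sad"})]

print(y_true)                        # [[1, 0, 1, 0], [0, 1, 0, 0]]
print(hamming_loss(y_true, y_pred))  # 0.25 (2 wrong slots out of 8)
```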
|
|
About: MLwizard recommends and optimizes classification algorithms based on meta-learning. It is a software wizard fully integrated into RapidMiner, but it can also be used as a library. Changes: Faster parameter optimization using a genetic algorithm with a predefined start population.
|
|
About: Use the power of crowdsourcing to create ensembles. Changes: Initial Announcement on mloss.org.
|
|
About: Motivated by a need to classify high-dimensional, heterogeneous data from the bioinformatics domain, we developed ML-Flex, a machine-learning toolbox that enables users to perform two-class and multi-class classification analyses in a systematic yet flexible manner. ML-Flex was written in Java but is capable of interfacing with third-party packages written in other programming languages. It can handle multiple input-data formats and supports a variety of customizations. ML-Flex provides implementations of various validation strategies, which can be executed in parallel across multiple computing cores, processors, and nodes. Additionally, ML-Flex supports aggregating evidence across multiple algorithms and data sets via ensemble learning. (See http://jmlr.csail.mit.edu/papers/volume13/piccolo12a/piccolo12a.pdf.) Changes: Initial Announcement on mloss.org.
|
|
About: NaN-toolbox is a statistics and machine learning toolbox for handling data with and without missing values. Changes: Changes in v.2.5.2: faster version of quantile if multiple quantiles are requested; removes the dependency on ZLIB and thus fixes "pkg install nan" for Octave on Windows; a number of minor improvements. For details see the CHANGELOG at http://pub.ist.ac.at/~schloegl/matlab/NaN/CHANGELOG
|
|
About: The source code of the mldata.org site, a community portal for machine learning data sets. Changes: Initial Announcement on mloss.org.
|
|
About: Tools to convert datasets between various formats, performance measures, and API functions for communicating with mldata.org. Changes:
|
|
About: Apache Mahout is an Apache Software Foundation project with the goal of creating both a community of users and a scalable, Java-based framework consisting of many machine learning algorithm [...] Changes: We are pleased to announce release 0.4 of Mahout. Virtually every corner of the project has changed, and significantly, since 0.3. Developers are invited to use and depend on version 0.4 even as yet more change is to be expected before the next release. Highlights include:
|
|
About: Pyriel is a Python system for learning classification rules from data. Unlike other rule learning systems, it is designed to learn rule lists that maximize the area under the ROC curve (AUC) instead of accuracy. Pyriel is mostly an experimental research tool, but it's robust and fast enough to be used for lightweight industrial data mining. Changes: 1.5: Changed CF (confidence factor) to do Laplace smoothing of estimates. New flag "--score-for-class C" causes scores to be computed relative to a given (positive) class, for two-class problems. Fixed bug in example sampling code (--sample n). Fixed bug keeping old-style example formats (terminated by dot) from working. More code restructuring.
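The Laplace smoothing mentioned in the change notes can be sketched as follows. This is the standard textbook estimate, not Pyriel's actual code; the function name `laplace_confidence` and its arguments are hypothetical.

```python
def laplace_confidence(covered_pos, covered_total, n_classes=2):
    """Laplace-smoothed estimate of P(class | rule fires).

    Adds one pseudo-count per class, so a rule that covers no examples
    gets 1/n_classes instead of an undefined 0/0, and a rule covering
    only a few examples is not scored as a perfect 1.0.
    """
    if covered_pos < 0 or covered_total < covered_pos:
        raise ValueError("need 0 <= covered_pos <= covered_total")
    return (covered_pos + 1) / (covered_total + n_classes)


# A rule that is right on 3 of 3 examples scores lower than one
# that is right on 30 of 30, unlike the raw ratio (1.0 in both cases).
print(laplace_confidence(3, 3))    # 0.8
print(laplace_confidence(30, 30))  # 0.96875
print(laplace_confidence(0, 0))    # 0.5
```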
|
|
About: This Java software implements Profile Hidden Markov Models (PHMMs) for protein classification for the WEKA workbench. Standard PHMMs and newly introduced binary PHMMs are used. In addition, the software allows propositionalisation of PHMMs. Changes: Description changed.
|
|
About: Massive Online Analysis (MOA) is a real-time analytic tool for data streams. It is a software environment for implementing algorithms and running experiments for online learning from evolving data streams. MOA includes a collection of offline and online methods as well as tools for evaluation. In particular, it implements boosting, bagging, and Hoeffding Trees, all with and without Naive Bayes classifiers at the leaves. MOA supports bi-directional interaction with WEKA, the Waikato Environment for Knowledge Analysis, and it is released under the GNU GPL. Changes: Initial Announcement on mloss.org.
|
|
About: ELF provides many well-implemented supervised learners for classification and regression tasks, with the option of ensemble learning. Changes: Initial Announcement on mloss.org.
|
|
About: Debellor is a scalable and extensible platform that provides a common architecture for data mining and machine learning algorithms of various types. Changes:
|