|
About: This toolbox implements a novel visualization technique called Sectors on Sectors (SonS), and a extended version called Multidimensional Sectors on Sectors (MDSonS), for improving the interpretation of several data mining algorithms. The MDSonS method makes use of Multidimensional Scaling (MDS) to solve the main drawback of the previous method, namely, the lack of representing distances between pairs of clusters. These methods have been applied for visualizing the results of hierarchical clustering, Growing Hierarchical Self-Organizing Maps (GHSOM), classification trees and several manifolds. These methods make possible to extract all the existing relationships among centroids’ attributes at any hierarchy level. Changes:Initial Announcement on mloss.org.
|
|
About: Cluster quality Evaluation software. Implements cluster quality metrics based on ground truths such as Purity, Entropy, Negentropy, F1 and NMI. It includes a novel approach to correct for pathological or ineffective clusterings called 'Divergence from a Random Baseline'. Changes:Initial Announcement on mloss.org.
|
|
About: The apcluster package implements Frey's and Dueck's Affinity Propagation clustering in R. The package further provides leveraged affinity propagation, exemplar-based agglomerative clustering, and various tools for visual analysis of clustering results. Changes:
|
|
About: This project is a C++ toolkit containing machine learning algorithms and tools that facilitate creating complex software in C++ to solve real world problems. Changes:This release has primarily focused on improving the flexibility and ease of use of the object detection tools.
|
|
About: The Cognitive Foundry is a modular Java software library of machine learning components and algorithms designed for research and applications. Changes:
|
|
About: A broad collection of script-friendly command-line tools for machine learning and data mining tasks. (The command-line tools wrap functionality from a C++ class library.) Changes:See the change log at http://waffles.sourceforge.net/changelog.html
|
|
About: MLDemos is a user-friendly visualization interface for various machine learning algorithms for classification, regression, clustering, projection, dynamical systems, reward maximisation and reinforcement learning. Changes:New Visualization and Dataset Features Added 3D visualization of samples and classification, regression and maximization results Added Visualization panel with individual plots, correlations, density, etc. Added Editing tools to drag/magnet data, change class, increase or decrease dimensions of the dataset Added categorical dimensions (indexed dimensions with non-numerical values) Added Dataset Editing panel to swap, delete and rename dimensions, classes or categorical values Several bug-fixes for display, import/export of data, classification performance New Algorithms and methodologies Added Projections to pre-process data (which can then be classified/regressed/clustered), with LDA, PCA, KernelPCA, ICA, CCA Added Grid-Search panel for batch-testing ranges of values for up to two parameters at a time Added One-vs-All multi-class classification for non-multi-class algorithms Trained models can now be kept and tested on new data (training on one dataset, testing on another) Added a dataset generator panel for standard toy datasets (e.g. swissroll, checkerboard,...) Added a number of clustering, regression and classification algorithms (FLAME, DBSCAN, LOWESS, CCA, KMEANS++, GP Classification, Random Forests) Added Save/Load Model option for GMMs and SVMs Added Growing Hierarchical Self Organizing Maps (original code by Michael Dittenbach) Added Automatic Relevance Determination for SVM with RBF kernel (Thanks to Ashwini Shukla!)
|
|
About: The Weka workbench contains a collection of visualization tools and algorithms for data analysis and predictive modelling, together with graphical user interfaces for easy access to this [...] Changes:http://sourceforge.net/projects/weka/files/weka-3-7/3.7.9/README-3-7-9.txt/view
|
|
About: Orange is a component-based machine learning and data mining software. It includes a friendly yet powerful and flexible graphical user interface for visual programming. For more advanced use(r)s, [...] Changes:Added multi-target and multi-label learning, neural networks, Earth (MARS), PLS, and a faster tree induces for use in random forests; reorganization of module hierarchy; (weakly supported) Qwt has been replaced with a homemade module; networkx is used instead of a (weak) homemade structures for graphs; documentation has been moved to .rst, with a lot of it written anew or heavily redacted; improved system for registration of add-ons.
|
|
About: Automatic Analysis of Malware Behavior using Machine Learning Changes:The tool's persistent state is stored in the local state directory (i.e. /var) for better maintenance. Several minor bugs have been fixed.
|
|
About: FABIA is a biclustering algorithm that clusters rows and columns of a matrix simultaneously. Consequently, members of a row cluster are similar to each other on a subset of columns and, analogously, members of a column cluster are similar to each other on a subset of rows. Biclusters are found by factor analysis where both the factors and the loading matrix are sparse. FABIA is a multiplicative model that extracts linear dependencies between samples and feature patterns. Applications include detection of transcriptional modules in gene expression data and identification of haplotypes/>identity by descent< consisting of rare variants obtained by next generation sequencing. Changes:CHANGES IN VERSION 2.4.0
CHANGES IN VERSION 2.3.1NEW FEATURES 2.0.0:
1.4.0:
|
|
About: ELKI is a framework for implementing data-mining algorithms with support for index structures, that includes a wide variety of clustering and outlier detection methods. Changes:This is mostly a bug fix release. A lot of small issues have been fixed that improve performance, make error reporting a lot better, ease the use of sparse vectors and external precomputed distances, for example. This will be the last ELKI release to support Java 6. The next ELKI release will require Java 7. Algorithms
Index layer
Database layer
Visualizations
Various
Tutorials
|
|
About: C++ software for statistical classification, probability estimation and interpolation/non-linear regression using variable bandwidth kernel estimation. Changes:New in Version 0.9.6:
|
|
About: Divvy is a Mac OS X application for performing dimensionality reduction, clustering, and visualization. Changes:Initial Announcement on mloss.org.
|
|
About: Fast C++ implementation of the variation of information (Meila 2003) and Rand index (Rand 1971) with MATLAB mex files Changes:Initial Announcement on mloss.org.
|
|
About: mlpy is a Python module for Machine Learning built on top of NumPy/SciPy and of GSL. Changes:New features:
Fix:
|
|
About: Implementation of the multi-assignment clustering method for Boolean vectors. Changes:new bib added
|
|
About: An annotated java framework for machine learning, aimed at making it really easy to access analytically functions. Changes:Now supports OLS and GLS regression and NaiveBayes classification
|
|
About: FLANN is a library for performing fast approximate nearest neighbor searches in high dimensional spaces. It contains a collection of algorithms we found to work best for nearest neighbor search. Changes:See project page for changes.
|
|
About: The K-tree is a scalable approach to clustering inspired by the B+-tree and k-means algorithms. Changes:Release of K-tree implementation in Python. This is targeted at a research and rapid prototyping audience.
|


