About: A WEKA package for analyzing emotion and sentiment of tweets. Changes:Initial Announcement on mloss.org.
|
About: A thin Python3 wrapper that uses the javabridge Python library to communicate with a Java Virtual Machine executing Weka API calls. Changes:
|
About: A thin Python wrapper that uses the javabridge Python library to communicate with a Java Virtual Machine executing Weka API calls. Changes:
|
About: The Advanced Data mining And Machine learning System (ADAMS) is a flexible workflow engine aimed at quickly building and maintaining data-driven, reactive workflows, easily integrated into business processes. Changes:Some highlights:
|
About: An open-source framework for benchmarking of feature selection algorithms and cost functions. Changes:Initial Announcement on mloss.org.
|
About: DynaML is a Scala environment for conducting research and education in Machine Learning. DynaML comes packaged with a powerful library of classes implementing predictive models and a Scala REPL where one can not only build custom models but also play around with data work-flows. Changes:Initial Announcement on mloss.org.
|
About: A native Python, scikit-compatible, implementation of a variety of multi-label classification algorithms. Changes:
|
About: NaN-toolbox is a statistics and machine learning toolbox for handling data with and without missing values. Changes:Changes in v.3.1.2 - improve configuration and build system - support of more platforms (including Octave 4.2.0) improved Changes in v.3.0.3 - improve compatibility for Octave on Windows Changes in v.3.0.1 - fix packaging for octave Changes in v.2.8.5 - bug fix: trimmean - compiler support for gcc-5 and clang - fix typos For details see the CHANGELOG at http://pub.ist.ac.at/~schloegl/matlab/NaN/CHANGELOG
|
About: General purpose Java Machine Learning library for classification, regression, and clustering. Changes:See github release tab for change info
|
About: A Java Toolbox for Scalable Probabilistic Machine Learning. Changes:
Detailed information can be found in the toolbox's web page
|
About: machine learning library in java for easy development of new kernels and kernel algorithms Changes:Version 3.0 /! Warning: this version is incompatible with previous code
|
About: Jatecs is an open source Java library focused on automatic text categorization. Changes:Initial Announcement on mloss.org.
|
About: ELKI is a framework for implementing data-mining algorithms with support for index structures, that includes a wide variety of clustering and outlier detection methods. Changes:Additions and improvements from ELKI 0.7.0 to 0.7.1: Algorithm additions:
Important bug fixes:
UI improvements:
Smaller changes:
|
About: Easily prototype WEKA classifiers and filters using Python scripts. Changes:0.3.0
0.2.1
0.2.0
0.1.1
0.1.0
|
About: Apache Mahout is an Apache Software Foundation project with the goal of creating both a community of users and a scalable, Java-based framework consisting of many machine learning algorithm [...] Changes:Apache Mahout introduces a new math environment we call Samsara, for its theme of universal renewal. It reflects a fundamental rethinking of how scalable machine learning algorithms are built and customized. Mahout-Samsara is here to help people create their own math while providing some off-the-shelf algorithm implementations. At its core are general linear algebra and statistical operations along with the data structures to support them. You can use is as a library or customize it in Scala with Mahout-specific extensions that look something like R. Mahout-Samsara comes with an interactive shell that runs distributed operations on a Spark cluster. This make prototyping or task submission much easier and allows users to customize algorithms with a whole new degree of freedom. Mahout Algorithms include many new implementations built for speed on Mahout-Samsara. They run on Spark 1.3+ and some on H2O, which means as much as a 10x speed increase. You’ll find robust matrix decomposition algorithms as well as a Naive Bayes classifier and collaborative filtering. The new spark-itemsimilarity enables the next generation of cooccurrence recommenders that can use entire user click streams and context in making recommendations.
|
About: KEEL (Knowledge Extraction based on Evolutionary Learning) is an open source (GPLv3) Java software tool that can be used for a large number of different knowledge data discovery tasks. KEEL provides a simple GUI based on data flow to design experiments with different datasets and computational intelligence algorithms (paying special attention to evolutionary algorithms) in order to assess the behavior of the algorithms. It contains a wide variety of classical knowledge extraction algorithms, preprocessing techniques (training set selection, feature selection, discretization, imputation methods for missing values, among others), computational intelligence based learning algorithms, hybrid models, statistical methodologies for contrasting experiments and so forth. It allows to perform a complete analysis of new computational intelligence proposals in comparison to existing ones. Moreover, KEEL has been designed with a two-fold goal: research and educational. KEEL is also coupled with KEEL-dataset: a webpage that aims at providing to the machine learning researchers a set of benchmarks to analyze the behavior of the learning methods. Concretely, it is possible to find benchmarks already formatted in KEEL format for classification (such as standard, multi instance or imbalanced data), semi-supervised classification, regression, time series and unsupervised learning. Also, a set of low quality data benchmarks is maintained in the repository. Changes:Initial Announcement on mloss.org.
|
About: Mulan is an open-source Java library for learning from multi-label datasets. Multi-label datasets consist of training examples of a target function that has multiple binary target variables. This means that each item of a multi-label dataset can be a member of multiple categories or annotated by many labels (classes). This is actually the nature of many real world problems such as semantic annotation of images and video, web page categorization, direct marketing, functional genomics and music categorization into genres and emotions. Changes:Learners
Measures/Evaluation
Bug fixes
API changes
Miscalleneous
|
About: Hubness-aware Machine Learning for High-dimensional Data Changes:
|
About: Java package for calculating Entropy for Machine Learning Applications. It has implemented several methods of handling missing values. So it can be used as a lab for examining missing values. Changes:Discretizing numerical values is added to calculate mode of values and fractional replacement of missing ones. class diagram is on the web http://profs.basu.ac.ir/bathaeian/free_space/jemla.rar
|
About: pySPACE is the abbreviation for "Signal Processing and Classification Environment in Python using YAML and supporting parallelization". It is a modular software for processing of large data streams that has been specifically designed to enable distributed execution and empirical evaluation of signal processing chains. Various signal processing algorithms (so called nodes) are available within the software, from finite impulse response filters over data-dependent spatial filters (e.g. CSP, xDAWN) to established classifiers (e.g. SVM, LDA). pySPACE incorporates the concept of node and node chains of the MDP framework. Due to its modular architecture, the software can easily be extended with new processing nodes and more general operations. Large scale empirical investigations can be configured using simple text- configuration files in the YAML format, executed on different (distributed) computing modalities, and evaluated using an interactive graphical user interface. Changes:improved testing, improved documentation, windows compatibility, more algorithms
|