<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>mloss.org new software</title><link>http://mloss.org</link><description>Updates and additions to mloss.org</description><language>en</language><lastBuildDate>Wed, 19 Jun 2013 19:21:07 -0000</lastBuildDate><item><title>MLPACK 1.0.6</title><link>http://mloss.org/revision/view/1327/</link><description>&lt;html&gt;&lt;p&gt;MLPACK is a scalable C++ machine learning library. Its aim is to make large-scale machine learning possible for novice users by means of a simple, consistent API, while simultaneously exploiting C++ language features to provide maximum performance and maximum flexibility for expert users.
&lt;/p&gt;
&lt;p&gt;The following methods are provided:
&lt;/p&gt;
&lt;ul&gt;
 &lt;li&gt;
     Density Estimation Trees
 &lt;/li&gt;

 &lt;li&gt;
     Euclidean Minimum Spanning Trees
 &lt;/li&gt;

 &lt;li&gt;
     Fast Exact Max-Kernel Search (FastMKS)
 &lt;/li&gt;

 &lt;li&gt;
     Gaussian Mixture Models (GMMs)
 &lt;/li&gt;

 &lt;li&gt;
     Hidden Markov Models (HMMs)
 &lt;/li&gt;

 &lt;li&gt;
     Kernel Principal Components Analysis (KPCA)
 &lt;/li&gt;

 &lt;li&gt;
     K-Means Clustering
 &lt;/li&gt;

 &lt;li&gt;
     Least-Angle Regression (LARS/LASSO)
 &lt;/li&gt;

 &lt;li&gt;
     Local Coordinate Coding
 &lt;/li&gt;

 &lt;li&gt;
     Locality-Sensitive Hashing (LSH)
 &lt;/li&gt;

 &lt;li&gt;
     Naive Bayes Classifier
 &lt;/li&gt;

 &lt;li&gt;
     Neighborhood Components Analysis (NCA)
 &lt;/li&gt;

 &lt;li&gt;
     Nonnegative Matrix Factorization (NMF)
 &lt;/li&gt;

 &lt;li&gt;
     Principal Components Analysis (PCA)
 &lt;/li&gt;

 &lt;li&gt;
     RADICAL (ICA)
 &lt;/li&gt;

 &lt;li&gt;
     Rank-Approximate Nearest Neighbor (RANN)
 &lt;/li&gt;

 &lt;li&gt;
     Simple Least-Squares Linear Regression
 &lt;/li&gt;

 &lt;li&gt;
     Sparse Coding
 &lt;/li&gt;

 &lt;li&gt;
     Tree-based Neighbor Search (all-k-nearest-neighbors, all-k-furthest-neighbors), using either kd-trees or cover trees
 &lt;/li&gt;

 &lt;li&gt;
     Tree-based Range Search
 &lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Command-line executables are provided for each of these, and the C++ classes which define the methods are highly flexible, extensible, and modular.  More information (including documentation, tutorials, and bug reports) is available at http://www.mlpack.org/.
&lt;/p&gt;&lt;/html&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Ryan Curtin, James Cline, Neil Slagle, Matthew Amidon, Ajinkya Kale, Bill March, Nishant Mehta, Parikshit Ram, Dongryeol Lee, Rajendran Mohan, Trironk Kiatkungwanglai, Patrick Mason, Alexander Gray</dc:creator><pubDate>Wed, 19 Jun 2013 19:21:07 -0000</pubDate><comments>http://mloss.org/software/rss/comments/1327</comments><guid>http://mloss.org/revision/view/1327/</guid><category>gmm</category><category>hmm</category><category>machine learning</category><category>sparse</category><category>dual tree</category><category>fast</category><category>scalable</category><category>tree</category></item><item><title>SonS and MDSonS, Software for hierarchical clustering visualization V1</title><link>http://mloss.org/revision/view/1333/</link><description>&lt;html&gt;&lt;p&gt;This toolbox consist of two Matlab functions "sons.m" and "mdsons.m". To learn how to use these functions (and to interpret these methods), check the folder named "Tutorial" and click on the "HELP" html file. Two cases of study are detailed in this file. Also you can replicate the results and content of html help file. For this purpose, there are two folders named "SonS(Synthetic data set)" and "MDSon_German_Elections" in which the two cases of study proposed in the html help file are presented. To do this, you must run the programs in the following order:
&lt;/p&gt;
&lt;p&gt;1) clustering.m: The program carries out a hierarchical clustering. It saves the centroids and the labels of the data of the different hierarchies (the labels indicate the belonging to each cluster).
&lt;/p&gt;
&lt;p&gt;2) example.m: This program calls the sons.m and mdsons.m functions for visualizing the results of the clustering provided by "clustering.m". "sons.m" and "mdsons.m" functions represent only one hierarchical level, so in "example.m" there are different calls of these functions in order to represent the different hierarchies.
&lt;/p&gt;&lt;/html&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jose Maria Martinez Martinez</dc:creator><pubDate>Tue, 18 Jun 2013 12:18:05 -0000</pubDate><comments>http://mloss.org/software/rss/comments/1333</comments><guid>http://mloss.org/revision/view/1333/</guid><category>matlab</category><category>clustering</category><category>hierarchical clustering</category><category>data visualization</category></item><item><title>Darwin 1.6</title><link>http://mloss.org/revision/view/1332/</link><description>&lt;html&gt;&lt;p&gt;The framework includes a wide range of standard machine learning and graphical models algorithms as well as reference implementations for common applications particularly in computer vision. It also includes Matlab wrappers for many components of the library and an experimental GUI for designing machine learning data flows.
&lt;/p&gt;&lt;/html&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Stephen Gould</dc:creator><pubDate>Tue, 18 Jun 2013 07:39:41 -0000</pubDate><comments>http://mloss.org/software/rss/comments/1332</comments><guid>http://mloss.org/revision/view/1332/</guid><category>machine learning</category><category>graphical models</category><category>computer vision</category></item><item><title>GPstuff 4.2</title><link>http://mloss.org/revision/view/1329/</link><description>&lt;html&gt;&lt;p&gt;GPstuff is a toolbox for Bayesian Modeling with Gaussian Processes with following features and more:
&lt;/p&gt;
&lt;ul&gt;
 &lt;li&gt;
     Several covariance functions (e.g. squared exponential, exponential, Matérn, periodic and a compactly supported piece wise polynomial function)&lt;ul&gt;
 &lt;li&gt;
     Sums, products and scaling of covariance functions
 &lt;/li&gt;

 &lt;li&gt;
     Euclidean and delta distance
 &lt;/li&gt;
&lt;/ul&gt;

 &lt;/li&gt;

 &lt;li&gt;
     Several mean functions with marginalized parameters
 &lt;/li&gt;

 &lt;li&gt;
     Several likelihood/observation models&lt;ul&gt;
 &lt;li&gt;
     Continuous observations: Gaussian, Gaussian scale mixture (MCMC only), Student's-t, quantile regression
 &lt;/li&gt;

 &lt;li&gt;
     Classification: Logit, Probit, multinomial logit (softmax), multinomial probit
 &lt;/li&gt;

 &lt;li&gt;
     Count data: Binomial, Poisson, (Zero truncated) Negative-Binomial, Hurdle model, Zero-inflated Negative-Binomial, Multinomial
 &lt;/li&gt;

 &lt;li&gt;
     Survival: Cox-PH, Weibull, log-Gaussian, log-logistic
 &lt;/li&gt;

 &lt;li&gt;
     Point process: Log-Gaussian Cox process
 &lt;/li&gt;

 &lt;li&gt;
     Density estimation and regression: logistic GP
 &lt;/li&gt;

 &lt;li&gt;
     Other: derivative observations (for sexp covariance function only)
 &lt;/li&gt;
&lt;/ul&gt;

 &lt;/li&gt;

 &lt;li&gt;
     Hierarchical priors for hyperparameters
 &lt;/li&gt;

 &lt;li&gt;
     Sparse models&lt;ul&gt;
 &lt;li&gt;
     Sparse matrix routines for compactly supported covariance functions
 &lt;/li&gt;

 &lt;li&gt;
     Fully and partially independent conditional (FIC, PIC)
 &lt;/li&gt;

 &lt;li&gt;
     Compactly supported plus FIC (CS+FIC)
 &lt;/li&gt;

 &lt;li&gt;
     Variational sparse (VAR), Deterministic training conditional (DTC), Subset of regressors (SOR) (Gaussian/EP only)
 &lt;/li&gt;

 &lt;li&gt;
     PASS-GP
 &lt;/li&gt;
&lt;/ul&gt;

 &lt;/li&gt;

 &lt;li&gt;
     Latent inference&lt;ul&gt;
 &lt;li&gt;
     Exact (Gaussian only)
 &lt;/li&gt;

 &lt;li&gt;
     Laplace, Expectation propagation (EP), Parallel EP, Robust-EP
 &lt;/li&gt;

 &lt;li&gt;
     marginal posterior corrections (cm2 and fact)
 &lt;/li&gt;

 &lt;li&gt;
     Scaled Metropolis, Hamiltonian Monte Carlo (HMC), Scaled HMC, Elliptical slice sampling 
 &lt;/li&gt;
&lt;/ul&gt;

 &lt;/li&gt;

 &lt;li&gt;
     Hyperparameter inference&lt;ul&gt;
 &lt;li&gt;
     Type II ML/MAP
 &lt;/li&gt;

 &lt;li&gt;
     Leave-one-out cross-validation (LOO-CV), Laplace/EP LOO-CV
 &lt;/li&gt;

 &lt;li&gt;
     Metropolis, HMC, No-U-Turn-Sampler (NUTS), Slice Sampling (SLS), Surrogate SLS, Shrinking-rank SLS, Covariance-matching SLS
 &lt;/li&gt;

 &lt;li&gt;
     Grid, CCD, Importance sampling
 &lt;/li&gt;
&lt;/ul&gt;

 &lt;/li&gt;

 &lt;li&gt;
     Model assessment&lt;ul&gt;
 &lt;li&gt;
     LOO-CV, Laplace/EP LOO-CV, IS-LOO-CV, k-fold-CV
 &lt;/li&gt;

 &lt;li&gt;
     WAIC, DIC
 &lt;/li&gt;

 &lt;li&gt;
     Average predictive comparison
 &lt;/li&gt;
&lt;/ul&gt;

 &lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;If you use GPstuff, please use the reference:
   Jarno Vanhatalo, Jaakko Riihimäki, Jouni Hartikainen, Pasi Jylänki, Ville Tolvanen, Aki Vehtari (2013). GPstuff: Bayesian Modeling with Gaussian Processes. In Journal of Machine Learning Research, 14:1175-1179.
&lt;/p&gt;
&lt;p&gt;See also user guide at http://arxiv.org/abs/1206.5754
&lt;/p&gt;&lt;/html&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jarno Vanhatalo, Jaakko Riihimaki, Jouni Hartikainen, Pasi Jylanki, Ville Tolvanen and Aki Vehtari</dc:creator><pubDate>Mon, 17 Jun 2013 13:22:52 -0000</pubDate><comments>http://mloss.org/software/rss/comments/1329</comments><guid>http://mloss.org/revision/view/1329/</guid><category>classification</category><category>regression</category><category>machine learning</category><category>nonparametric bayes</category><category>gaussian process</category><category>bayesian inference</category></item><item><title>ClusterEval 1.0</title><link>http://mloss.org/revision/view/1331/</link><description>&lt;html&gt;&lt;p&gt;This program compares a clustering to a ground truth set of categories according to multiple different measures. It also includes a novel approach called 'Divergence from a Random Baseline' that augments existing measures to correct for ineffective clusterings. It has been used in the evaluation of clustering at the INEX XML Mining track at INEX in 2009 and 2010, and the Social Event Detection task at MediaEval in 2013.
&lt;/p&gt;&lt;/html&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">chris de vries</dc:creator><pubDate>Sun, 16 Jun 2013 04:15:30 -0000</pubDate><comments>http://mloss.org/software/rss/comments/1331</comments><guid>http://mloss.org/revision/view/1331/</guid><category>clustering</category><category>evaluation</category><category>metrics</category><category>quality</category></item><item><title>OpenOpt 0.50</title><link>http://mloss.org/revision/view/1330/</link><description>&lt;html&gt;&lt;p&gt;Universal Python-written numerical optimization toolbox. 
   Problems: NLP, LP, QP, SDP, SOCP, DFP(Non-linear Data Fit), 
   NSP(nonsmooth), MILP, LSP, LLSP, MMP, GLP, MINLP, MOP etc.
   Connects to dozens of solvers (some are C- or Fortran-written).
&lt;/p&gt;
&lt;p&gt;Provides graphic output of convergence and some more numerical
   optimization "MUST HAVE" features.
&lt;/p&gt;
&lt;p&gt;Our another tool FuncDesigner allows to involve automatic 
   differentiation, uncertainty and interval analysis, categorical variables, general logical constraints, more convenient modeling of some optimization problems, systems of (non)linear equations (solver "interalg" can find ALL roots), possibly sparse/overdetermined,  systems of ordinary differential equations and much more. 
&lt;/p&gt;&lt;/html&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Dmitrey Kroshko</dc:creator><pubDate>Sat, 15 Jun 2013 17:09:30 -0000</pubDate><comments>http://mloss.org/software/rss/comments/1330</comments><guid>http://mloss.org/revision/view/1330/</guid><category>python</category><category>optimization</category></item><item><title>AISAIC 1.0.0610</title><link>http://mloss.org/revision/view/1328/</link><description>&lt;html&gt;&lt;p&gt;AISAIC software for analyzing human DNA copy numbers and detecting significant copy number alterations
&lt;/p&gt;&lt;/html&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Guoqiang Yu, Bai Zhang, Xiguo Yuan, Xuchu Hou</dc:creator><pubDate>Thu, 13 Jun 2013 21:54:55 -0000</pubDate><comments>http://mloss.org/software/rss/comments/1328</comments><guid>http://mloss.org/revision/view/1328/</guid><category>copy number</category><category>normal cell contamination</category><category>significant copy number alterations</category></item><item><title>MultiBoost 1.2.01</title><link>http://mloss.org/revision/view/1326/</link><description>&lt;html&gt;&lt;p&gt;AdaBoost [Freund-Schapire, 1997] is one of the best off-the-shelf supervised classification methods developed in the last fifteen years. Despite (or perhaps due to?) its simplicity and versatility, it is suprisingly under-represented in the family of open softwares. The goal of this submission is to fill this gap.
&lt;/p&gt;
&lt;p&gt;Our implementation is based on the AdaBoost.MH algorithm [Schapire-Singer, 1999]. It is an intrinsically multi-class classification method (unlike SVM for example), and it was easy to extend to multi-label or multi-task classification (when one item can belong to several classes). The program package can be divided into four modules that can be changed more-or-less independently depending on the application.
&lt;/p&gt;
&lt;ol&gt;
 &lt;li&gt;&lt;p&gt;The strong learner. It tells you &lt;em&gt;how&lt;/em&gt; to boost. The main boosting engine is AdaBoost.MH, but we have also implemented FilterBoost for a research project, and Arc-GV which is basically straightforward once you have the main engine. Other possible strong learners could be LogitBoost and ADTrees. 
&lt;/p&gt;

 &lt;/li&gt;

 &lt;li&gt;&lt;p&gt;The base (or weak) learner. It tells you &lt;em&gt;what&lt;/em&gt; features to boost. Right now we have two basic (feature-wise) base learners: decision stumps for real-valued features and indicators for nominal features. We have two meta base learners: trees and products. They can use any base learner and construct a generic complex base learner using a "classic" tree-structure (decision trees), or using the product of simple base learners (self advertisement: &lt;a href="http://users.web.lal.in2p3.fr/kegl/research/PDFs/keglBusafekete09.pdf"&gt;boosting products of stumps&lt;/a&gt; is the best reported no-domain-knowledge algorithm on &lt;a href="http://yann.lecun.com/exdb/mnist/"&gt;MNIST&lt;/a&gt; after Hinton and Salakhutdinov's deep belief nets). We have also implemented Haar filters [Viola-Jones, 2004] for image classification, a meta base learner that uses stumps over a high dimensional feature space computed "on the fly". It is a nice example of a domain dependent base learner that works hand-in-hand with its appropriate data structure.
&lt;/p&gt;

 &lt;/li&gt;

 &lt;li&gt;&lt;p&gt;The data representation. The basic data structure is a matrix of observations with a vector of labels. We also have multi-label classification when the label data is also a full matrix. In addition, we have sparse data representation for both the observation matrix and the label matrix. In general, base learners are implemented to work with their own data representation (for example, sparse stumps work on sparse observation matrices, or Haar filters work on a integral image data representation). 
&lt;/p&gt;

 &lt;/li&gt;

 &lt;li&gt;&lt;p&gt;Data parser. We can read in data in arff and svmlight formats.
&lt;/p&gt;

 &lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;The base learner/data structure combinations cover a large spectrum of possible applications, but the main advantage of the package is that it is easy (for the advanced user) to adapt MultiBoost to a specific (non-standard) application by implementing the base learner and data structure interfaces that work together.
&lt;/p&gt;
&lt;p&gt;The source code is available from the website multiboost.org. It can be compiled on Mac OS X, Linux, and Microsoft Windows. The interface is command line execution with switches. 
&lt;/p&gt;&lt;/html&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">robert busa fekete, norman casagrande, balazs kegl, djalel benbouzid, francoisdavid collin</dc:creator><pubDate>Wed, 12 Jun 2013 22:34:04 -0000</pubDate><comments>http://mloss.org/software/rss/comments/1326</comments><guid>http://mloss.org/revision/view/1326/</guid><category>large scale learning</category><category>adaboost</category><category>boosting</category><category>multilabel classification</category><category>arff</category><category>multiclass classification</category><category>icml2010</category></item><item><title>Information Theoretical Estimators 0.39</title><link>http://mloss.org/revision/view/1325/</link><description>&lt;html&gt;&lt;p&gt;ITE can estimate 
&lt;/p&gt;
&lt;ul&gt;
 &lt;li&gt;&lt;p&gt;&lt;code&gt;entropy&lt;/code&gt;: Shannon entropy, Rényi entropy, Tsallis entropy (Havrda and Charvát entropy), complex entropy,
&lt;/p&gt;

 &lt;/li&gt;

 &lt;li&gt;&lt;p&gt;&lt;code&gt;mutual information&lt;/code&gt;: generalized variance, kernel canonical correlation analysis, kernel generalized variance, Hilbert-Schmidt independence criterion, Shannon mutual information, L2 mutual information, Rényi mutual information, Tsallis mutual information, copula-based kernel dependency, multivariate version of Hoeffding's Phi, complex mutual information, Cauchy-Schwartz quadratic mutual information, Euclidean distance based quadratic mutual information, distance covariance, distance correlation, approximate correntropy independence measure,
&lt;/p&gt;

 &lt;/li&gt;

 &lt;li&gt;&lt;p&gt;&lt;code&gt;divergence&lt;/code&gt;: Kullback-Leibler divergence (relative entropy, I directed divergence), L2 divergence, Rényi divergence, Tsallis divergence, Hellinger distance, Bhattacharyya distance, maximum mean discrepancy (kernel distance, an integral probability metric), J-distance (symmetrised Kullback-Leibler divergence, J divergence), Cauchy-Schwartz divergence, Euclidean distance based divergence, energy distance (specially the Cramer-Von Mises distance), Jensen-Shannon divergence, Jensen-Rényi divergence, K divergence, L divergence, certain f-divergences (Csiszár-Morimoto divergence, Ali-Silvey distance), non-symmetric Bregman distance (Bregman divergence), Jensen-Tsallis divergence, symmetric Bregman distance,
&lt;/p&gt;

 &lt;/li&gt;

 &lt;li&gt;&lt;p&gt;&lt;code&gt;association measures&lt;/code&gt;, including &lt;code&gt;measures of concordance&lt;/code&gt;: multivariate extensions of Spearman's rho (Spearman's rank correlation coefficient, grade correlation coefficient), correntropy, centered correntropy, correntropy coefficient, correntropy induced metric, centered correntropy induced metric, multivariate extension of Blomqvist's beta (medial correlation coefficient), multivariate conditional version of Spearman's rho, lower/upper tail dependence via conditional Spearman's rho,
&lt;/p&gt;

 &lt;/li&gt;

 &lt;li&gt;&lt;p&gt;&lt;code&gt;cross quantities&lt;/code&gt;: cross-entropy.
&lt;/p&gt;

 &lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;ITE offers solution methods for
&lt;/p&gt;
&lt;ul&gt;
 &lt;li&gt;
     Independent Subspace Analysis (ISA) and
 &lt;/li&gt;

 &lt;li&gt;
     its extensions to different linear-, controlled-, post nonlinear-, complex valued-, partially observed models, as well as to systems with nonparametric source dynamics.
 &lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;ITE is
&lt;/p&gt;
&lt;ul&gt;
 &lt;li&gt;
     written in Matlab/Octave,
 &lt;/li&gt;

 &lt;li&gt;
     multi-platform (tested extensively on Windows and Linux),
 &lt;/li&gt;

 &lt;li&gt;
     free and open source (released under the GNU GPLv3(&amp;gt;=) license).
 &lt;/li&gt;
&lt;/ul&gt;&lt;/html&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Zoltan Szabo</dc:creator><pubDate>Wed, 12 Jun 2013 13:12:52 -0000</pubDate><comments>http://mloss.org/software/rss/comments/1325</comments><guid>http://mloss.org/revision/view/1325/</guid><category>entropy</category><category>mutual information</category><category>divergence</category><category>independent subspace analysis</category><category>separation principles</category><category>independent process analysis</category><category>association measure</category><category>measure of concordance</category><category>measure of independence</category><category>nonparametric estimation</category></item><item><title>APCluster 1.3.2</title><link>http://mloss.org/revision/view/1324/</link><description>&lt;html&gt;&lt;p&gt;Affinity propagation (AP) is a clustering algorithm that has been introduced by Brendan J. Frey and Delbert Dueck. The authors themselves describe affinity propagation as follows:
&lt;/p&gt;
&lt;p&gt;"An algorithm that identifies exemplars among data points and forms clusters of data points around these exemplars. It operates by simultaneously considering all data point as potential exemplars and exchanging messages between data points until a good set of exemplars and clusters emerges."
&lt;/p&gt;
&lt;p&gt;AP has been applied in various fields recently, among which bioinformatics is becoming increasingly important. Frey and Dueck have made their algorithm available as Matlab code. Matlab, however, is relatively uncommon in bioinformatics. Instead, the statistical computing platform R has become a widely accepted standard in this field. In order to leverage affinity propagation for bioinformatics applications, we have implemented affinity propagation as an R package. Note, however, that the given package is in no way restricted to bioinformatics applications. It is as generally applicable as Frey’s and Dueck’s original Matlab code.
&lt;/p&gt;
&lt;p&gt;The package further implements leveraged affinity propagation, exemplar-based agglomerative clustering, and various tools for visual analysis of clustering results. 
&lt;/p&gt;&lt;/html&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Ulrich Bodenhofer, Andreas Kothmeier, Johannes Palme</dc:creator><pubDate>Wed, 12 Jun 2013 11:38:01 -0000</pubDate><comments>http://mloss.org/software/rss/comments/1324</comments><guid>http://mloss.org/revision/view/1324/</guid><category>clustering</category><category>kernels</category><category>distance function</category><category>clustering algorithm</category><category>affinity propagation</category><category>similarity measure</category></item></channel></rss>