Projects running under platform independent.
Showing Items 1-20 of 56 on page 1 of 3: 1 2 3 Next

Logo MLPACK 1.0.5

by rcurtin - May 2, 2013, 07:24:32 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 20248 views, 3566 downloads, 4 subscriptions

About: A scalable, fast C++ machine learning library, with emphasis on usability.

Changes:

Speedups of cover tree traversers; addition of rank-approximate nearest neighbor (RANN); addition of fast exact max-kernel search (FastMKS); fix for EM covariance estimation; more parameters for GMM estimation; force GMM and GaussianDistribution covariance matrices to be positive definite during training; add a tolerance parameter to the Baum-Welch algorithm for HMM training; fix for compilation with clang; fix for k-furthest neighbor search.


Logo GPstuff 4.1

by avehtari - April 25, 2013, 11:07:06 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 1585 views, 233 downloads, 1 subscription

About: The GPstuff toolbox is a versatile collection of Gaussian process models and computational tools required for inference. The tools include, among others, various inference methods, sparse approximations and model assessment methods.

Changes:

2013-04-24 Version 4.1

New features:

  • Multinomial probit classification with nested-EP. Jaakko Riihimäki, Pasi Jylänki and Aki Vehtari (2013). Nested Expectation Propagation for Gaussian Process Classification with a Multinomial Probit Likelihood. Journal of Machine Learning Research 14:75-109, 2013.
  • Marginal posterior corrections for latent values. Cseke & Heskes (2011). Approximate Marginals in Latent Gaussian Models. Journal of Machine Learning Research 12 (2011), 417-454
    • Laplace: cm2 and fact
    • EP: fact

Improvements

  • lgpdens ignores now NaNs instead of giving error
  • gp_cpred has a new option 'target' accpeting values 'f' or 'mu'
  • unified gp_waic and gp_dic
    • by default return mlpd
    • option 'form' accetps now values 'mean' 'all' 'sum' and 'dic'
  • improved survival demo demo_survival_aft (accalerated failure time)
    • renamed and improved from demo_survival_weibull
  • rearranged some files to more logical directories
  • bug fixes

New files

  • gp_predcm: marginal posterior corrections for latent values.
  • demo_improvedmarginals: demonstration of marginal posterior corrections
  • demo_improvedmarginals2: demonstration of marginal posterior corrections
  • lik_multinomprobit: multinomial probit likelihood
  • demo_multiclass_nested_ep: demonstration of nested EP with multinomprobit

Logo APCluster 1.3.1

by UBod - April 23, 2013, 08:53:15 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 8803 views, 1609 downloads, 1 subscription

Rating Whole StarWhole StarWhole StarWhole Star1/2 Star
(based on 2 votes)

About: The apcluster package implements Frey's and Dueck's Affinity Propagation clustering in R. The package further provides leveraged affinity propagation, exemplar-based agglomerative clustering, and various tools for visual analysis of clustering results.

Changes:
  • re-implementation of heatmap() method: dendrograms can now be plotted even for APResult and ExClust objects as well as for cluster hierarchies based on prior clusterings; color bars can now be switched off and colors can be changed by user (by new 'sideColor' argument); dendrograms can be switched on and off (by 'Rowv' and 'Colv' arguments);
  • added as.hclust() and as.dendrogram() methods
  • added new arguments 'base', 'showSamples', and 'horiz' to the plot() method with signature (x="AggExResult", y="missing"); moreover, parameters for changing the appearance of the height axis are now respected as well
  • streamlining of methods (redundant definition of inherited methods removed)
  • various minor improvements of code and documentation

Logo MICP 1.04

by kay_brodersen - March 26, 2013, 12:42:04 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 2588 views, 477 downloads, 2 subscriptions

About: This toolbox implements models for Bayesian mixed-effects inference on classification performance in hierarchical classification analyses.

Changes:

In addition to the existing MATLAB implementation, the toolbox now also contains an R package of the variational Bayesian algorithm for mixed-effects inference.


Logo cbMDS Correlation Based Multi Dimensional Scaling 1.1

by emstrick - March 11, 2013, 11:47:39 CET [ BibTeX BibTeX for corresponding Paper Download ] 865 views, 223 downloads, 1 subscription

About: The aim is to embed a given data relationship matrix into a low-dimensional Euclidean space such that the point distances / distance ranks correlate best with the original input relationships. Input relationships may be given as (asymmetric) distances, dissimilarities, or (negative) scores. Input-output relations are modelled as row-conditioned. (Weighted) Pearson and soft Spearman rank correlation, and unweighted soft Kendall correlation are supported correlation measures for input/output object neighborhood relationships.

Changes:
  • Initial release (Ver 1.0): Weighted Pearson and correlation and soft Spearman rank correlation, Tue Dec 4 16:14:51 CET 2012

  • Ver 1.1 Added soft Kendall correlation, Fri Mar 8 08:41:09 CET 2013


Logo JMLR GPML Gaussian Processes for Machine Learning Toolbox 3.2

by hn - January 21, 2013, 15:34:50 CET [ Project Homepage BibTeX Download ] 9706 views, 2705 downloads, 3 subscriptions

Rating Whole StarWhole StarWhole StarWhole StarWhole Star
(based on 2 votes)

About: The GPML toolbox is a flexible and generic Octave 3.2.x and Matlab 7.x implementation of inference and prediction in Gaussian Process (GP) models.

Changes:

We now support inference on large datasets using the FITC approximation for non-Gaussian likelihoods for EP and Laplace's approximation. New likelihood functions: mixture likelihood, Poisson likelihood, label noise. We added two MCMC samplers.


Logo LIBOL 0.1.0

by stevenhoi - December 27, 2012, 18:09:54 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 2118 views, 132 downloads, 1 subscription

About: LIBOL is an open-source library that consists of a family of state-of-the-art online learning algorithms for machine learning and data mining research.

Changes:

Initial Announcement on mloss.org.


Logo FABIA 2.4.0

by hochreit - December 20, 2012, 14:20:58 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 5357 views, 1061 downloads, 1 subscription

Rating Whole StarWhole StarWhole StarWhole Star1/2 Star
(based on 1 vote)

About: FABIA is a biclustering algorithm that clusters rows and columns of a matrix simultaneously. Consequently, members of a row cluster are similar to each other on a subset of columns and, analogously, members of a column cluster are similar to each other on a subset of rows. Biclusters are found by factor analysis where both the factors and the loading matrix are sparse. FABIA is a multiplicative model that extracts linear dependencies between samples and feature patterns. Applications include detection of transcriptional modules in gene expression data and identification of haplotypes/>identity by descent< consisting of rare variants obtained by next generation sequencing.

Changes:

CHANGES IN VERSION 2.4.0

o spfabia bugfixes

CHANGES IN VERSION 2.3.1

NEW FEATURES

o Getters and setters for class Factorization

2.0.0:

  • spfabia: fabia for a sparse data matrix (in sparse matrix format) and sparse vector/matrix computations in the code to speed up computations. spfabia applications: (a) detecting >identity by descent< in next generation sequencing data with rare variants, (b) detecting >shared haplotypes< in disease studies based on next generation sequencing data with rare variants;
  • fabia for non-negative factorization (parameter: non_negative);
  • changed to C and removed dependencies to Rcpp;
  • improved update for lambda (alpha should be smaller, e.g. 0.03);
  • introduced maximal number of row elements (lL);
  • introduced cycle bL when upper bounds nL or lL are effective;
  • reduced computational complexity;
  • bug fixes: (a) update formula for lambda: tighter approximation, (b) corrected inverse of the conditional covariance matrix of z;

1.4.0:

  • New option nL: maximal number of biclusters per row element;
  • Sort biclusters according to information content;
  • Improved and extended preprocessing;
  • Update to R2.13

Logo ELKI 0.5.5

by erich - December 14, 2012, 18:49:58 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 4188 views, 748 downloads, 2 subscriptions

About: ELKI is a framework for implementing data-mining algorithms with support for index structures, that includes a wide variety of clustering and outlier detection methods.

Changes:

This is mostly a bug fix release. A lot of small issues have been fixed that improve performance, make error reporting a lot better, ease the use of sparse vectors and external precomputed distances, for example.

This will be the last ELKI release to support Java 6. The next ELKI release will require Java 7.

Algorithms

  • Some new LOF variants (LDF, SimpleLOF, SimpleKernelDensityLOF)
  • Correlation Outlier Probabilities (ICDM 2012)
  • A naive mean-shift clustering
  • Single-link clustering (SLINK algorithm) should be significantly faster due to optimized data structures
  • "Benchmarking" algorithms for measuring the performance of index structures

Index layer

  • Bulk loading R-Trees should be faster - in particular Sort Tile Recursive can work very well.
  • M-Trees have been refactored and optimized for double distances

Database layer

  • Bundle format (work in progress): low-level binary format for fast data exchange
  • DBID and DataStore layer received some additional classes for further performance improvements
  • KNN heap structures were revisited. The code is less clean now, but performs better in benchmarks.

Visualizations

  • General clean up and API simplifications
  • Some additional modules and improvements

Various

  • There is a new parameter class, RandomParameter
  • Some new distributions were added, also to the data set generator.

Tutorials

  • The website has new tutorials, including one on a k-means variation that produces equal sized clusters.

Logo gensim 0.8.6

by Radim - December 9, 2012, 13:15:16 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 10570 views, 2112 downloads, 1 subscription

About: Python Framework for Vector Space Modelling that can handle unlimited datasets (streamed input, online algorithms work incrementally in constant memory).

Changes:
  • added the "hashing trick" (by Homer Strong)
  • support for adding target classes in SVMlight format (by Corrado Monti)
  • fixed problems with global lemmatizer object when running in parallel on Windows
  • parallelization of Wikipedia processing + added script version that lemmatizes the input documents
  • added class method to initialize Dictionary from an existing corpus (by Marko Burjek)

About: Stochastic neighbor embedding aims at the reconstruction of given distance, dissimilarity, or score neighborhood relations in a low-dimensional Euclidean space. This can be regarded as general approach to multi-dimensional scaling, but the reconstruction is based on the definition of input (and output) neighborhood probability alone. Probability of score exceedance is used for neighborhood probability estimation, which is connected to soft-rank optimization. The present implementation makes use of quasi 2nd order gradient-based (l-)BFGS optimization.

Changes:
  • scoretoprob.m replaced by d2p.m

  • protein score data set added

  • trank.m computes (mid/max -tied) ranks along columns of matrix

  • local P- neighborhood probability estimation added

  • experimental soft_rank_SNE added for minimizing KL between probabilities of exceedance in source and embedding space

  • symmetry option removed, because this was strange in previous version


Logo PLEASD 0.1

by heroesneverdie - September 10, 2012, 03:53:26 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 974 views, 156 downloads, 1 subscription

About: PLEASD: A Matlab Toolbox for Structured Learning

Changes:

Initial Announcement on mloss.org.


Logo libmind alpha 1

by neuromancer - September 4, 2012, 04:30:57 CET [ Project Homepage BibTeX Download ] 609 views, 133 downloads, 1 subscription

About: A general purpose library to process and predict sequences of elements using echo state networks.

Changes:

Initial Announcement on mloss.org.


Logo JMLR Mulan 1.4.0

by lefman - August 1, 2012, 09:49:21 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 9662 views, 4625 downloads, 1 subscription

About: Mulan is an open-source Java library for learning from multi-label datasets. Multi-label datasets consist of training examples of a target function that has multiple binary target variables. This means that each item of a multi-label dataset can be a member of multiple categories or annotated by many labels (classes). This is actually the nature of many real world problems such as semantic annotation of images and video, web page categorization, direct marketing, functional genomics and music categorization into genres and emotions.

Changes:

Learners

  • BinaryRelevance.java: improved data handling that avoids copying the entire input space, leading to important speedups in case of large datasets and very large number of labels.
  • RAkEL.java: updated technical information, added a check for the case where the number of labels is less or equal than the size of the subset.
  • MultiLabelKNN.java: now checks whether the number of instances is less than the number of requested nearest neighbors.
  • Addition of AdaBoostMH.java, an explicit implementation of AdaBoost.MH as combination of AdaBoostM1 and IncludeLabelsClassifier.
  • Addition of MLPTO.java, the Multi Label Probabilistic Threshold Optimizer (MLTPTO) thresholding technique.
  • Addition of ApproximateExampleBasedFMeasureOptimizer.java, an approximate method for the maximization of example-based F-measure.

Measures/Evaluation

  • Addition of Specificity measure (example-based, micro/macro label-based)
  • Addition of Mean Average Interpolated Precision (MAiP), Geometric Mean Average Precision (GMAP), Geometric Mean Average Interpolated Precision (GMAiP).
  • New methods for stratified multi-label evaluation.
  • Added support for outputting per label results for all measures that implement the MacroAverageMeasure interface.
  • Simplifying the "strictness" issue of information retrieval measures, by adopting specific assumptions (outlined in the new class InformationRetrievalMeasures.java) to handle special cases, instead of the less clear and useful solution of outputting NaN and the less realistic solution or ignoring special cases.

Bug fixes

  • Bug fix in LabelsBuilder.java.
  • Bug fix in Ranker.java.
  • Bug-fix in ThresholdPrediction.java.
  • Fix for bug occurring when loading the XSD for mulan data outside the command-line environment (e.g. web applications).
  • Javadoc comment updates.

API changes

  • Upgrade to Java 1.6
  • Upgrade to JUnit 4.10
  • Upgrade to Weka 3.7.6.

Miscellaneous

  • Meaningful messages are now shown when a DataLoadException is thrown.
  • PT6(PT6Transformation.java): renamed to IncludeLabelsTransformation.java.
  • MultiLabelInstances now support serialization, as needed by the improved binary relevance transformation.
  • BinaryRelevanceAttributeEvaluator.java: updated according to latest BR improvements.

Logo JMLR Jstacs 2.0

by keili - July 30, 2012, 13:31:02 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 9520 views, 2094 downloads, 2 subscriptions

About: A Java framework for statistical analysis and classification of biological sequences

Changes:

February 2, 2012: Jstacs 2.0 released

Jstacs 2.0 changes many names and the structure of several packages. It is not code-compatible with Jstacs 1.5 and earlier

RESTRUCTURING and RENAMING:

former ScoringFunction, NormalizableScoringFunction, Model

  • new base-interface SequenceScore
  • new sub-interface StatisticalModel of SequenceScore for all statistical models with sub-iterfaces DifferentiableStatisticalModel and TrainableStatisticalModel
  • new interface DifferentiableSequenceScore replaces ScoringFunction
  • new interface DifferentiableStatisticalModel replaces NormalizableScoringFunction
  • new interface TrainableStatisticalModel replaces Model
  • new abstract class AbstractDifferentiableSequenceScore
  • new abstract class AbstractDifferentiableStatisticalModel replaces AbstractNormalizableScoringFunction
  • new abstract class AbstractTrainableStatisticalModel replaces AbstractModel
  • former Models renamed to TrainSM
  • former ScoringFunction renamed to DiffSS or DiffSM
  • getProbFor removed from TrainableStatisticalModel (former Model) and conceptually replaced by getLogProbFor
  • getLogScore(Sequence,int,int) with changed meaning of arguments: getLogScore(Sequence,start,end) instead of getLogScore(Sequence,start,length)
  • isTrained() replaced by common method isInitialized()

Parameters and Results

  • new super-class of Parameters and Results: AnnotatedEntity
  • common list-type for Parameters and Results: AnnotatedEntityList
  • Renaming: CollectionParameter -> SelectionParameter, MultiSelectionCollectionParameter -> MultiSelectionParameter, new super-class AbstractSelectionParameter
  • major refactoring due to common hierarchy and code-cleanup
  • lazy evaluation of Parameter/ParameterSet hierarchies moved from ParameterSet (loadParameters()) to ParameterSetContainer (constructor on class)
  • SubclassFinder adapted to lazy evaluation

performance measures

  • new abstract super-class AbstractPerformanceMeasure of all performance measures
  • new interface NumericalPerformanceMeasure for all performance measures that return a single number (as opposed, e.g., to curves)
  • new class PerformanceMeasureParameterSet for a collection of general performance measures
  • new class NumericalPerformanceMeasureParameterSet for a collection of NumericalPerformanceMeasures
  • used in evaluate-method of AbstractClassifier and in ClassifierAssessments

further changes

  • Sample renamed to DataSet
  • evaluate and evaluateAll in AbstractClassifier joined
  • new class IndependentProductDiffSS as super-class of IndepedentProductDiffSM (former IndependentProductScoringFunction)
  • new class UniformDiffSS as super-class of UniformDiffSM (former UniformScoringFunction)

NEW FUNCTIONALITY:

  • multi-threaded implementation of Baum-Welch and Viterbi training of hidden Markov models
  • new Interface Singleton that can be used for singleton instances to save memory, current examples: DNAAlphabet, DNAAlphabetContainer, ProteinAlphabet
  • added ProteinAlphabet
  • added possibility to use NaN-values with ContinuousAlphabets
  • added ArbitraryFloatSequence including static methods for DataSet creation for cases where double-precision is not needed
  • new performance measure MaximumFMeasure
  • access to Parameters in ParameterSets and Results in ResultSets by name
  • emitDataSet in BayesianNetworkDiffSM
  • new static method Time.getTimeInstance that returns UserTime or RealTime depending on availability of shared lib
  • SubclassFinder allows for adding own base packages
  • new method overlaps() in LocatedSequenceAnnotationWithLength
  • AbstractTerminationCondition used in ScoreClassifier and sub-classes
  • public method propagateESS in HMMFactory
  • new method generateLog in DirichletMRG for drawing log-values
  • added DifferentiableStatisticalModelFactory

BUGFIXES/IMPROVEMENTS:

  • bugfix in propagation of equivalent sample size in HMMFactory
  • bugfix in random initialization of BasicHigherOrderTransition
  • improved Alignment implementation
  • SafeOutputStream with new static factory method getSafeOutputStream, write methods now work on Objects

DOCUMENTATION:

  • improved Javadocs in many classes and packages
  • new Cookbook with extensive documentation and explanation

MISC:

  • output of NonParsableException more verbose
  • Exceptions in multi-threaded code now lead to exit of program instead of only stopping the thread
  • update of RServe/RClient

Logo MLWizard 5.2

by remat - July 26, 2012, 15:04:14 CET [ Project Homepage BibTeX Download ] 1617 views, 350 downloads, 1 subscription

About: MLwizard recommends and optimizes classification algorithms based on meta-learning and is a software wizard fully integrated into RapidMiner but can be used as library as well.

Changes:

Faster parameter optimization using genetic algorithm with predefined start population.


Logo SVM with uncertain labels 0.2

by rflamary - July 17, 2012, 11:06:23 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 2917 views, 567 downloads, 2 subscriptions

About: Matlab code for learning probabilistic SVM in the presence of uncertain labels.

Changes:

Added missing dataset function (thanks to Hao Wu)


Logo pymaBandits 1.0

by garivier - July 6, 2012, 18:32:41 CET [ BibTeX Download ] 2316 views, 370 downloads, 1 subscription

About: This package contains a python and a matlab implementation of the most widely used algorithms for multi-armed bandit problems. The purpose of this package is to provide simple environments for comparison and numerical evaluation of policies.

Changes:

Initial Announcement on mloss.org.


About: The package provides a Lagrangian approach to the posterior regularization of given linear mappings. This is important in two cases, (a) when systems are under-determined and (b) when the external model for calculating the mapping is invariant to properties such as scaling. The software may be applied in cases when the external model does not provide its own regularization strategy. In addition, the package allows to rank attributes according to their distortion potential to a given linear mapping.

Changes:

Version 1.1 (May 23, 2012) memory and time optimizations distderivrel.m now supports assessing the relevance of attribute pairs

Version 1.0 (Nov 9, 2011) * Initial Announcement on mloss.org.


Logo Partition Comparison 1.0

by andres - April 21, 2012, 03:26:47 CET [ Project Homepage BibTeX Download ] 1013 views, 240 downloads, 1 subscription

About: Fast C++ implementation of the variation of information (Meila 2003) and Rand index (Rand 1971) with MATLAB mex files

Changes:

Initial Announcement on mloss.org.


Showing Items 1-20 of 56 on page 1 of 3: 1 2 3 Next