Projects supporting the svmlight data format.


Logo JMLR dlib ml 18.10

by davis685 - August 29, 2014, 02:56:23 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 82615 views, 14316 downloads, 2 subscriptions

About: This project is a C++ toolkit containing machine learning algorithms and tools for creating complex software in C++ to solve real world problems.

Changes:

In addition to a number of usability improvements, this release adds an implementation of the recent paper "One Millisecond Face Alignment with an Ensemble of Regression Trees" by Vahid Kazemi and Josephine Sullivan. This includes tools for performing high quality face landmarking as well as tools for training new landmarking models. See the face_landmark_detection_ex.cpp and train_shape_predictor_ex.cpp example programs for an introduction.


Logo Harry 0.3

by konrad - July 30, 2014, 16:15:26 CET [ Project Homepage BibTeX Download ] 1831 views, 401 downloads, 2 subscriptions

About: A Tool for Measuring String Similarity

Changes:

This new release implements 21 similarity measures for strings (Option -M). It supports splitting the computation of large similarity matrices into blocks and thus allows comparing large sets of strings (Option -s as well as -x and -y). The command-line interface has been improved and several minor bugs have been fixed.


Logo Boosted Decision Trees and Lists 1.0.4

by melamed - July 25, 2014, 23:08:32 CET [ BibTeX Download ] 2766 views, 843 downloads, 3 subscriptions

About: Boosting algorithms for classification and regression, with many variations. Features include: Scalable and robust; Easily customizable loss functions; One-shot training for an entire regularization path; Continuous checkpointing; much more

Changes:
  • added ElasticNets as a regularization option
  • fixed some segfaults, memory leaks, and out-of-range errors, which were creeping in in some corner cases
  • added a couple of I/O optimizations

Logo JMLR Sally 0.9.0

by konrad - July 1, 2014, 22:43:51 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 20255 views, 4119 downloads, 2 subscriptions

About: A Tool for Embedding Strings in Vector Spaces

Changes:

Support for hash-based dimension reduction: simhash, minhash and Bloom filter. Support for several n-gram variants: regular, sorted, positional and blended n-grams. Simplified configuration.


Logo JMLR MultiBoost 1.2.02

by busarobi - March 31, 2014, 16:13:04 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 23301 views, 4108 downloads, 1 subscription

About: MultiBoost is a multi-purpose boosting package implemented in C++. It is based on the multi-class/multi-task AdaBoost.MH algorithm [Schapire-Singer, 1999]. Basic base learners (stumps, trees, products, Haar filters for image processing) can be easily complemented by new data representations and the corresponding base learners, without interfering with the main boosting engine.

Changes:

Major changes :

  • The “early stopping” feature can now based on any metric output with the --outputinfo command line argument.

  • Early stopping now works with --slowresume command line argument.

Minor fixes:

  • More informative output when testing.

  • Various compilation glitch with recent clang (OsX/Linux).


Logo JMLR SHOGUN 3.2.0

by sonne - February 17, 2014, 20:31:36 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 84886 views, 11787 downloads, 5 subscriptions

Rating Whole StarWhole StarWhole StarEmpty StarEmpty Star
(based on 6 votes)

About: The SHOGUN machine learning toolbox's focus is on large scale learning methods with focus on Support Vector Machines (SVM), providing interfaces to python, octave, matlab, r and the command line.

Changes:

This is mostly a bugfix release:

Features

  • Fully support python3 now
  • Add mini-batch k-means [Parijat Mazumdar]
  • Add k-means++ [Parijat Mazumdar]
  • Add sub-sequence string kernel [lambday]

Bugfixes

  • Compile fixes for upcoming swig3.0
  • Speedup for gaussian process' apply()
  • Improve unit / integration test checks
  • libbmrm uninitialized memory reads
  • libocas uninitialized memory reads
  • Octave 3.8 compile fixes [Orion Poplawski]
  • Fix java modular compile error [Bjoern Esser]

Logo LIBOL 0.3.0

by stevenhoi - December 12, 2013, 15:26:14 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 7605 views, 2353 downloads, 2 subscriptions

About: LIBOL is an open-source library with a family of state-of-the-art online learning algorithms for machine learning and big data analytics research. The current version supports 16 online algorithms for binary classification and 13 online algorithms for multiclass classification.

Changes:

In contrast to our last version (V0.2.3), the new version (V0.3.0) has made some important changes as follows:

• Add a template and guide for adding new algorithms;

• Improve parameter settings and make documentation clear;

• Improve documentation on data formats and key functions;

• Amend the "OGD" function to use different loss types;

• Fixed some name inconsistency and other minor bugs.


Logo KMLib sparse GPU SVM 0.1

by ksopyla - March 20, 2013, 14:30:08 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 1691 views, 416 downloads, 1 subscription

About: Support Vectors Machine library in .net with CUDA support. Library includes GPU SVM solver for kernels linear,RBF,Chi-Square and Exp Chi-Square which use NVIDIA CUDA technology. It allows for classification of feature rich sparse datasets through utilization of sparse matrix formats CSR, Ellpack-R or Sliced EllR-T

Changes:

Initial Announcement on mloss.org.


Logo pGBRT, Parallel Gradient Boosted Regression Trees 0.9

by swtyree - September 16, 2011, 22:15:46 CET [ Project Homepage BibTeX Download ] 6644 views, 1034 downloads, 1 subscription

About: Learns gradient boosted regression tree ensembles in parallel on shared memory or cluster systems

Changes:

Initial Announcement on mloss.org.


Logo mldata-utils 0.5.0

by sonne - April 8, 2011, 10:02:44 CET [ Project Homepage BibTeX Download ] 19340 views, 4061 downloads, 1 subscription

About: Tools to convert datasets from various formats to various formats, performance measures and API functions to communicate with mldata.org

Changes:
  • Change task file format, such that data splits can have a variable number items and put into up to 256 categories of training/validation/test/not used/...
  • Various bugfixes.

Logo redsvd 0.1.0

by hillbig - August 30, 2010, 18:13:55 CET [ Project Homepage BibTeX Download ] 3838 views, 823 downloads, 1 subscription

About: redsvd is a library for solving several matrix decomposition (SVD, PCA, eigen value decomposition) redsvd can handle very large matrix efficiently, and optimized for a truncated SVD of sparse matrices. For example, redsvd can compute a truncated SVD with top 20 singular values for a 100K x 100K matrix with 10M nonzero entries in about two second.

Changes:

Initial Announcement on mloss.org.


Logo sofia ml 0.1

by dsculley - December 29, 2009, 23:30:58 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 4927 views, 897 downloads, 0 comments, 1 subscription

About: A fast implementation of several stochastic gradient descent learners for classification, ranking, and ROC area optimization, suitable for large, sparse data sets. Includes Pegasos SVM, SGD-SVM, Passive-Aggressive Perceptron, Perceptron with Margins, Logistic Regression, and ROMMA. Commandline utility and API libraries are provided.

Changes:

Initial Announcement on mloss.org.


Logo Elefant 0.4

by kishorg - October 17, 2009, 08:48:19 CET [ Project Homepage BibTeX Download ] 16829 views, 7370 downloads, 2 subscriptions

Rating Whole StarWhole Star1/2 StarEmpty StarEmpty Star
(based on 2 votes)

About: Elefant is an open source software platform for the Machine Learning community licensed under the Mozilla Public License (MPL) and developed using Python, C, and C++. We aim to make it the platform [...]

Changes:

This release contains the Stream module as a first step in the direction of providing C++ library support. Stream aims to be a software framework for the implementation of large scale online learning algorithms. Large scale, in this context, should be understood as something that does not fit in the memory of a standard desktop computer.

Added Bundle Methods for Regularized Risk Minimization (BMRM) allowing to choose from a list of loss functions and solvers (linear and quadratic).

Added the following loss classes: BinaryClassificationLoss, HingeLoss, SquaredHingeLoss, ExponentialLoss, LogisticLoss, NoveltyLoss, LeastMeanSquareLoss, LeastAbsoluteDeviationLoss, QuantileRegressionLoss, EpsilonInsensitiveLoss, HuberRobustLoss, PoissonRegressionLoss, MultiClassLoss, WinnerTakesAllMultiClassLoss, ScaledSoftMarginMultiClassLoss, SoftmaxMultiClassLoss, MultivariateRegressionLoss

Graphical User Interface provides now extensive documentation for each component explaining state variables and port descriptions.

Changed saving and loading of experiments to XML (thereby avoiding storage of large input data structures).

Unified automatic input checking via new static typing extending Python properties.

Full support for recursive composition of larger components containing arbitrary statically typed state variables.


Logo Dirichlet Forest LDA 0.1.1

by davidandrzej - July 16, 2009, 21:59:53 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 4567 views, 887 downloads, 1 subscription

About: This software implements the Dirichlet Forest (DF) Prior within the Latent Dirichlet Allocation (LDA) model. When combined with LDA, the Dirichlet Forest Prior allows the user to encode domain knowledge (must-links and cannot-links between words) into the prior on topic-word multinomials.

Changes:

Initial Announcement on mloss.org.


Logo LibSGDQN 1.1

by antojne - July 2, 2009, 15:02:44 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 6566 views, 1310 downloads, 1 subscription

About: LibSGDQN proposes an implementation of SGD-QN, a carefully designed quasi-Newton stochastic gradient descent solver for linear SVMs.

Changes:

small bug fix (thx nicolas ;)


Logo OLaRankGreedy 1.0

by antojne - June 24, 2009, 17:07:57 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 4128 views, 899 downloads, 1 subscription

About: OLaRankGreedy is an online solver of the dual formulation of support vector machines for sequence labeling using greedy inference.

Changes:

Initial Announcement on mloss.org.


Logo OLaRankExact 1.0

by antojne - June 24, 2009, 17:03:48 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 3815 views, 918 downloads, 1 subscription

About: OLaRank is an online solver of the dual formulation of support vector machines for sequence labeling using viterbi decoding.

Changes:

Initial Announcement on mloss.org.


Logo BMRM 2.1

by chteo - May 8, 2009, 08:08:20 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 5640 views, 1095 downloads, 1 subscription

About: BMRM is an open source, modular and scalable convex solver for many machine learning problems cast in the form of regularized risk minimization problem.

Changes:

Initial Announcement on mloss.org.


Logo CoFiRank 0.1

by alexis - March 30, 2009, 17:17:34 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 4947 views, 995 downloads, 2 subscriptions

About: CoFiRank is a Collaborative Filtering system based on matrix factorization. CoFiRank is based on the idea that it is better to predict the relative order of preferences (ranking) instead of the absolute rating.

Changes:

Initial Announcement on mloss.org.