Projects supporting the svmlight data format.
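
For orientation, the svmlight format stores one example per line: a label followed by sparse index:value pairs, optionally ending with a # comment. The following is a minimal Python sketch of a reader for that basic layout. The function name parse_svmlight_line is illustrative and is not taken from any project listed here; ranking-style qid: fields are skipped rather than interpreted.

```python
def parse_svmlight_line(line):
    """Parse one svmlight-style line into (label, {index: value})."""
    # Drop an optional trailing comment introduced by '#'.
    body = line.split("#", 1)[0].strip()
    if not body:
        return None  # blank or comment-only line
    tokens = body.split()
    label = float(tokens[0])
    features = {}
    for token in tokens[1:]:
        key, _, value = token.partition(":")
        if key == "qid":
            continue  # query id used by ranking data; ignored in this sketch
        features[int(key)] = float(value)
    return label, features


example = "+1 3:0.5 7:1.25 # first document"
print(parse_svmlight_line(example))
```

Keeping the features in a dictionary mirrors the sparse nature of the format: indices that do not appear on a line are implicitly zero.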


Harry 0.3.1

by konrad - October 22, 2014, 13:00:57 CET [ Project Homepage BibTeX Download ] 2005 views, 426 downloads, 2 subscriptions

About: A Tool for Measuring String Similarity

Changes:

This release features several runtime improvements. Moreover, support for Soundex transformations and output modules for Matlab and JSON have been added. The distribution package also contains a new tutorial with examples.
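
As background on the Soundex transformation mentioned above, here is a short Python sketch of the classic American Soundex code (first letter plus three digits). It is an independent illustration, not Harry's implementation, and Harry may use a different variant; the function name soundex is an assumption made for this example.

```python
def soundex(word):
    """Return the 4-character American Soundex code of a word."""
    codes = {}
    for letters, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                           ("L", "4"), ("MN", "5"), ("R", "6")):
        for ch in letters:
            codes[ch] = digit

    letters = [c for c in word.upper() if c.isalpha()]
    if not letters:
        return ""

    result = letters[0]                # the first letter is kept as-is
    prev = codes.get(letters[0], "")   # its code still suppresses repeats
    for ch in letters[1:]:
        if ch in "HW":
            continue                   # H and W do not separate equal codes
        code = codes.get(ch, "")       # vowels map to "" and reset `prev`
        if code and code != prev:
            result += code
        prev = code
    return (result + "000")[:4]        # pad or truncate to four characters


for name in ("Robert", "Rupert", "Ashcraft", "Tymczak"):
    print(name, soundex(name))
```

Names that sound alike, such as "Robert" and "Rupert", receive the same code, which is what makes the transformation useful as a preprocessing step before comparing strings.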


dlib ml 18.10 (JMLR)

by davis685 - August 29, 2014, 02:56:23 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 83040 views, 14358 downloads, 2 subscriptions

About: This project is a C++ toolkit containing machine learning algorithms and tools for creating complex software in C++ to solve real world problems.

Changes:

In addition to a number of usability improvements, this release adds an implementation of the recent paper "One Millisecond Face Alignment with an Ensemble of Regression Trees" by Vahid Kazemi and Josephine Sullivan. This includes tools for performing high quality face landmarking as well as tools for training new landmarking models. See the face_landmark_detection_ex.cpp and train_shape_predictor_ex.cpp example programs for an introduction.


Boosted Decision Trees and Lists 1.0.4

by melamed - July 25, 2014, 23:08:32 CET [ BibTeX Download ] 2806 views, 856 downloads, 3 subscriptions

About: Boosting algorithms for classification and regression, with many variations. Features include scalability and robustness, easily customizable loss functions, one-shot training for an entire regularization path, continuous checkpointing, and much more.

Changes:
  • added ElasticNets as a regularization option
  • fixed some segfaults, memory leaks, and out-of-range errors that occurred in some corner cases
  • added a couple of I/O optimizations

Sally 0.9.0 (JMLR)

by konrad - July 1, 2014, 22:43:51 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 20427 views, 4132 downloads, 2 subscriptions

About: A Tool for Embedding Strings in Vector Spaces

Changes:

Support for hash-based dimension reduction: simhash, minhash and Bloom filter. Support for several n-gram variants: regular, sorted, positional and blended n-grams. Simplified configuration.
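
To illustrate the general idea of n-gram embeddings with reduced dimensionality, the sketch below extracts regular or sorted character n-grams and counts them into a sparse vector of fixed size. It uses plain feature hashing as a stand-in; this is not simhash, minhash, or a Bloom filter, and it is not Sally's implementation. It also assumes that "sorted" means the symbols inside each n-gram are sorted. All function names are hypothetical.

```python
import zlib
from collections import Counter


def char_ngrams(text, n, sort_symbols=False):
    """Return all character n-grams of `text`, optionally sorted internally."""
    grams = [text[i:i + n] for i in range(len(text) - n + 1)]
    if sort_symbols:
        grams = ["".join(sorted(g)) for g in grams]
    return grams


def hashed_embedding(text, n=3, dims=1 << 10, sort_symbols=False):
    """Map a string to a sparse {index: count} vector with `dims` dimensions."""
    counts = Counter()
    for gram in char_ngrams(text, n, sort_symbols):
        # crc32 is deterministic across runs, unlike Python's built-in hash().
        index = zlib.crc32(gram.encode("utf-8")) % dims + 1
        counts[index] += 1
    return dict(counts)


vec = hashed_embedding("hello world", n=3, dims=1024)
# Print the vector as sparse index:value pairs.
print(" ".join(f"{i}:{v}" for i, v in sorted(vec.items())))
```

Indices start at 1 so that the output can be written directly as index:value pairs; distinct n-grams may collide in the same dimension, which is the price of a fixed-size vector.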


MultiBoost 1.2.02 (JMLR)

by busarobi - March 31, 2014, 16:13:04 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 23450 views, 4122 downloads, 1 subscription

About: MultiBoost is a multi-purpose boosting package implemented in C++. It is based on the multi-class/multi-task AdaBoost.MH algorithm [Schapire-Singer, 1999]. Basic base learners (stumps, trees, products, Haar filters for image processing) can be easily complemented by new data representations and the corresponding base learners, without interfering with the main boosting engine.

Changes:

Major changes:

  • The “early stopping” feature can now be based on any metric output with the --outputinfo command line argument.

  • Early stopping now works with the --slowresume command line argument.
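
The general idea of early stopping on a monitored metric can be sketched in a few lines of Python. This is a generic patience-based rule and only an assumption about how such a feature might work; MultiBoost's actual stopping criterion is not described here, and the function name and the patience parameter are hypothetical.

```python
def early_stopping_iteration(metric_values, patience=3, minimize=True):
    """Return (best_index, stop_index) for a sequence of validation metrics."""
    best_index = None
    best_value = None
    for i, value in enumerate(metric_values):
        improved = (
            best_value is None
            or (value < best_value if minimize else value > best_value)
        )
        if improved:
            best_index, best_value = i, value
        elif i - best_index >= patience:
            return best_index, i  # no improvement for `patience` iterations
    # The sequence ended before the patience budget ran out.
    return best_index, len(metric_values) - 1


errors = [0.40, 0.31, 0.27, 0.28, 0.29, 0.30, 0.26]
print(early_stopping_iteration(errors, patience=3))  # (2, 5)
```

In the example, the error stops improving after iteration 2, so training would halt at iteration 5 and the later improvement at iteration 6 is never seen. A larger patience trades extra iterations for a lower risk of stopping too soon.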

Minor fixes:

  • More informative output when testing.

  • Fixed various compilation glitches with recent clang (OS X/Linux).


SHOGUN 3.2.0 (JMLR)

by sonne - February 17, 2014, 20:31:36 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 85191 views, 11813 downloads, 5 subscriptions

Rating: 3 out of 5 stars (based on 6 votes)

About: The SHOGUN machine learning toolbox focuses on large-scale learning methods, in particular Support Vector Machines (SVM), and provides interfaces to Python, Octave, MATLAB, R and the command line.

Changes:

This is mostly a bugfix release:

Features

  • Fully support python3 now
  • Add mini-batch k-means [Parijat Mazumdar]
  • Add k-means++ [Parijat Mazumdar]
  • Add sub-sequence string kernel [lambday]

Bugfixes

  • Compile fixes for upcoming swig3.0
  • Speedup for Gaussian process apply()
  • Improve unit / integration test checks
  • libbmrm uninitialized memory reads
  • libocas uninitialized memory reads
  • Octave 3.8 compile fixes [Orion Poplawski]
  • Fix java modular compile error [Bjoern Esser]

LIBOL 0.3.0

by stevenhoi - December 12, 2013, 15:26:14 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 7692 views, 2368 downloads, 2 subscriptions

About: LIBOL is an open-source library with a family of state-of-the-art online learning algorithms for machine learning and big data analytics research. The current version supports 16 online algorithms for binary classification and 13 online algorithms for multiclass classification.

Changes:

Compared with our previous version (V0.2.3), the new version (V0.3.0) makes the following important changes:

• Add a template and guide for adding new algorithms;

• Improve parameter settings and make documentation clear;

• Improve documentation on data formats and key functions;

• Amend the "OGD" function to use different loss types;

• Fix some naming inconsistencies and other minor bugs.


KMLib sparse GPU SVM 0.1

by ksopyla - March 20, 2013, 14:30:08 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 1717 views, 419 downloads, 1 subscription

About: Support Vector Machine library in .NET with CUDA support. The library includes a GPU SVM solver for the linear, RBF, Chi-Square and Exp Chi-Square kernels, which uses NVIDIA CUDA technology. It allows classification of feature-rich sparse datasets by using the sparse matrix formats CSR, Ellpack-R or Sliced EllR-T.

Changes:

Initial Announcement on mloss.org.


pGBRT, Parallel Gradient Boosted Regression Trees 0.9

by swtyree - September 16, 2011, 22:15:46 CET [ Project Homepage BibTeX Download ] 6670 views, 1036 downloads, 1 subscription

About: Learns gradient boosted regression tree ensembles in parallel on shared memory or cluster systems

Changes:

Initial Announcement on mloss.org.


mldata-utils 0.5.0

by sonne - April 8, 2011, 10:02:44 CET [ Project Homepage BibTeX Download ] 19447 views, 4069 downloads, 1 subscription

About: Tools to convert datasets between various formats, performance measures, and API functions to communicate with mldata.org

Changes:
  • Change task file format so that data splits can have a variable number of items and be put into up to 256 categories of training/validation/test/not used/...
  • Various bugfixes.

redsvd 0.1.0

by hillbig - August 30, 2010, 18:13:55 CET [ Project Homepage BibTeX Download ] 3857 views, 825 downloads, 1 subscription

About: redsvd is a library for solving several matrix decompositions (SVD, PCA, eigenvalue decomposition). redsvd can handle very large matrices efficiently and is optimized for truncated SVD of sparse matrices. For example, redsvd can compute a truncated SVD with the top 20 singular values for a 100K x 100K matrix with 10M nonzero entries in about two seconds.

Changes:

Initial Announcement on mloss.org.


sofia ml 0.1

by dsculley - December 29, 2009, 23:30:58 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 4943 views, 897 downloads, 0 comments, 1 subscription

About: A fast implementation of several stochastic gradient descent learners for classification, ranking, and ROC area optimization, suitable for large, sparse data sets. Includes Pegasos SVM, SGD-SVM, Passive-Aggressive Perceptron, Perceptron with Margins, Logistic Regression, and ROMMA. Commandline utility and API libraries are provided.
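
As an illustration of the kind of learner listed above, the following Python sketch trains a linear SVM with Pegasos-style stochastic subgradient steps on sparse examples. It is a simplified teaching version, not sofia-ml's code: it omits the optional projection step, and the function names, the regularisation value lam and the toy data are assumptions made for this example.

```python
import random


def pegasos_train(examples, lam=0.1, steps=200, seed=0):
    """Train a linear SVM on (label, {index: value}) pairs, labels in {-1, +1}."""
    rng = random.Random(seed)
    w = {}
    for t in range(1, steps + 1):
        y, x = examples[rng.randrange(len(examples))]
        eta = 1.0 / (lam * t)  # decreasing step size
        margin = y * sum(w.get(i, 0.0) * v for i, v in x.items())
        # Shrink all weights: this is the gradient step for the L2 regulariser.
        scale = 1.0 - eta * lam
        for i in w:
            w[i] *= scale
        # The hinge-loss subgradient is non-zero only when the margin is violated.
        if margin < 1.0:
            for i, v in x.items():
                w[i] = w.get(i, 0.0) + eta * y * v
    return w


def predict(w, x):
    """Return +1 or -1 for a sparse feature dictionary."""
    score = sum(w.get(i, 0.0) * v for i, v in x.items())
    return 1 if score >= 0 else -1


toy = [(+1, {1: 1.0}), (+1, {1: 2.0}), (-1, {2: 1.0}), (-1, {2: 1.5})]
weights = pegasos_train(toy)
print([predict(weights, x) for _, x in toy])
```

Rescaling every weight on each step keeps the sketch short but costs time proportional to the number of non-zero weights; implementations aimed at large sparse data typically avoid this by tracking a single scale factor instead.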

Changes:

Initial Announcement on mloss.org.


Elefant 0.4

by kishorg - October 17, 2009, 08:48:19 CET [ Project Homepage BibTeX Download ] 16911 views, 7374 downloads, 2 subscriptions

Rating: 2.5 out of 5 stars (based on 2 votes)

About: Elefant is an open source software platform for the Machine Learning community licensed under the Mozilla Public License (MPL) and developed using Python, C, and C++. We aim to make it the platform [...]

Changes:

This release contains the Stream module as a first step in the direction of providing C++ library support. Stream aims to be a software framework for the implementation of large scale online learning algorithms. Large scale, in this context, should be understood as something that does not fit in the memory of a standard desktop computer.

Added Bundle Methods for Regularized Risk Minimization (BMRM), which allow choosing from a list of loss functions and solvers (linear and quadratic).

Added the following loss classes: BinaryClassificationLoss, HingeLoss, SquaredHingeLoss, ExponentialLoss, LogisticLoss, NoveltyLoss, LeastMeanSquareLoss, LeastAbsoluteDeviationLoss, QuantileRegressionLoss, EpsilonInsensitiveLoss, HuberRobustLoss, PoissonRegressionLoss, MultiClassLoss, WinnerTakesAllMultiClassLoss, ScaledSoftMarginMultiClassLoss, SoftmaxMultiClassLoss, MultivariateRegressionLoss
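
For readers unfamiliar with some of these losses, the sketch below gives common textbook definitions of a few of them in Python. These are not Elefant's classes, and Elefant may scale or parameterise its losses differently; the function names and the default values for eps and delta are assumptions. Here y is the target and f the prediction, with y in {-1, +1} for the classification losses.

```python
import math


def hinge_loss(y, f):
    """max(0, 1 - y*f): zero once the example is beyond the margin."""
    return max(0.0, 1.0 - y * f)


def squared_hinge_loss(y, f):
    """Squared hinge: smooth at the margin, penalises violations more."""
    return max(0.0, 1.0 - y * f) ** 2


def logistic_loss(y, f):
    """log(1 + exp(-y*f)); may overflow for very large negative margins."""
    return math.log1p(math.exp(-y * f))


def epsilon_insensitive_loss(y, f, eps=0.1):
    """Regression loss that ignores errors smaller than eps."""
    return max(0.0, abs(y - f) - eps)


def huber_loss(y, f, delta=1.0):
    """Quadratic for small residuals, linear for large ones."""
    r = abs(y - f)
    return 0.5 * r * r if r <= delta else delta * (r - 0.5 * delta)


print(hinge_loss(1, 0.5), squared_hinge_loss(1, 0.5), huber_loss(0.0, 3.0))
```

The classification losses depend only on the margin y*f, while the regression losses depend on the residual y - f, which is why the two groups take the same arguments but use them differently.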

The graphical user interface now provides extensive documentation for each component, explaining state variables and port descriptions.

Changed saving and loading of experiments to XML (thereby avoiding storage of large input data structures).

Unified automatic input checking via new static typing extending Python properties.

Full support for recursive composition of larger components containing arbitrary statically typed state variables.


Dirichlet Forest LDA 0.1.1

by davidandrzej - July 16, 2009, 21:59:53 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 4579 views, 888 downloads, 1 subscription

About: This software implements the Dirichlet Forest (DF) Prior within the Latent Dirichlet Allocation (LDA) model. When combined with LDA, the Dirichlet Forest Prior allows the user to encode domain knowledge (must-links and cannot-links between words) into the prior on topic-word multinomials.

Changes:

Initial Announcement on mloss.org.


LibSGDQN 1.1

by antojne - July 2, 2009, 15:02:44 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 6590 views, 1312 downloads, 1 subscription

About: LibSGDQN proposes an implementation of SGD-QN, a carefully designed quasi-Newton stochastic gradient descent solver for linear SVMs.

Changes:

small bug fix (thx nicolas ;)


OLaRankGreedy 1.0

by antojne - June 24, 2009, 17:07:57 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 4144 views, 902 downloads, 1 subscription

About: OLaRankGreedy is an online solver of the dual formulation of support vector machines for sequence labeling using greedy inference.

Changes:

Initial Announcement on mloss.org.


OLaRankExact 1.0

by antojne - June 24, 2009, 17:03:48 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 3825 views, 920 downloads, 1 subscription

About: OLaRank is an online solver of the dual formulation of support vector machines for sequence labeling using Viterbi decoding.

Changes:

Initial Announcement on mloss.org.


BMRM 2.1

by chteo - May 8, 2009, 08:08:20 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 5658 views, 1099 downloads, 1 subscription

About: BMRM is an open source, modular and scalable convex solver for many machine learning problems cast as regularized risk minimization problems.

Changes:

Initial Announcement on mloss.org.


CoFiRank 0.1

by alexis - March 30, 2009, 17:17:34 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 4965 views, 998 downloads, 2 subscriptions

About: CoFiRank is a Collaborative Filtering system based on matrix factorization. CoFiRank is based on the idea that it is better to predict the relative order of preferences (ranking) instead of the absolute rating.

Changes:

Initial Announcement on mloss.org.