Projects that are tagged with large scale learning.

revrand 1.0.0

by dsteinberg - January 29, 2017, 04:33:54 CET [ Project Homepage BibTeX Download ] 47571 views, 12395 downloads, 0 subscriptions

Rating: 0 out of 5 stars (based on 1 vote)

About: A library of scalable Bayesian generalised linear models with fancy features

  • 1.0 release!
  • A random search phase now runs before optimization of all hyperparameters in the regression algorithms. This improved initialisation makes local optima easier to avoid, which improves the performance of revrand.
  • Regression regularizers (weight variances) are now associated with each basis object; this approximates GP kernel addition more closely
  • Random state can be set for all random objects
  • Numerous small improvements to make revrand production ready
  • Final report
  • Documentation improvements
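
The random search phase mentioned in the changelog above can be illustrated with a minimal sketch. This is not revrand's code: the toy objective, the function names `local_refine` and `random_search_then_refine`, and the simple derivative-free refinement step are all assumptions made for illustration. The idea shown is only that the best of several random draws is used as the starting point for a local optimizer.

```python
import math
import random


def objective(x):
    # Hypothetical multimodal objective standing in for a model's
    # hyperparameter loss; it has several local minima.
    return math.sin(3.0 * x) + 0.1 * (x - 2.0) ** 2


def local_refine(f, x, step=0.5, tol=1e-6):
    # Simple derivative-free descent: try a step in each direction,
    # accept the first improvement, halve the step when neither helps.
    fx = f(x)
    while step > tol:
        for cand in (x - step, x + step):
            fc = f(cand)
            if fc < fx:
                x, fx = cand, fc
                break
        else:
            step /= 2.0
    return x, fx


def random_search_then_refine(f, low, high, n_samples=50, seed=0):
    rng = random.Random(seed)  # fixed seed so the run is reproducible
    candidates = [rng.uniform(low, high) for _ in range(n_samples)]
    x0 = min(candidates, key=f)  # best random draw becomes the start point
    return local_refine(f, x0)


x_best, f_best = random_search_then_refine(objective, -5.0, 5.0)
```

Because the refinement only ever accepts improvements, the final value is never worse than the best random draw.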

Rabit 0.1.0

by crowwork - January 21, 2015, 18:48:46 CET [ Project Homepage BibTeX Download ] 7079 views, 2843 downloads, 0 subscriptions

About: rabit (Reliable Allreduce and Broadcast Interface) is a lightweight library that provides a fault-tolerant interface of Allreduce and Broadcast for portable, scalable and reliable distributed machine learning programs. Rabit programs can run on various platforms such as Hadoop and MPI, and no installation is needed. Rabit now supports k-means clustering and distributed xgboost, an extremely efficient distributed boosted tree (GBDT) toolkit.
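
The semantics of the two collective operations named above can be sketched in a single process. This is not Rabit's API and it has no fault tolerance or networking; the function names `allreduce_sum` and `broadcast` are hypothetical, and each list element stands in for the value held by one worker.

```python
def allreduce_sum(worker_vectors):
    # Each worker contributes one vector; after the call every worker
    # holds the same element-wise sum (e.g. partial k-means statistics).
    summed = [sum(column) for column in zip(*worker_vectors)]
    return [list(summed) for _ in worker_vectors]


def broadcast(worker_values, root=0):
    # Every worker receives the value held by the root worker.
    return [worker_values[root] for _ in worker_values]


partial_sums = [[1, 2], [3, 4], [5, 6]]  # one vector per simulated worker
after_allreduce = allreduce_sum(partial_sums)
after_broadcast = broadcast(["a", "b", "c"], root=1)
```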


Initial Announcement on

JMLR MultiBoost 1.2.02

by busarobi - March 31, 2014, 16:13:04 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 87133 views, 14333 downloads, 0 subscriptions

About: MultiBoost is a multi-purpose boosting package implemented in C++. It is based on the multi-class/multi-task AdaBoost.MH algorithm [Schapire-Singer, 1999]. Basic base learners (stumps, trees, products, Haar filters for image processing) can be easily complemented by new data representations and the corresponding base learners, without interfering with the main boosting engine.


Major changes:

  • The “early stopping” feature can now be based on any metric output with the --outputinfo command line argument.

  • Early stopping now works with --slowresume command line argument.

Minor fixes:

  • More informative output when testing.

  • Fixed various compilation glitches with recent clang (OS X/Linux).

JMLR EnsembleSVM 2.0

by claesenm - March 31, 2014, 08:06:20 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 32331 views, 8939 downloads, 0 subscriptions

About: The EnsembleSVM library offers functionality to perform ensemble learning using Support Vector Machine (SVM) base models. In particular, we offer routines for binary ensemble models using SVM base classifiers. Experimental results have shown the predictive performance to be comparable with standard SVM models but with drastically reduced training time. Ensemble learning with SVM models is particularly useful for semi-supervised tasks.


The library has been updated and features a variety of new functionality as well as more efficient implementations of original features. The following key improvements have been made:

  1. Support for multithreading in training and prediction with ensemble models. Since both of these are embarrassingly parallel, this has yielded a significant speedup (3-fold on quad-core).
  2. Extensive programming framework for aggregation of base model predictions which allows highly efficient prototyping of new aggregation approaches. Additionally we provide several predefined strategies, including (weighted) majority voting, logistic regression and nonlinear SVMs of your choice -- be sure to check out the esvm-edit tool! The provided framework also allows you to efficiently program your own, novel aggregation schemes.
  3. Full code transition to C++11, the latest C++ standard, which enabled various performance improvements. The new release requires moderately recent compilers, such as gcc 4.7.2+ or clang 3.2+.
  4. Generic implementations of convenient facilities have been added, such as thread pools, deserialization factories and more.

The API and ABI have undergone significant changes, many of which are due to the transition to C++11.
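
The (weighted) majority voting strategy mentioned in item 2 can be illustrated with a minimal sketch. This is not EnsembleSVM's C++ implementation; the function name `weighted_majority_vote`, the label convention of -1/+1, and breaking ties in favour of +1 are assumptions made for illustration.

```python
def weighted_majority_vote(predictions, weights=None):
    # predictions: one label in {-1, +1} per base model.
    # weights: optional per-model weights; defaults to an unweighted vote.
    if weights is None:
        weights = [1.0] * len(predictions)
    if len(weights) != len(predictions):
        raise ValueError("need exactly one weight per prediction")
    score = sum(w * p for w, p in zip(weights, predictions))
    return 1 if score >= 0 else -1  # ties go to +1 (an arbitrary choice)


plain = weighted_majority_vote([1, -1, -1])
weighted = weighted_majority_vote([1, -1, -1], weights=[3.0, 1.0, 1.0])
```

In the unweighted call the two negative votes win; in the weighted call the first model's larger weight overrides them.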

JMLR BudgetedSVM v1.1

by nemanja - February 12, 2014, 20:53:45 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 11740 views, 2157 downloads, 0 subscriptions

About: BudgetedSVM is an open-source C++ toolbox for scalable non-linear classification. The toolbox can be seen as a missing link between LibLinear and LibSVM, combining the efficiency of linear with the accuracy of kernel SVM. We provide an Application Programming Interface for efficient training and testing of non-linear classifiers, supported by data structures designed for handling data which cannot fit in memory. We also provide command-line and Matlab interfaces, providing users with an efficient, easy-to-use tool for large-scale non-linear classification.


Changed license from LGPL v3 to Modified BSD.

MyMediaLite 3.10

by zenog - October 8, 2013, 22:29:29 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 178799 views, 34849 downloads, 0 subscriptions

About: MyMediaLite is a lightweight, multi-purpose library of recommender system algorithms.


Mostly bug fixes.

For details see:

LIBOCAS 0.93

by vf - June 20, 2010, 12:22:05 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 23172 views, 3938 downloads, 0 subscriptions

About: The library implements the Optimized Cutting Plane Algorithm (OCAS) for efficient training of linear SVM classifiers from large-scale data.


Implemented the COFFIN framework, which allows efficient training of invariant image classifiers via virtual examples.

asp 0.3

by sonne - May 7, 2010, 10:25:39 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 20776 views, 3911 downloads, 0 subscriptions

About: Accurate splice site predictor for a variety of genomes.


Asp now supports three formats:

-g fname for gff format

-s fname for spf format

-b dir for a binary format compatible with mGene.

And a new switch:

-t, which switches on a sigmoid-based transformation of the SVM scores to get scores between 0 and 1.
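
A sigmoid-based transformation of this kind can be sketched as follows. The exact form used by asp is not stated here, so the logistic function below, the name `sigmoid_score`, and the slope and offset parameters `a` and `b` are assumptions for illustration only.

```python
import math


def sigmoid_score(svm_score, a=1.0, b=0.0):
    # Map a real-valued SVM output into the range 0..1.
    # a (slope) and b (offset) are hypothetical parameters.
    z = a * svm_score + b
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    # For negative z use the equivalent form that avoids overflow.
    ez = math.exp(z)
    return ez / (1.0 + ez)


scores = [sigmoid_score(s) for s in (-2.0, 0.0, 2.0)]
```

The transformation is monotonic, so the ranking of predictions by score is unchanged.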

arts 0.2

by sonne - May 25, 2009, 09:56:31 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 11787 views, 2525 downloads, 0 subscriptions

About: ARTS is an accurate predictor for Transcription Start Sites (TSS).


Initial Announcement on

BMRM 2.1

by chteo - May 8, 2009, 08:08:20 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 13950 views, 2995 downloads, 0 subscriptions

About: BMRM is an open source, modular and scalable convex solver for many machine learning problems cast as regularized risk minimization problems.


Initial Announcement on

JMLR Nieme 1.0

by francis - April 2, 2009, 10:57:38 CET [ Project Homepage BibTeX Download ] 33733 views, 5263 downloads, 0 subscriptions

Rating: 3.5 out of 5 stars (based on 3 votes)

About: Nieme is a C++ machine learning library for large-scale classification, regression and ranking. It provides a simple interface available in C++, Python and Java and a user interface for visualization.


Released Nieme 1.0

Disco 0.1

by tuulos - October 6, 2008, 11:14:48 CET [ Project Homepage BibTeX Download ] 14603 views, 2921 downloads, 0 subscriptions

About: Disco is an open-source implementation of the Map-Reduce framework for distributed computing. Like the original framework, Disco supports parallel [...]
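
The Map-Reduce pattern itself can be illustrated with a single-process word count. This sketch does not use Disco's API and runs nothing in parallel; the function names `map_phase` and `reduce_phase` are hypothetical.

```python
from collections import defaultdict


def map_phase(lines):
    # Emit a (key, value) pair for every word in every input line.
    for line in lines:
        for word in line.split():
            yield word.lower(), 1


def reduce_phase(pairs):
    # Sum the values of all pairs that share a key.
    counts = defaultdict(int)
    for key, value in pairs:
        counts[key] += value
    return dict(counts)


word_counts = reduce_phase(map_phase(["a b a", "B c"]))
```

In a distributed setting the map calls run on separate workers and the pairs are grouped by key before being reduced; here both phases simply run in sequence.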


Initial Announcement on


by biconnect - September 3, 2008, 17:35:24 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 30611 views, 3761 downloads, 0 subscriptions

Rating: 4.5 out of 5 stars (based on 2 votes)

About: LIBLINEAR is a linear classifier for data with millions of instances and features. It supports L2-regularized logistic regression (LR), L2-loss linear SVM, L1-loss linear SVM, and multi-class SVM.
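
The binary loss functions named above can be written out in a short sketch. This is not LIBLINEAR code and it performs no optimization; the function names and the toy data are hypothetical. It only evaluates the L2-regularized objective 0.5 * ||w||^2 + C * sum of per-example losses, where each loss is a function of the margin y * (w . x).

```python
import math


def l1_loss_svm(margin):
    # Hinge loss: zero once the margin reaches 1.
    return max(0.0, 1.0 - margin)


def l2_loss_svm(margin):
    # Squared hinge loss.
    return max(0.0, 1.0 - margin) ** 2


def logistic_loss(margin):
    # log(1 + exp(-margin)), written to avoid overflow for negative margins.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


def regularized_objective(w, data, loss, C=1.0):
    # data: list of (feature_vector, label) pairs with labels in {-1, +1}.
    reg = 0.5 * sum(wi * wi for wi in w)
    risk = sum(loss(y * sum(wi * xi for wi, xi in zip(w, x))) for x, y in data)
    return reg + C * risk


toy_data = [([2.0, 0.0], 1), ([-2.0, 0.0], -1)]  # made-up example points
value = regularized_objective([1.0, 0.0], toy_data, l1_loss_svm)
```

With the weight vector above both toy points have margin 2, so the hinge loss vanishes and only the regularizer contributes.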


Initial Announcement on

LaRank 1.1

by antojne - July 15, 2008, 15:58:25 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 14731 views, 2979 downloads, 0 subscriptions

About: LaRank is an online solver for multiclass Support Vector Machines.


Initial Announcement on

BSVM 2.06

by biconnect - January 30, 2008, 10:27:13 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 15457 views, 3034 downloads, 0 subscriptions

About: BSVM solves support vector machines (SVM) for the solution of large classification and regression problems. It includes three methods


Initial Announcement on

PLearn 0.92

by vincentp - November 30, 2007, 07:51:26 CET [ Project Homepage BibTeX Download ] 12442 views, 3475 downloads, 0 subscriptions

Rating: 4 out of 5 stars (based on 1 vote)

About: PLearn is a large C++ machine-learning library with a set of Python tools and Python bindings. It is mostly a research platform for developing novel algorithms, and is being used extensively at [...]


Initial Announcement on