Projects that are tagged with large datasets.


scikit-multilearn 0.0.5

by niedakh - February 25, 2017, 03:51:59 CET. 3446 views, 750 downloads, 3 subscriptions

About: A native Python, scikit-compatible, implementation of a variety of multi-label classification algorithms.

Changes:
  • a general matrix-based label space clusterer has been added; it can cluster the output space using any scikit-learn compatible clusterer (incl. k-means)
  • support for more single-class and multi-class classifiers: you can now use problem transformation approaches with your favourite neural network/deep learning libraries (theano, tensorflow, keras, scikit-neuralnetworks)
  • support for label powerset based stratified k-fold has been added
  • the graph-tool clusterer supports weighted graphs again and includes stochastic blockmodel calibration
  • bugs were fixed in classifier chains and hierarchical neuro-fuzzy classifiers
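
The idea behind label powerset based stratified k-fold can be sketched in plain Python: each distinct combination of labels is treated as one class, and the samples of every class are dealt evenly across the folds. This is a minimal illustration under that assumption, not scikit-multilearn's own API; the function name `label_powerset_kfold` and the list-of-lists label matrix are hypothetical.

```python
from collections import defaultdict


def label_powerset_kfold(Y, n_splits):
    """Yield (train_indices, test_indices) stratified on each row's full label combination."""
    # Group sample indices by label combination: one "powerset class" per distinct row.
    groups = defaultdict(list)
    for index, row in enumerate(Y):
        groups[tuple(row)].append(index)

    # Deal the members of every class round-robin over the folds, so each fold
    # receives a near-equal share of every label combination.
    folds = [[] for _ in range(n_splits)]
    position = 0
    for members in groups.values():
        for index in members:
            folds[position % n_splits].append(index)
            position += 1

    # Each fold serves once as the test set; the remaining folds form the training set.
    for k in range(n_splits):
        test = sorted(folds[k])
        train = sorted(i for j, fold in enumerate(folds) if j != k for i in fold)
        yield train, test
```

Rare label combinations with fewer samples than folds cannot appear in every fold, so in practice such a split is only approximately stratified.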

Universal Java Matrix Package 0.3.0

by arndt - July 31, 2015, 14:23:14 CET. 16241 views, 3082 downloads, 3 subscriptions

About: The Universal Java Matrix Package (UJMP) is a data processing tool for Java. Unlike JAMA and Colt, it supports multi-threading and is therefore much faster on current hardware. It is not limited to matrices of double values: it handles every type of data as a matrix through a common interface, e.g. CSV files, Excel files, images, WAVE audio files, tables in SQL databases, and much more.

Changes:

Updated to version 0.3.0


Linear SVM with general regularization 1.0

by rflamary - October 5, 2012, 15:34:21 CET. 6276 views, 1733 downloads, 1 subscription

About: This package implements a linear SVM solver with a wide class of regularizations on the SVM weight vector (l1, l2, mixed norm l1-lq, adaptive lasso). We provide solvers for the classical single-task SVM problem and for multi-task problems with joint feature selection or a similarity-promoting term.

Changes:

Initial Announcement on mloss.org.


SMIDAS 1.1

by ambujtewari - August 15, 2010, 18:51:51 CET. 10721 views, 2181 downloads, 1 subscription

About: A stochastic variant of the mirror descent algorithm that employs Langford and Zhang's truncated gradient idea to solve L1-regularized loss minimization problems for classification and regression.
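
To make the combination of mirror descent and truncation concrete, here is a minimal sketch rather than the package's code. It assumes squared loss and a p-norm link given by the gradient of 0.5 * ||theta||_p^2, with p = 2 by default, in which case the link is the identity and the method reduces to stochastic gradient descent with truncation. The names `link_inverse`, `truncate` and `smd_truncated` are hypothetical.

```python
import math
import random


def link_inverse(theta, p):
    """Gradient of 0.5 * ||theta||_p^2: maps the dual vector theta to the weights."""
    norm = sum(abs(t) ** p for t in theta) ** (1.0 / p)
    if norm == 0.0:
        return [0.0] * len(theta)
    return [math.copysign(abs(t) ** (p - 1), t) / norm ** (p - 2) for t in theta]


def truncate(value, threshold):
    """Shrink value toward zero by threshold, clipping at zero."""
    return math.copysign(max(0.0, abs(value) - threshold), value)


def smd_truncated(examples, lam, eta, steps, p=2.0, seed=0):
    """Stochastic mirror descent with truncation for squared loss + lam * ||w||_1.

    examples is a list of (x, y) pairs, where x is a list of floats.
    """
    rng = random.Random(seed)
    theta = [0.0] * len(examples[0][0])
    for _ in range(steps):
        x, y = rng.choice(examples)          # one random example per step
        w = link_inverse(theta, p)           # current weights from the dual vector
        residual = sum(wj * xj for wj, xj in zip(w, x)) - y  # d/da of 0.5 * (a - y)^2
        # Gradient step on the dual vector, followed by truncation of each component.
        theta = [truncate(t - eta * residual * xj, eta * lam)
                 for t, xj in zip(theta, x)]
    return link_inverse(theta, p)
```

The truncation step is what produces exact zeros: any dual component whose magnitude falls below `eta * lam` after the gradient step is set to zero.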

Changes:

Fixed major bug in implementation. The components of the iterate where the current example vector is zero were not being updated correctly. Thanks to Jonathan Chang for pointing out the error to us.


SCD 2.1

by ambujtewari - December 3, 2009, 22:21:45 CET. 11088 views, 2124 downloads, 1 subscription

About: A (randomized) coordinate descent procedure to minimize L1-regularized loss for classification and regression.
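
As an illustration of the general technique (not the package's implementation), the sketch below runs randomized coordinate descent on the average squared loss plus an L1 penalty. It assumes every feature value lies in [-1, 1], so that a smoothness constant `beta = 1.0` is valid for the per-coordinate step. The names `soft_threshold` and `scd_lasso` are hypothetical.

```python
import random


def soft_threshold(value, threshold):
    """Move value toward zero by threshold; values inside the band become exactly 0."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def scd_lasso(X, y, lam, steps, beta=1.0, seed=0):
    """Minimize (1/m) * sum_i 0.5 * (<w, x_i> - y_i)^2 + lam * ||w||_1."""
    rng = random.Random(seed)
    m, d = len(X), len(X[0])
    w = [0.0] * d
    preds = [0.0] * m                      # cached inner products <w, x_i>
    for _ in range(steps):
        j = rng.randrange(d)               # pick one coordinate uniformly at random
        # Partial derivative of the average loss with respect to w[j].
        g = sum((preds[i] - y[i]) * X[i][j] for i in range(m)) / m
        # Gradient step on coordinate j, then shrinkage for the L1 term.
        new_wj = soft_threshold(w[j] - g / beta, lam / beta)
        delta = new_wj - w[j]
        if delta != 0.0:
            for i in range(m):
                preds[i] += delta * X[i][j]
            w[j] = new_wj
    return w
```

Caching the predictions keeps each step at O(m) work for dense data, since only one coordinate of `w` changes per iteration.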

Changes:

Fixed some I/O bugs. Lines that ended with whitespace were not read correctly in the previous version.