Projects that are tagged with large datasets.


Logo scikit multilearn 0.0.5

by niedakh - February 25, 2017, 03:51:59 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 19047 views, 4081 downloads, 0 subscriptions

About: A native Python, scikit-compatible, implementation of a variety of multi-label classification algorithms.

Changes:
  • a general matrix-based label space clusterer has been added which can cluster the output space using any scikit-learn compatible clusterer (incl. k-means)
  • support for more single-class and multi-class classifiers you can now use problem transformation approaches with your favourite neural networks/deep learning libraries: theano, tensorflow, keras, scikit-neuralnetworks
  • support for label powerset based stratified kfold added
  • graph-tool clusterer supports weighted graphs again and includes stochastic blockmodel calibration
  • bugs were fixed in: classifier chains and hierarchical neuro fuzzy clasifiers

Logo Universal Java Matrix Package 0.3.0

by arndt - July 31, 2015, 14:23:14 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 29381 views, 5641 downloads, 0 subscriptions

About: The Universal Java Matrix Package (UJMP) is a data processing tool for Java. Unlike JAMA and Colt, it supports multi-threading and is therefore much faster on current hardware. It does not only support matrices with double values, but instead handles every type of data as a matrix through a common interface, e.g. CSV files, Excel files, images, WAVE audio files, tables in SQL data bases, and much more.

Changes:

Updated to version 0.3.0


Logo Linear SVM with general regularization 1.0

by rflamary - October 5, 2012, 15:34:21 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 12543 views, 3275 downloads, 0 subscriptions

About: This package is an implementation of a linear svm solver with a wide class of regularizations on the svm weight vector (l1, l2, mixed norm l1-lq, adaptive lasso). We provide solvers for the classical single task svm problem and for multi-task with joint feature selection or similarity promoting term.

Changes:

Initial Announcement on mloss.org.


Logo SMIDAS 1.1

by ambujtewari - August 15, 2010, 18:51:51 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 20510 views, 4024 downloads, 0 subscriptions

About: A stochastic variant of the mirror descent algorithm employing Langford and Zhang's truncated gradient idea to minimize L1 regularized loss minimization problems for classification and regression.

Changes:

Fixed major bug in implementation. The components of the iterate where the current example vector is zero were not being updated correctly. Thanks to Jonathan Chang for pointing out the error to us.


Logo SCD 2.1

by ambujtewari - December 3, 2009, 22:21:45 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 24358 views, 4616 downloads, 0 subscriptions

About: A (randomized) coordinate descent procedure to minimize L1 regularized loss for classification and regression purposes.

Changes:

Fixed some I/O bugs. Lines that ended with whitespace were not read correctly in the previous version.