Projects that are tagged with data mining.


Waffles 2010-06-09

by mgashler - June 9, 2010, 23:08:38 CET - 4702 views, 1407 downloads, 1 subscription

About: A C++ library of machine learning algorithms and tools. Several demos are included that show how to use the library. Also, there is a script-friendly command-line interface that makes the algorithms [...]

Changes:

See the change log at http://waffles.sourceforge.net/changelog.html


JMLR Model Monitor 1.0

by traeder - August 17, 2009, 11:05:06 CET - 5768 views, 999 downloads, 0 comments, 1 subscription

About: Model Monitor is a Java toolkit for the systematic evaluation of classifiers under changes in distribution. It provides methods for detecting distribution shifts in data, comparing the performance [...]

Changes:

Improved AUROC calculation. Several minor bug fixes.


Debellor 1.0

by mwojnars - July 30, 2009, 16:48:05 CET - 3419 views, 923 downloads, 1 subscription

About: Debellor is a scalable and extensible platform that provides a common architecture for data mining and machine learning algorithms of various types.

Changes:
  • Naming of numerous classes/methods/fields changed to be more accurate and comprehensible
  • Weka and Rseslib libraries updated to the newest versions: Weka 3.6.1 & Rseslib 3.0.1. Debellor's wrappers adapted
  • New class: CrossValidation - evaluator of trainable cells through cross-validation
  • New class: RMSE - calculation of Root Mean Squared Error score
  • Data objects can be compared and used in collections
  • ArffReader can read from a user-provided java.io.InputStream
  • More convenient use of parameters (setting values)
  • More convenient use of data objects and data types (construction, type casting)
  • Other minor improvements to existing classes
  • Javadoc extended
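
The CrossValidation and RMSE entries above name two standard evaluation techniques. As a rough illustration only, and not Debellor's Java API, the following Python sketch scores a trivial baseline (predicting the training mean) with k-fold cross-validation and Root Mean Squared Error. All function names here are hypothetical.

```python
import math


def rmse(predicted, actual):
    """Root Mean Squared Error between two equal-length sequences."""
    if len(predicted) != len(actual) or len(actual) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(p - a) ** 2 for p, a in zip(predicted, actual)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def k_fold_splits(n, k):
    """Yield (train_indices, test_indices) for k contiguous folds over range(n)."""
    if not 1 < k <= n:
        raise ValueError("k must satisfy 1 < k <= n")
    # Spread the remainder over the first n % k folds.
    fold_sizes = [n // k + (1 if i < n % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n))
        yield train, test
        start += size


def cross_validate_mean_predictor(targets, k):
    """Average RMSE over k folds for a baseline that predicts the training mean."""
    scores = []
    for train, test in k_fold_splits(len(targets), k):
        mean = sum(targets[i] for i in train) / len(train)
        scores.append(rmse([mean] * len(test), [targets[i] for i in test]))
    return sum(scores) / len(scores)
```

A real evaluator would train an actual model on each training fold; the mean predictor is used here only to keep the sketch self-contained.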

Disco 0.1

by tuulos - October 6, 2008, 11:14:48 CET - 3060 views, 561 downloads, 1 subscription

About: Disco is an open-source implementation of the [Map-Reduce framework](http://en.wikipedia.org/wiki/MapReduce) for distributed computing. Like the original framework, Disco supports parallel [...]

Changes:

Initial Announcement on mloss.org.
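
For readers unfamiliar with the Map-Reduce model mentioned in the Disco entry, here is a minimal single-process Python sketch of the idea using a word count. It does not use Disco's API and does no distribution; the function names are hypothetical and chosen for illustration.

```python
from collections import defaultdict


def map_words(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def reduce_counts(word, counts):
    """Reduce step: combine all values emitted for one key."""
    return word, sum(counts)


def run_map_reduce(inputs, mapper, reducer):
    """Apply mapper to each input, group values by key, then reduce each group."""
    groups = defaultdict(list)
    for item in inputs:
        for key, value in mapper(item):
            groups[key].append(value)
    return dict(reducer(key, values) for key, values in sorted(groups.items()))
```

In a distributed implementation the map calls and the per-key reduce calls would run on different machines; the grouping step in the middle is what the framework handles for the user.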


Sleipnir 1.0

by chuttenh - June 30, 2008, 03:22:19 CET - 2701 views, 603 downloads, 1 subscription

About: The Sleipnir C++ library implements a variety of machine learning and data manipulation algorithms, focusing on heterogeneous data integration and efficiency for large biological data collections.

Changes:

Initial Announcement on mloss.org.


MinorThird 20080414

by frank - June 9, 2008, 09:08:30 CET - 3212 views, 1055 downloads, 1 subscription

About: MinorThird is a collection of Java classes for storing text, annotating text, and learning to extract entities and categorize text. It was written primarily by William W. Cohen, a professor at [...]

Changes:

Initial Announcement on mloss.org.