Projects supporting the txt data format.


Harry 0.3.1

by konrad - October 22, 2014, 13:00:57 CET [ Project Homepage BibTeX Download ] 2138 views, 448 downloads, 2 subscriptions

About: A Tool for Measuring String Similarity

Changes:

This release features several runtime improvements. Moreover, support for Soundex transformations and output modules for Matlab and JSON have been added. The distribution package also contains a new tutorial with examples.


JMLR MLPACK 1.0.10

by rcurtin - August 29, 2014, 21:26:18 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 33732 views, 6674 downloads, 6 subscriptions

Rating: 4.5 out of 5 stars (based on 1 vote)

About: A scalable, fast C++ machine learning library, with emphasis on usability.

Changes:
  • Bugfix for NeighborSearch regression which caused very slow allknn/allkfn. Speeds are now restored to approximately 1.0.8 speeds, with significant improvement for the cover tree (#365).
  • Detect dependencies correctly when ARMA_USE_WRAPPER is not defined (i.e. libarmadillo.so does not exist).
  • Bugfix for compilation under Visual Studio (#366).

Salad 0.5.0

by chwress - August 22, 2014, 17:54:56 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 4153 views, 746 downloads, 1 subscription

About: A Content Anomaly Detector based on n-Grams

Changes:

Lots and lots of cool new features and bugfixes ;)

  • Refinements to the user interface: This includes a progress indicator, colors, etc.
  • Determine the expected error (salad-inspect)
  • Enable the user to echo the used parametrization: salad [train|predict|inspect] --echo-params
  • Allow setting the input batch size as a program argument: salad [train|predict|inspect] --batch-size
  • libsalad: The library provides access to salad's basic functions
  • Installers and precompiled binaries: Windows installer, Debian (ppa:chwress/salad) & RPM packages as well as generic Linux installers.
  • Various minor bug fixes
  • Support for "length at end" zip files
  • Improve salad's usage in a 2-class setting: salad [train|predict|inspect] --input-filter

JMLR Sally 0.9.0

by konrad - July 1, 2014, 22:43:51 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 20656 views, 4158 downloads, 2 subscriptions

About: A Tool for Embedding Strings in Vector Spaces

Changes:

Support for hash-based dimension reduction: simhash, minhash and Bloom filter. Support for several n-gram variants: regular, sorted, positional and blended n-grams. Simplified configuration.
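
As a rough illustration of the general idea behind embedding strings via n-grams and hashing, the sketch below maps a string to a fixed-size count vector. It uses the plain hashing trick, not simhash, minhash or a Bloom filter, and it is not Sally's implementation; the function names `char_ngrams` and `hashed_embedding` and the default parameters are hypothetical.

```python
import zlib
from collections import Counter


def char_ngrams(text: str, n: int) -> list[str]:
    """Return all contiguous character n-grams of `text` (regular n-grams)."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def hashed_embedding(text: str, n: int = 3, dims: int = 16) -> list[int]:
    """Map `text` to a count vector with `dims` entries.

    Each n-gram is hashed with CRC32 and assigned to the bucket
    `hash % dims`, so the vector size stays fixed no matter how many
    distinct n-grams occur. Different n-grams may collide in one bucket.
    """
    vec = [0] * dims
    for gram, count in Counter(char_ngrams(text, n)).items():
        index = zlib.crc32(gram.encode("utf-8")) % dims
        vec[index] += count
    return vec
```

Calling `hashed_embedding("hello world")` yields a list of 16 counts whose sum equals the number of 3-grams in the string. A smaller `dims` saves memory at the cost of more collisions.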


DCABags 0.7

by wbuntine - June 5, 2014, 05:34:44 CET [ Project Homepage BibTeX Download ] 2495 views, 556 downloads, 4 subscriptions

About: Document/Text preprocessing for topic models: suite of Perl scripts for preprocessing text collections to create dictionaries and bag/list files for use by topic modelling software.

Changes:

Moved distribution and code across to GitHub. Changed "ldac" format to have 0 offset for word indices. Added "document frequency" (df) filtering on selection of tokens for linkTables. Playing with linkParse but it's still generally unusable.
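
To make the index offset concrete, here is a minimal sketch that assumes "ldac" refers to the LDA-C sparse bag-of-words layout, in which each document is one line of the form `M index:count index:count ...` with `M` the number of distinct terms. The helper names `parse_ldac_line` and `shift_to_zero_offset` are hypothetical and are not part of DCABags.

```python
def parse_ldac_line(line: str) -> dict[int, int]:
    """Parse one document line into a {word_index: count} mapping.

    Assumed layout: "M i1:c1 i2:c2 ...", where M is the number of
    distinct terms that follow.
    """
    fields = line.split()
    if not fields:
        return {}
    declared = int(fields[0])
    counts: dict[int, int] = {}
    for field in fields[1:]:
        index, count = field.split(":")
        counts[int(index)] = int(count)
    if declared != len(counts):
        raise ValueError(
            f"line declares {declared} terms but contains {len(counts)}"
        )
    return counts


def shift_to_zero_offset(counts: dict[int, int]) -> dict[int, int]:
    """Convert 1-based word indices to 0-based ones."""
    return {index - 1: count for index, count in counts.items()}
```

With a 0 offset, index `0` refers to the first dictionary entry, so a file written with 1-based indices has to be shifted before it is combined with 0-based data.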


A Parallel LDA Learning Toolbox 1.0

by yanjianfeng - January 24, 2014, 11:48:07 CET [ BibTeX Download ] 957 views, 314 downloads, 1 subscription

About: We introduce PLL, a parallel LDA learning toolbox for big topic modeling.

Changes:

Fixed some compilation errors.


Malheur 0.5.4

by konrad - December 25, 2013, 13:20:31 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 12530 views, 2396 downloads, 1 subscription

About: Automatic Analysis of Malware Behavior using Machine Learning

Changes:

Support for new version of libarchive. Minor bug fixes.


JMLR CAM Java 3.1

by wangny - October 14, 2013, 22:46:03 CET [ Project Homepage BibTeX Download ] 5718 views, 2665 downloads, 1 subscription

About: The CAM R-Java software provides a novel way to solve the blind source separation problem.

Changes:

This version fixes a problem that prevented the software from working under the newest R version, R-3.0.


About: TBEEF, a doubly ensemble framework for recommendation and prediction problems.

Changes:

Updated the included documentation.


About: A fast and robust learning of Bayesian networks

Changes:

Initial Announcement on mloss.org.


fastICA 0.1

by maitra - February 28, 2013, 06:30:20 CET [ Project Homepage BibTeX Download ] 1535 views, 379 downloads, 1 subscription

About: The open-source C package fastICA implements the fastICA algorithm of Aapo Hyvarinen et al. (URL: http://www.cs.helsinki.fi/u/ahyvarin/) to perform Independent Component Analysis (ICA) and Projection Pursuit. fastICA is released under the GNU General Public License (GPL).

Changes:

Initial Announcement on mloss.org.


Neural network designer 1.1.1

by bragi - December 28, 2012, 11:38:10 CET [ Project Homepage BibTeX Download ] 2731 views, 763 downloads, 1 subscription

About: A DBMS for resonating neural networks. Create and use different types of machine learning algorithms.

Changes:

AIML compatible (AIML files can be imported); new 'Grid channel' for developing board games; improved topics editor; new demo project: ALice (from AIML); lots of bug-fixes and speed improvements


Reranker Framework 1.0

by zenog - October 29, 2012, 10:05:30 CET [ Project Homepage BibTeX Download ] 1168 views, 389 downloads, 1 subscription

About: ReFr is a software architecture for specifying, training and using reranking models.

Changes:

Initial Announcement on mloss.org.


Pattern 2.4

by tomdesmedt - August 31, 2012, 02:26:01 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 6863 views, 1769 downloads, 1 subscription

About: "Pattern" is a web mining module for Python. It bundles tools for data retrieval, text analysis, clustering and classification, and data visualization.

Changes:
  • Small overall bug fixes and performance improvements.
  • Module pattern.web: updated to the new Bing API (the Bing API is a paid service now).
  • Module pattern.en: now includes Norvig's spell checking algorithm.
  • Module pattern.de: new German tagger/chunker, courtesy of Schneider & Volk (1998) who kindly agreed to release their work in Pattern under BSD.
  • Module pattern.search: the search syntax now includes { } syntax to define match groups.
  • Module pattern.vector: fast implementation of information gain for feature selection.
  • Module pattern.graph: now includes a toy semantic network of commonsense (see examples).
  • Module canvas.js: image pixel effects & editor now supports live editing

Random Forests 5.1

by zenog - September 21, 2011, 14:14:17 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 2602 views, 465 downloads, 1 subscription

About: The original Random Forests implementation by Breiman and Cutler.

Changes:

Initial Announcement on mloss.org.


gWT graph indexing wavelet tree 1.0.0

by ytabei - May 12, 2011, 23:01:17 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 3081 views, 589 downloads, 1 subscription

About: Software for graph similarity search for massive graph databases

Changes:

Initial Announcement on mloss.org.


JMLR Surrogate Modeling Toolbox 7.0.2

by dgorissen - September 4, 2010, 07:48:59 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 11814 views, 3473 downloads, 1 subscription

About: The SUMO Toolbox is a Matlab toolbox that automatically builds accurate surrogate models (also known as metamodels or response surface models) of a given data source (e.g., simulation code, data set, script, ...) within the accuracy and time constraints set by the user. The toolbox minimizes the number of data points (which it selects automatically) since they are usually expensive.

Changes:

Incremental update, fixing some cosmetic issues, coincides with JMLR publication.


HSSVM 1.0.1

by xjbean - June 8, 2010, 16:16:05 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 9271 views, 1849 downloads, 1 subscription

Rating: 5 out of 5 stars (based on 1 vote)

About: HSSVM is software for solving multi-class problems using the Hyper-sphere Support Vector Machine model, implemented in Java.

Changes:
  1. Starting with this version, the version number is normalized to hssvm1.0.1.
  2. This version removes the parameter-search and run-all features from the Ant script: the commands "ant search-param" and "ant run-all" from previous versions are no longer available and are replaced by "svm search conf" and "svm runall conf". Both new commands run on Linux (or any other POSIX system). To use the program on Windows, Cygwin must be installed.

Bilingual Text Classification 0.1

by jorcisai - April 9, 2010, 15:13:08 CET [ BibTeX BibTeX for corresponding Paper Download ] 2514 views, 914 downloads, 1 subscription

About: This software package implements a series of statistical mixture models for bilingual text classification trained by the EM algorithm.

Changes:

Initial Announcement on mloss.org.


Universal Java Matrix Package 0.2.5

by arndt - February 9, 2010, 15:55:23 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 10148 views, 1826 downloads, 1 subscription

About: The Universal Java Matrix Package (UJMP) is a data processing tool for Java. Unlike JAMA and Colt, it supports multi-threading and is therefore much faster on current hardware. It supports not only matrices with double values but handles every type of data as a matrix through a common interface, e.g. CSV files, Excel files, images, WAVE audio files, tables in SQL databases, and much more.

Changes:

Metadata updated.