Projects supporting the txt data format.
Showing Items 1-20 of 25 on page 1 of 2: 1 2 Next

Logo Harry 0.4.2

by konrad - April 16, 2016, 10:50:38 CET [ Project Homepage BibTeX Download ] 7878 views, 1724 downloads, 3 subscriptions

About: A Tool for Measuring String Similarity

Changes:

This release fixes the incorrect implementation of the bag distance.


About: TBEEF, a doubly ensemble framework for recommendation and prediction problems.

Changes:

Included the final technical report.


Logo JMLR MLPACK 2.0.1

by rcurtin - March 3, 2016, 18:52:03 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 56916 views, 10543 downloads, 6 subscriptions

Rating Whole StarWhole StarWhole StarWhole Star1/2 Star
(based on 1 vote)

About: A scalable, fast C++ machine learning library, with emphasis on usability.

Changes:
  • Fix CMake to properly detect when MKL is being used with Armadillo.
  • Minor parameter handling fixes to mlpack_logistic_regression.
  • Properly install arma_config.hpp.
  • Memory handling fixes for Hoeffding tree code.
  • Add functions that allow changing training-time parameters to HoeffdingTree class.
  • Fix infinite loop in sparse coding test.
  • Documentation spelling fixes.
  • Properly handle covariances for Gaussians with large condition number, preventing GMMs from filling with NaNs during training (and also HMMs that use GMMs).
  • CMake fixes for finding LAPACK and BLAS as Armadillo dependencies when ATLAS is used.
  • CMake fix for projects using mlpack's CMake configuration from elsewhere.

Logo Salad 0.6.0

by chwress - December 1, 2015, 16:17:35 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 9894 views, 1871 downloads, 3 subscriptions

About: A Content Anomaly Detector based on n-Grams

Changes:

After a full year of development we proudly present you several new features, plenty of bug fixes and better performance :)

  • It now is possible to process data on bit granularity salad [train|inspect] --binary
  • Performance improvements while simultaneously preserving and further advancing readability of the source code.
  • Suppress the verbose output of Salad salad [train|predict] -q
  • Extend the (unit) testing framework to support test of the overall application and memchecks using valgrind.
  • Testing mode was renamed: salad dbg -> salad test
  • Allow to select either client or server-side data when processing network communication.
  • libfoodstoragebox A library encapsulating advanced data structures such as bloom filters.
  • Fixes for a critical bug when using group input and several minor issues.
  • An optionally compressed, text-based model file format salad train -F (txt|archive)
  • The default hashset ('simple2') makes use of djb2 hash
  • Flawless builds using gcc, mingw and clang

Logo Java Data Mining Package 0.3.0

by arndt - August 19, 2015, 15:44:46 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 1824 views, 354 downloads, 3 subscriptions

About: A Java library for machine learning and data analytics

Changes:

Initial Announcement on mloss.org.


Logo jLDADMM 1.0

by dqnguyen - August 19, 2015, 12:52:36 CET [ Project Homepage BibTeX Download ] 1261 views, 325 downloads, 2 subscriptions

About: The Java package jLDADMM is released to provide alternative choices for topic modeling on normal or short texts. It provides implementations of the Latent Dirichlet Allocation topic model and the one-topic-per-document Dirichlet Multinomial Mixture model (i.e. mixture of unigrams), using collapsed Gibbs sampling. In addition, jLDADMM supplies a document clustering evaluation to compare topic models.

Changes:

Initial Announcement on mloss.org.


Logo Universal Java Matrix Package 0.3.0

by arndt - July 31, 2015, 14:23:14 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 13401 views, 2548 downloads, 3 subscriptions

About: The Universal Java Matrix Package (UJMP) is a data processing tool for Java. Unlike JAMA and Colt, it supports multi-threading and is therefore much faster on current hardware. It does not only support matrices with double values, but instead handles every type of data as a matrix through a common interface, e.g. CSV files, Excel files, images, WAVE audio files, tables in SQL data bases, and much more.

Changes:

Updated to version 0.3.0


Logo JMLR Sally 1.0.0

by konrad - March 26, 2015, 17:01:35 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 36735 views, 7133 downloads, 3 subscriptions

About: A Tool for Embedding Strings in Vector Spaces

Changes:

Support for explicit selection of granularity added. Several minor bug fixes. We have reached 1.0


Logo Machine Learning Support System MALSS 0.5.0

by canard0328 - February 20, 2015, 15:56:02 CET [ Project Homepage BibTeX Download ] 1505 views, 422 downloads, 1 subscription

About: MALSS is a python module to facilitate machine learning tasks.

Changes:

Initial Announcement on mloss.org.


About: Learns dynamic network changes across conditions and visualize the results in Cytoscape.

Changes:

Initial Announcement on mloss.org.


About: a parallel LDA learning toolbox in Multi-Core Systems for big topic modeling.

Changes:

Initial Announcement on mloss.org.


Logo DCABags 0.7

by wbuntine - June 5, 2014, 05:34:44 CET [ Project Homepage BibTeX Download ] 5175 views, 1144 downloads, 4 subscriptions

About: Document/Text preprocessing for topic models: suite of Perl scripts for preprocessing text collections to create dictionaries and bag/list files for use by topic modelling software.

Changes:

Moved distribution and code across to GitHub. Changed "ldac" format to have 0 offset for word indices. Added "document frequency" (df) filtering on selection of tokens for linkTables. Playing with linkParse but its still unuseable generally.


Logo A Parallel LDA Learning Toolbox 1.0

by yanjianfeng - January 24, 2014, 11:48:07 CET [ BibTeX Download ] 2486 views, 955 downloads, 1 subscription

About: We introduces PLL, a parallel LDA learning toolbox for big topic modeling.

Changes:

Fix some compiling errors.


Logo Malheur 0.5.4

by konrad - December 25, 2013, 13:20:31 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 17820 views, 3416 downloads, 1 subscription

About: Automatic Analysis of Malware Behavior using Machine Learning

Changes:

Support for new version of libarchive. Minor bug fixes.


Logo JMLR CAM Java 3.1

by wangny - October 14, 2013, 22:46:03 CET [ Project Homepage BibTeX Download ] 10625 views, 4248 downloads, 1 subscription

About: The CAM R-Java software provides a noval way to solve blind source separation problem.

Changes:

In this version, we fix the problem of not working under newest R version R-3.0.


About: A fast and robust learning of Bayesian networks

Changes:

Initial Announcement on mloss.org.


Logo fastICA 0.1

by maitra - February 28, 2013, 06:30:20 CET [ Project Homepage BibTeX Download ] 3103 views, 758 downloads, 1 subscription

About: The open-source C-package fastICA implements the fastICA algorithm of Aapo Hyvarinen et al. (URL: http://www.cs.helsinki.fi/u/ahyvarin/) to perform Independent Component Analysis (ICA) and Projection Pursuit. fastICA is released under the GNU Public License (GPL).

Changes:

Initial Announcement on mloss.org.


Logo Neural network designer 1.1.1

by bragi - December 28, 2012, 11:38:10 CET [ Project Homepage BibTeX Download ] 5457 views, 1296 downloads, 1 subscription

About: a dbms for resonating neural networks. Create and use different types of machine learning algorithms.

Changes:

AIML compatible (AIML files can be imported); new 'Grid channel' for developing board games; improved topics editor; new demo project: ALice (from AIML); lots of bug-fixes and speed improvements


Logo Reranker Framework 1.0

by zenog - October 29, 2012, 10:05:30 CET [ Project Homepage BibTeX Download ] 2179 views, 685 downloads, 1 subscription

About: ReFr is a software architecture for specifying, training and using reranking models.

Changes:

Initial Announcement on mloss.org.


Logo Pattern 2.4

by tomdesmedt - August 31, 2012, 02:26:01 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 9974 views, 2902 downloads, 1 subscription

About: "Pattern" is a web mining module for Python. It bundles tools for data retrieval, text analysis, clustering and classification, and data visualization.

Changes:
  • Small bug fixes in overall + performance improvements.
  • Module pattern.web: updated to the new Bing API (Bing API has is paid service now).
  • Module pattern.en: now includes Norvig's spell checking algorithm.
  • Module pattern.de: new German tagger/chunker, courtesy of Schneider & Volk (1998) who kindly agreed to release their work in Pattern under BSD.
  • Module pattern.search: the search syntax now includes { } syntax to define match groups.
  • Module pattern.vector: fast implementation of information gain for feature selection.
  • Module pattern.graph: now includes a toy semantic network of commonsense (see examples).
  • Module canvas.js: image pixel effects & editor now supports live editing

Showing Items 1-20 of 25 on page 1 of 2: 1 2 Next