All entries.
Showing Items 91-100 of 552 on page 10 of 56

ELKI 0.6.0

by erich - January 10, 2014, 18:32:28 CET - 11108 views, 1999 downloads, 3 subscriptions

About: ELKI is a framework for implementing data-mining algorithms with support for index structures, that includes a wide variety of clustering and outlier detection methods.

Changes:

Additions and Improvements from ELKI 0.5.5:

Algorithms

Clustering:

  • Hierarchical Clustering - the slower naive variants were added, and the code was refactored
  • Partition extraction from hierarchical clusterings - different linkage strategies (e.g. Ward)
  • Canopy pre-Clustering
  • Naive Mean-Shift Clustering
  • Affinity propagation clustering (both with distances and similarities / kernel functions)
  • K-means variations: Best-of-multiple-runs, bisecting k-means
  • New k-means initialization: farthest points, sample initialization
  • Cheng and Church Biclustering
  • P3C Subspace Clustering
  • One-dimensional clustering algorithm based on kernel density estimation
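The "farthest points" k-means initialization listed above can be illustrated with a minimal Python sketch. It is not ELKI's code: the function name `farthest_point_init`, the `seed` parameter, and the choice of a random first center are assumptions made for illustration. Each further center is the point whose distance to its nearest already-chosen center is largest.

```python
import math
import random


def farthest_point_init(points, k, seed=0):
    """Pick k initial centers by greedy farthest-point selection (illustrative)."""
    if not 1 <= k <= len(points):
        raise ValueError("k must satisfy 1 <= k <= len(points)")
    rng = random.Random(seed)
    # Assumption: the first center is drawn uniformly at random.
    centers = [points[rng.randrange(len(points))]]
    # Distance from every point to its nearest chosen center so far.
    min_dist = [math.dist(p, centers[0]) for p in points]
    while len(centers) < k:
        # The next center is the point farthest from all chosen centers.
        idx = max(range(len(points)), key=min_dist.__getitem__)
        centers.append(points[idx])
        min_dist = [min(d, math.dist(p, points[idx]))
                    for d, p in zip(min_dist, points)]
    return centers
```

This strategy spreads the initial centers out, but it is also drawn toward outliers, which is one reason to compare it against sample-based initialization.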

Outlier detection

  • COP - correlation outlier probabilities
  • LDF - a kernel density based LOF variant
  • Simplified LOF - a simpler version of LOF (not using reachability distance)
  • Simple Kernel Density LOF - a simple LOF using kernel density (more consistent than LDF)
  • Simple outlier ensemble algorithm
  • PINN - projection indexed nearest neighbors, via projected indexes.
  • ODIN - kNN graph based outlier detection
  • DWOF - Dynamic-Window Outlier Factor (contributed by Omar Yousry)
  • ABOD refactored, into ABOD, FastABOD and LBABOD
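As a rough illustration of the "Simplified LOF" entry above, the following Python sketch scores points with a LOF-style density ratio that uses plain distances instead of reachability distances. It is one reading of the one-line description, not ELKI's implementation: the function name `simplified_lof`, the brute-force neighbor search, and the density estimate (inverse mean distance to the k nearest neighbors) are assumptions.

```python
import math


def simplified_lof(points, k):
    """LOF-like scores without reachability distances (illustrative sketch)."""
    if not 1 <= k < len(points):
        raise ValueError("k must satisfy 1 <= k < len(points)")
    neighbors, density = [], []
    for i, p in enumerate(points):
        # Brute-force k nearest neighbors, excluding the point itself.
        nearest = sorted(
            (math.dist(p, q), j) for j, q in enumerate(points) if j != i
        )[:k]
        mean_dist = sum(d for d, _ in nearest) / k
        if mean_dist == 0:
            raise ValueError("duplicate points need special handling")
        neighbors.append([j for _, j in nearest])
        # Assumed density estimate: inverse of the mean neighbor distance.
        density.append(1.0 / mean_dist)
    # Score: mean neighbor density divided by the point's own density.
    return [
        sum(density[j] for j in neighbors[i]) / (k * density[i])
        for i in range(len(points))
    ]
```

Scores near 1 indicate a point about as dense as its neighbors; values well above 1 indicate a point in a sparser region than its neighbors.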

Distances

  • Geodetic distances now support different world models (WGS84 etc.) and are substantially faster.
  • Levenshtein distances for processing strings, e.g. for analyzing phonemes (contributed code, see "Word segmentation through cross-lingual word-to-phoneme alignment", SLT2013, Stahlberg et al.)
  • Bray-Curtis, Clark, Kulczynski1 and Lorentzian distances with R-tree indexing support
  • Histogram matching distances
  • Probabilistic divergence distances (Jeffrey, Jensen-Shannon, Chi2, Kullback-Leibler)
  • Kulczynski2 similarity
  • Kernel similarity code has been refactored, and additional kernel functions have been added
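For readers unfamiliar with the Levenshtein distance mentioned in the list above, here is a minimal Python sketch of the standard dynamic-programming formulation with unit costs. The function name `levenshtein` is chosen for illustration; this is not the contributed ELKI code.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance with unit costs for insertion, deletion, and substitution."""
    if len(a) < len(b):
        a, b = b, a  # keep the stored row as short as possible
    previous = list(range(len(b) + 1))  # distances from "" to prefixes of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # substitute, or match at no cost
            ))
        previous = current
    return previous[-1]
```

Only two rows of the table are kept at a time, so memory use is proportional to the shorter string.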

Database Layer and Data Types

  • Projection layer
  • Parser for simple textual data (for use with Levenshtein distance)
  • Various random projection families (including Feature Bagging, Achlioptas, and p-stable)
  • Latitude+Longitude to ECEF
  • Sparse vector improvements and bug fixes
  • New filter: remove NaN values and missing values
  • New filter: add histogram-based jitter
  • New filter: normalize using statistical distributions
  • New filter: robust standardization using Median and MAD
  • New filter: Linear discriminant analysis (LDA)
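The idea behind robust standardization using the median and MAD can be sketched in a few lines of Python. This is an illustration for a single attribute, not the ELKI filter: the function name `robust_standardize` and the consistency factor 1.4826 (which makes the MAD comparable to a standard deviation for normally distributed data) are assumptions.

```python
from statistics import median


def robust_standardize(values):
    """Center on the median and scale by the median absolute deviation (MAD)."""
    center = median(values)
    mad = median([abs(v - center) for v in values])
    if mad == 0:
        raise ValueError("MAD is zero; more than half the values are identical")
    # Assumption: rescale the MAD so it estimates the standard deviation
    # under a normal distribution.
    scale = 1.4826 * mad
    return [(v - center) / scale for v in values]
```

Unlike mean and standard deviation, both statistics are barely affected by a few extreme values, so a single outlier does not compress the rest of the data.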

Index Layer

  • Another speed up in R-trees
  • Refactoring of M- and R-trees:
      • Support for different strategies in M-tree
      • New strategies for M-tree splits
      • Speedups in M-tree
  • New index structure: in-memory k-d-tree
  • New index structure: in-memory Locality Sensitive Hashing (LSH)
  • New index structure: approximate projected indexes, such as PINN
  • Index support for geodetic data - (Details: Geodetic Distance Queries on R-Trees for Indexing Geographic Data, SSTD13)
  • Sampled k nearest neighbors: reference KDD13 "Subsampling for Efficient and Effective Unsupervised Outlier Detection Ensembles"
  • Cached (precomputed) k-nearest neighbors to share across multiple runs
  • Benchmarking "algorithms" for indexes

Mathematics and Statistics

  • Many new distributions have been added; 28 different distributions are now supported
  • Additional estimation methods (using advanced statistics such as L-Moments); 44 estimators are now available
  • Trimming and Winsorizing
  • Automatic best-fit distribution estimation
  • Preprocessor using these distributions for rescaling data sets
  • API changes related to the new distributions support
  • More kernel density functions
  • RANSAC covariance matrix builder (unfortunately rather slow)

Visualization

  • 3D projected coordinates (Details: Interactive Data Mining with 3D-Parallel-Coordinate-Trees, SIGMOD2013)
  • Convex hulls now also include nested hierarchical clusters

Other

  • Parser speedups
  • Sparse vector bug fixes and improvements
  • Various bug fixes
  • PCA, MDS and LDA filters
  • Text output was slightly improved (but still needs to be redesigned from scratch - please contribute!)
  • Refactoring of hierarchy classes
  • New heap classes and infrastructure enhancements
  • Classes can have aliases, e.g. "l2" for euclidean distance.
  • Some error messages were made more informative.
  • Benchmarking classes, also for approximate nearest neighbor search.

Torch 5 5.1

by andresy - October 1, 2008, 04:25:12 CET - 10968 views, 1574 downloads, 2 subscriptions

Rating: 4 of 5 stars (based on 1 vote)

About: Torch5 provides a MATLAB-like environment for state-of-the-art machine learning algorithms. It is easy to use and provides a very efficient implementation, thanks to an easy and fast scripting [...]

Changes:

Initial Announcement on mloss.org.


MALLET 2.0-rc4

by jacktanner - August 24, 2009, 23:10:14 CET - 10943 views, 1747 downloads, 1 subscription

About: MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to [...]

Changes:

MALLET 2.0 RC4 Release Notes July 16, 2009

Major updates:

An implementation of generalized expectation criteria training of MaxEnt classifiers and methods for obtaining constraints (cf. Gregory Druck, Gideon Mann, Andrew McCallum, "Learning from Labeled Features using Generalized Expectation Criteria").

PagedInstanceList has been substantially rewritten by Mike Bond.

Bug fixes to topic model hyperparameter optimization and topic inference.


BioSig for Octave and Matlab 2.31

by schloegl - July 28, 2009, 13:41:01 CET - 10873 views, 1807 downloads, 0 subscriptions

About: BioSig is a software library for biomedical signal processing. Besides several other modules, one module (t400) provides a common interface (train_sc.m and test_sc.m) to various classification [...]

Changes:

Update of project information: machine learning and classification tools are moved to the NaN-toolbox.


LIBSVM 2.9

by cjlin - February 27, 2010, 01:09:23 CET - 10793 views, 2207 downloads, 1 subscription

Rating: 4.5 of 5 stars (based on 7 votes)

About: LIBSVM is integrated software for support vector classification (C-SVC, nu-SVC), regression (epsilon-SVR, nu-SVR) and distribution estimation (one-class SVM). It supports multi-class [...]

Changes:

Initial Announcement on mloss.org.


r-cran-rgp 0.2-4

by r-cran-robot - April 1, 2013, 00:00:08 CET - 10727 views, 1787 downloads, 0 subscriptions

About: R genetic programming framework

Changes:

Fetched by r-cran-robot on 2013-04-01 00:00:08.163887


SGD 2.0

by leonbottou - October 11, 2011, 20:59:41 CET - 10469 views, 1670 downloads, 5 subscriptions

Rating: 4 of 5 stars (based on 2 votes)

About: The SGD-2.0 package contains implementations of the SGD and ASGD algorithms for linear SVMs and linear CRFs.

Changes:

Version 2.0 features ASGD.


Libra 1.0.1

by lowd - March 30, 2014, 09:42:00 CET - 10457 views, 2288 downloads, 1 subscription

About: The Libra Toolkit is a collection of algorithms for learning and inference with discrete probabilistic models, including Bayesian networks, Markov networks, dependency networks, sum-product networks, arithmetic circuits, and mixtures of trees.

Changes:

Version 1.0.1 (3/30/2014):

  • Several new algorithms -- acmn, learning ACs using MNs; idspn, SPN structure learning; mtlearn, learning mixtures of trees
  • Several new support programs -- spquery, for exact inference in SPNs; spn2ac, for converting SPNs to ACs
  • Renamed aclearnstruct to acbn
  • Replaced aclearnstruct -noac with separate bnlearn program
  • ...and many more small changes and fixes, throughout!

r-cran-ipred 0.9-1

by r-cran-robot - November 14, 2012, 00:00:00 CET - 10373 views, 2462 downloads, 1 subscription

About: Improved Predictors

Changes:

Fetched by r-cran-robot on 2013-04-01 00:00:05.613011


Universal Java Matrix Package 0.2.5

by arndt - February 9, 2010, 15:55:23 CET - 10286 views, 1906 downloads, 1 subscription

About: The Universal Java Matrix Package (UJMP) is a data processing tool for Java. Unlike JAMA and Colt, it supports multi-threading and is therefore much faster on current hardware. It supports not only matrices with double values but handles every type of data as a matrix through a common interface, e.g. CSV files, Excel files, images, WAVE audio files, tables in SQL databases, and much more.

Changes:

Meta data updated.

