Project details for Apache Mahout

Logo Apache Mahout 0.2

by gsingers - January 3, 2010, 22:33:30 CET [ Project Homepage BibTeX Download ]

view ( today), download ( today ), 0 subscriptions

Description:

Apache Mahout is an Apache Software Foundation project with the goal of creating both a community of users and a scalable, Java-based framework consisting of many machine learning algorithm implementations. The project currently has map-reduce enabled (via Apache Hadoop) implementations of several clustering algorithms (k-Means, Mean-Shift, Fuzzy k-Means, Dirichlet, Canopy), Naïve Bayes and Complementary Naïve Bayes classifiers, Latent Dirichlet Allocation, Frequent Patternset Mining, Random Decision Forests, collaborative filtering, as well as support for distributed evolutionary computing. We are also planning implementations of neural nets, expectation maximization, hierarchical clustering, Support Vector Machines, regression techniques, and Principal Component Analysis, amongst others.

Changes to previous version:

Focus on performance and cleanup of APIs on the way to a 1.0 release. Added several new algorithms (LDA, Frequent Patternset Mining, Random Decision Forests). See 0.2 release announcement: http://lucene.apache.org/mahout/index.html#17+Nov.+2009+-+Apache+Mahout+0.2+released

BibTeX Entry: Download
Supported Operating Systems: Agnostic
Data Formats: Arff, Lucene, Mahout Vector
Tags: Lda, R, Classification, Clustering, Machine Learning, K Nearest Neighbor Classification, Genetic Algorithms, Mapreduce, Collaborative Filtering, Frequent Patternset Mining, Latent Dirichlet Allocation
Archive: download here

Comments

No one has posted any comments yet. Perhaps you'd like to be the first?

Leave a comment

You must be logged in to post comments.