Projects that are tagged with big data.

OpenNN 2.2

by Sergiointelnics - August 10, 2015, 16:56:57 CET, 3861 views, 689 downloads, 3 subscriptions

About: OpenNN is an open-source class library written in the C++ programming language that implements neural networks, a main area of deep learning research. The library has been designed to learn from both data sets and mathematical models.


New algorithms and bug fixes.

JMLR BudgetedSVM v1.1

by nemanja - February 12, 2014, 20:53:45 CET, 3000 views, 628 downloads, 1 subscription

About: BudgetedSVM is an open-source C++ toolbox for scalable non-linear classification. The toolbox can be seen as a missing link between LibLinear and LibSVM, combining the efficiency of linear SVM with the accuracy of kernel SVM. We provide an Application Programming Interface for efficient training and testing of non-linear classifiers, supported by data structures designed for handling data that cannot fit in memory. We also provide command-line and Matlab interfaces, giving users an efficient, easy-to-use tool for large-scale non-linear classification.


Changed license from LGPL v3 to Modified BSD.

MLlib 0.8

by atalwalkar - October 10, 2013, 00:56:25 CET, 2938 views, 562 downloads, 1 subscription

About: MLlib provides a distributed machine learning (ML) library to address the growing need for scalable ML. MLlib is developed in Spark, a cluster computing system designed for iterative computation. Moreover, it is a component of a larger system called MLbase that aims to provide user-friendly distributed ML functionality for both ML researchers and domain experts. MLlib currently consists of scalable implementations of algorithms for classification, regression, collaborative filtering, and clustering.


Initial Announcement on

ClowdFlows 0.9

by janezkranjc - October 8, 2013, 02:57:49 CET, 2913 views, 583 downloads, 1 subscription

About: ClowdFlows is a publicly available, web-based platform for service-oriented data mining. A web-based interface allows users to construct data mining workflows that are hosted on the web and, if the author allows it, can be accessed by anyone who follows the workflow's URL.


Initial Announcement on